NOVEL NUCLEIC ACIDS AND POLYPEPTIDES

1. BACKGROUND OF THE INVENTION 

1.1 TECHNICAL FIELD 

The present invention provides novel polynucleotides and proteins encoded by such 
polynucleotides, along with uses for these polynucleotides and proteins, for example in 
therapeutic, diagnostic and research methods. 

1.2 BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, such 
as lymphokines, interferons, CSFs, chemokines, and interleukins) has matured rapidly over 
the past decade. The now routine hybridization cloning and expression cloning techniques 
clone novel polynucleotides "directly" in the sense that they rely on information directly 
related to the discovered protein (i.e., partial DNA/amino acid sequence of the protein in the 
case of hybridization cloning; activity of the protein in the case of expression cloning). More 
recent "indirect" cloning techniques such as signal sequence cloning, which isolates DNA 
sequences based on the presence of a now well-recognized secretory leader sequence motif, 
as well as various PCR-based or low stringency hybridization-based cloning techniques, have 
advanced the state of the art by making available large numbers of DNA/amino acid 
sequences for proteins that are known to have biological activity, for example, by virtue of 
their secreted nature in the case of leader sequence cloning, by virtue of their cell or tissue 
source in the case of PCR-based techniques, or by virtue of structural similarity to other 
genes of known biological activity. 

Identified polynucleotide and polypeptide sequences have numerous applications in,
for example, diagnostics, forensics, gene mapping, identification of mutations responsible for
genetic disorders or other traits, assessment of biodiversity, and production of many other
types of data and products dependent on DNA and amino acid sequences.

2. SUMMARY OF THE INVENTION 

The compositions of the present invention include novel isolated polypeptides, novel
isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules,
cloned genes or degenerate variants thereof, especially naturally occurring variants such as
allelic variants, antisense polynucleotide molecules, and antibodies that specifically recognize
one or more epitopes present on such polypeptides, as well as hybridomas producing such
antibodies.

The compositions of the present invention additionally include vectors, including 
expression vectors, containing the polynucleotides of the invention, cells genetically engineered
to contain such polynucleotides and cells genetically engineered to express such 
polynucleotides. 

The present invention relates to a collection or library of at least one novel nucleic acid
sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by
hybridization (SBH), and in some cases, sequences obtained from one or more public databases.
The invention relates also to the proteins encoded by such polynucleotides, along with
therapeutic, diagnostic and research utilities for these polynucleotides and proteins. These
nucleic acid sequences are designated as SEQ ID NO: 1-948. The polypeptide sequences are
designated SEQ ID NO: 949-1896. The nucleic acids and polypeptides are provided in the
Sequence Listing. In the nucleic acids provided in the Sequence Listing, A is adenine; C is
cytosine; G is guanine; T is thymine; and N is any of the four bases. In the amino acids provided
in the Sequence Listing, * corresponds to the stop codon.

The nucleic acid sequences of the present invention also include nucleic acid sequences
that hybridize to the complement of SEQ ID NO: 1-948 under stringent hybridization
conditions; nucleic acid sequences which are allelic variants or species homologues of any of the
nucleic acid sequences recited above; or nucleic acid sequences that encode a peptide
comprising a specific domain or truncation of the peptides encoded by SEQ ID NO: 1-948. Also
included is a polynucleotide comprising a nucleotide sequence having at least 90% identity to an
identifying sequence of SEQ ID NO: 1-948 or a degenerate variant or fragment thereof. The
identifying sequence can be 100 base pairs in length.

The nucleic acid sequences of the present invention also include the sequence 
information from the nucleic acid sequences of SEQ ID NO: 1 - 948. The sequence information 
can be a segment of any one of SEQ ID NO: 1 - 948 that uniquely identifies or represents the 
sequence information of SEQ ID NO: 1 - 948. 

A collection as used in this application can be a collection of only one polynucleotide.
The collection of sequence information or identifying information of each sequence can be
provided on a nucleic acid array. In one embodiment, segments of sequence information are
provided on a nucleic acid array to detect the polynucleotide that contains the segment. The
array can be designed to detect full-match or mismatch to the polynucleotide that contains the
segment. The collection can also be provided in a computer-readable format.

This invention also includes the reverse or direct complement of any of the nucleic acid
sequences recited above; cloning or expression vectors containing the nucleic acid sequences;
and host cells or organisms transformed with these expression vectors. Nucleic acid sequences
(or their reverse or direct complements) according to the invention have numerous applications
in a variety of techniques known to those skilled in the art of molecular biology, such as use as
hybridization probes, use as primers for PCR, use in an array, use in computer-readable media,
use in sequencing full-length genes, use for chromosome and gene mapping, use in the
recombinant production of protein, and use in the generation of anti-sense DNA or RNA, their
chemical analogs and the like.

In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1-948 or novel 
segments or parts of the nucleic acids of the invention are used as primers in expression assays 
that are well known in the art. In a particularly preferred embodiment, the nucleic acid
sequences of SEQ ID NO: 1-948 or novel segments or parts of the nucleic acids provided herein
are used in diagnostics for identifying expressed genes or, as well known in the art and 
exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for 
physical mapping of the human genome. 

The isolated polynucleotides of the invention include, but are not limited to, a
polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 949-1896;
a polynucleotide comprising any of the full length protein coding sequences of SEQ ID
NO: 1-948; and a polynucleotide comprising any of the nucleotide sequences of the mature
protein coding sequences of SEQ ID NO: 1-948. The polynucleotides of the present invention
also include, but are not limited to, a polynucleotide that hybridizes under stringent hybridization
conditions to (a) the complement of any one of the nucleotide sequences set forth in SEQ ID
NO: 1-948; (b) a nucleotide sequence encoding any one of the amino acid sequences set forth in
the Sequence Listing; (c) a polynucleotide which is an allelic variant of any polynucleotides
recited above; (d) a polynucleotide which encodes a species homolog (e.g., orthologs) of any of
the proteins recited above; or (e) a polynucleotide that encodes a polypeptide comprising a
specific domain or truncation of any of the polypeptides comprising an amino acid sequence set
forth in the Sequence Listing.

The isolated polypeptides of the invention include, but are not limited to, a polypeptide
comprising any of the amino acid sequences set forth in the Sequence Listing; or the
corresponding full length or mature protein. Polypeptides of the invention also include
polypeptides with biological activity that are encoded by (a) any of the polynucleotides having a
nucleotide sequence set forth in SEQ ID NO: 1-948; or (b) polynucleotides that hybridize to the
complement of the polynucleotides of (a) under stringent hybridization conditions. Biologically
or immunologically active variants of any of the polypeptide sequences in the Sequence Listing,
and "substantial equivalents" thereof (e.g., with at least about 65%, 70%, 75%, 80%, 85%,
90%, 95%, 98% or 99% amino acid sequence identity) that preferably retain biological activity
are also contemplated. The polypeptides of the invention may be wholly or partially chemically
synthesized but are preferably produced by recombinant means using the genetically engineered
cells (e.g., host cells) of the invention.

The invention also provides compositions comprising a polypeptide of the invention. 
Polypeptide compositions of the invention may further comprise an acceptable carrier, such 
as a hydrophilic, e.g., pharmaceutically acceptable, carrier. 

The invention also provides host cells transformed or transfected with a 
polynucleotide of the invention. 

The invention also relates to methods for producing a polypeptide of the invention
comprising growing a culture of the host cells of the invention in a suitable culture medium 
under conditions permitting expression of the desired polypeptide, and purifying the 
polypeptide from the culture or from the host cells. Preferred embodiments include those in 
which the protein produced by such process is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a variety of 
techniques known to those skilled in the art of molecular biology. These techniques include 
use as hybridization probes, use as oligomers, or primers, for PCR, use for chromosome and 
gene mapping, use in the recombinant production of protein, and use in generation of 
anti-sense DNA or RNA, their chemical analogs and the like. For example, when the 
expression of an mRNA is largely restricted to a particular cell or tissue type, polynucleotides 
of the invention can be used as hybridization probes to detect the presence of the particular 
cell or tissue mRNA in a sample using, e.g., in situ hybridization.

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
expressed sequence tags for identifying expressed genes or, as well known in the art and 
exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for 
physical mapping of the human genome. 

The polypeptides according to the invention can be used in a variety of conventional
procedures and methods that are currently applied to other proteins. For example, a
polypeptide of the invention can be used to generate an antibody that specifically binds the
polypeptide. Such antibodies, particularly monoclonal antibodies, are useful for detecting or
quantitating the polypeptide in tissue. The polypeptides of the invention can also be used as
molecular weight markers, and as a food supplement.

Methods are also provided for preventing, treating, or ameliorating a medical
condition, which comprise the step of administering to a mammalian subject a therapeutically
effective amount of a composition comprising a polypeptide of the present invention and a
pharmaceutically acceptable carrier.

In particular, the polypeptides and polynucleotides of the invention can be utilized, for 
example, in methods for the prevention and/or treatment of disorders involving aberrant
protein expression or biological activity.

The present invention further relates to methods for detecting the presence of the 
polynucleotides or polypeptides of the invention in a sample. Such methods can, for 
example, be utilized as part of prognostic and diagnostic evaluation of disorders as recited 
herein and for the identification of subjects exhibiting a predisposition to such conditions. 

The invention provides a method for detecting the polynucleotides of the invention in a
sample, comprising contacting the sample with a compound that binds to and forms a
complex with the polynucleotide of interest, under conditions and for a period sufficient to
form the complex, and detecting the complex such that if a complex is detected, the
polynucleotide of interest is detected. The invention also provides a
method for detecting the polypeptides of the invention in a sample comprising contacting the
sample with a compound that binds to and forms a complex with the polypeptide under
conditions and for a period sufficient to form the complex and detecting the formation of the
complex such that if a complex is formed, the polypeptide is detected.

The invention also provides kits comprising polynucleotide probes and/or monoclonal
antibodies, and optionally quantitative standards, for carrying out methods of the invention.
Furthermore, the invention provides methods for evaluating the efficacy of drugs, and 
monitoring the progress of patients, involved in clinical trials for the treatment of disorders as 
recited above. 

The invention also provides methods for the identification of compounds that
modulate (i.e., increase or decrease) the expression or activity of the polynucleotides and/or
polypeptides of the invention. Such methods can be utilized, for example, for the
identification of compounds that can ameliorate symptoms of disorders as recited herein.
Such methods can include, but are not limited to, assays for identifying compounds and other
substances that interact with (e.g., bind to) the polypeptides of the invention. The invention
provides a method for identifying a compound that binds to the polypeptides of the invention
comprising contacting the compound with a polypeptide of the invention in a cell for a time
sufficient to form a polypeptide/compound complex, wherein the complex drives expression
of a reporter gene sequence in the cell; and detecting the complex by detecting the reporter
gene sequence expression such that if expression of the reporter gene is detected the
compound that binds to a polypeptide of the invention is identified.

The invention also provides methods for treatment which involve the
administration of the polynucleotides or polypeptides of the invention to individuals
exhibiting symptoms or tendencies. In addition, the invention encompasses methods for
treating diseases or disorders as recited herein comprising administering compounds and
other substances that modulate the overall activity of the target gene products. Compounds
and other substances can effect such modulation either on the level of target gene/protein
expression or target protein activity.

The polypeptides of the present invention and the polynucleotides encoding them are
also useful for the same functions known to one of skill in the art as the polypeptides and
polynucleotides to which they have homology (set forth in Table 2); for which they have a
signature region (as set forth in Table 3); or for which they have homology to a gene family
(as set forth in Table 4). If no homology is set forth for a sequence, then the polypeptides and
polynucleotides of the present invention are useful for a variety of applications, as described
herein, including use in arrays for detection.

3. DETAILED DESCRIPTION OF THE INVENTION

3.1 DEFINITIONS

It must be noted that as used herein and in the appended claims, the singular forms
"a", "an" and "the" include plural references unless the context clearly dictates otherwise.

The term "active" refers to those forms of the polypeptide which retain the biologic
and/or immunologic activities of any naturally occurring polypeptide. According to the
invention, the terms "biologically active" or "biological activity" refer to a protein or peptide
having structural, regulatory or biochemical functions of a naturally occurring molecule.
Likewise "immunologically active" or "immunological activity" refers to the capability of the
natural, recombinant or synthetic polypeptide to induce a specific immune response in
appropriate animals or cells and to bind with specific antibodies.

The term "activated cells" as used in this application refers to those cells which are
engaged in extracellular or intracellular membrane trafficking, including the export of
secretory or enzymatic molecules as part of a normal or disease process.

The terms "complementary" or "complementarity" refer to the natural binding of
polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the
complementary sequence 3'-TCA-5'. Complementarity between two single-stranded
molecules may be "partial" such that only some of the nucleic acids bind or it may be
"complete" such that total complementarity exists between the single stranded molecules.
The degree of complementarity between the nucleic acid strands has significant effects on the
efficiency and strength of the hybridization between the nucleic acid strands.
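
The definition of complementarity above can be illustrated with a short Python sketch. It is
only an illustration under stated assumptions: it handles the unambiguous bases A, C, G and T,
and the function names are hypothetical and do not appear in the specification.

```python
# Watson-Crick pairing for the unambiguous DNA bases (assumption: no N or U).
PAIRS = {"A": "T", "T": "A", "C": "G", "G": "C"}


def complement(strand: str) -> str:
    """Base-by-base complement; if strand is written 5'->3', the result reads 3'->5'."""
    return "".join(PAIRS[base] for base in strand.upper())


def reverse_complement(strand: str) -> str:
    """Complementary strand rewritten in the conventional 5'->3' direction."""
    return complement(strand)[::-1]


def complementarity(strand_5to3: str, partner_3to5: str) -> float:
    """Fraction of aligned positions that base-pair (1.0 = complete, below 1.0 = partial)."""
    if not strand_5to3 or len(strand_5to3) != len(partner_3to5):
        raise ValueError("strands must be non-empty and of equal length")
    paired = sum(
        PAIRS.get(a) == b
        for a, b in zip(strand_5to3.upper(), partner_3to5.upper())
    )
    return paired / len(strand_5to3)
```

With these definitions, complement("AGT") returns "TCA", matching the example in the text, and
complementarity("AGT", "TCC") is below 1.0, which corresponds to "partial" complementarity.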

The term "embryonic stem cells (ES)" refers to cells that can give rise to many
differentiated cell types in an embryo or an adult, including the germ cells. The term "germ
line stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a
steady and continuous source of germ cells for the production of gametes. The term
"primordial germ cells (PGCs)" refers to a small population of cells set aside from other cell
lineages particularly from the yolk sac, mesenteries, or gonadal ridges during embryogenesis
that have the potential to differentiate into germ cells and other cells. PGCs are the source
from which GSCs and ES cells are derived. The PGCs, the GSCs and the ES cells are capable
of self-renewal. Thus these cells not only populate the germ line and give rise to a plurality
of terminally differentiated cells that comprise the adult specialized organs, but are able to
regenerate themselves.

The term "expression modulating fragment," EMF, means a series of nucleotides 
which modulates the expression of an operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked
sequence" when the expression of the sequence is altered by the presence of the EMF. EMFs
include, but are not limited to, promoters, and promoter modulating sequences (inducible
elements). One class of EMFs are nucleic acid fragments which induce the expression of an
operably linked ORF in response to a specific regulatory factor or physiological event.

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or
"oligonucleotide" are used interchangeably and refer to a heteropolymer of nucleotides or the
sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or
synthetic origin which may be single-stranded or double-stranded and may represent the
sense or the antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like
material. In the sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and
N is A, C, G or T (U). It is contemplated that where the polynucleotide is RNA, the T
(thymine) in the sequences provided herein is substituted with U (uracil). Generally, nucleic
acid segments provided by this invention may be assembled from fragments of the genome
and short oligonucleotide linkers, or from a series of oligonucleotides, or from individual
nucleotides, to provide a synthetic nucleic acid which is capable of being expressed in a
recombinant transcriptional unit comprising regulatory elements derived from a microbial or
viral operon, or a eukaryotic gene.

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or
"segment" or "probe" or "primer" are used interchangeably and refer to a sequence of
nucleotide residues which are at least about 5 nucleotides, more preferably at least about 7
nucleotides, more preferably at least about 9 nucleotides, more preferably at least about 11
nucleotides and most preferably at least about 17 nucleotides. The fragment is preferably less
than about 500 nucleotides, preferably less than about 200 nucleotides, more preferably less
than about 100 nucleotides, more preferably less than about 50 nucleotides and most
preferably less than 30 nucleotides. Preferably the probe is from about 6 nucleotides to about
200 nucleotides, preferably from about 15 to about 50 nucleotides, more preferably from
about 17 to 30 nucleotides and most preferably from about 20 to 25 nucleotides. Preferably
the fragments can be used in polymerase chain reaction (PCR), various hybridization
procedures or microarray procedures to identify or amplify identical or related parts of
mRNA or DNA molecules. A fragment or segment may uniquely identify each
polynucleotide sequence of the present invention. Preferably the fragment comprises a
sequence substantially similar to any one of SEQ ID NOs: 1-948.

Probes may, for example, be used to determine whether specific mRNA molecules are
present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA
as described by Walsh et al. (Walsh, P.S. et al., 1992, PCR Methods Appl 1:241-250). They
may be labeled by nick translation, Klenow fill-in reaction, PCR, or other methods well
known in the art. Probes of the present invention, their preparation and/or labeling are
elaborated in Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual, Cold
Spring Harbor Laboratory, NY; or Ausubel, F.M. et al., 1989, Current Protocols in Molecular
Biology, John Wiley & Sons, New York NY, both of which are incorporated herein by
reference in their entirety.

The nucleic acid sequences of the present invention also include the sequence
information from the nucleic acid sequences of SEQ ID NOs: 1-948. The sequence
information can be a segment of any one of SEQ ID NOs: 1-948 that uniquely identifies or
represents the sequence information of that sequence of SEQ ID NO: 1-948. One such
segment can be a twenty-mer nucleic acid sequence because the probability that a twenty-mer
is fully matched in the human genome is 1 in 300. In the human genome, there are three
billion base pairs in one set of chromosomes. Because 4^20 possible twenty-mers exist, there
are 300 times more twenty-mers than there are base pairs in a set of human chromosomes.
Using the same analysis, the probability for a seventeen-mer to be fully matched in the
human genome is approximately 1 in 5. When these segments are used in arrays for
expression studies, fifteen-mer segments can be used. The probability that the fifteen-mer is
fully matched in the expressed sequences is also approximately one in five because expressed
sequences comprise less than approximately 5% of the entire genome sequence.
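
The arithmetic behind these full-match estimates can be reproduced with the Python sketch
below. It is an illustration only and assumes what the estimate itself implies: the four bases
are equally likely and independent at every position. The genome size and the 5% expressed
fraction are taken from the text; the function name is hypothetical.

```python
GENOME_BP = 3_000_000_000   # base pairs in one set of human chromosomes (from the text)
EXPRESSED_FRACTION = 0.05   # expressed sequences as a fraction of the genome (from the text)


def kmers_per_site(k: int, target_bp: float = GENOME_BP) -> float:
    """Number of possible k-mers (4**k) divided by the number of positions in the target.

    A value of x corresponds to a chance full match of roughly "1 in x".
    Assumption: equally likely, independent bases.
    """
    return 4 ** k / target_bp


twenty_mer_genome = kmers_per_site(20)
seventeen_mer_genome = kmers_per_site(17)
fifteen_mer_expressed = kmers_per_site(15, GENOME_BP * EXPRESSED_FRACTION)
```

Under these assumptions the three ratios come out at about 366, 5.7 and 7.2, which the text
rounds to 1 in 300, 1 in 5 and approximately one in five.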

Similarly, when using sequence information for detecting a single mismatch, a segment
can be a twenty-five mer. The probability that the twenty-five mer would appear in a human
genome with a single mismatch is calculated by multiplying the probability for a full match
(1/4^25) times the increased probability for mismatch at each nucleotide position (3 x 25). The
probability that an eighteen mer with a single mismatch can be detected in an array for
expression studies is approximately one in five. The probability that a twenty-mer with a single
mismatch can be detected in a human genome is approximately one in five.
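
The single-mismatch calculation can be sketched in Python as follows. This is a hedged
illustration: it assumes equally likely, independent bases, takes the genome size and the 5%
expressed fraction from the surrounding text, and uses hypothetical function names.

```python
GENOME_BP = 3_000_000_000        # base pairs in one set of human chromosomes
EXPRESSED_BP = 0.05 * GENOME_BP  # expressed sequences, taken as 5% of the genome


def single_mismatch_probability(k: int) -> float:
    """Per-site probability of matching a k-mer with exactly one mismatch.

    Full-match probability (1 / 4**k) multiplied by the number of
    single-mismatch variants (3 alternative bases at each of k positions).
    """
    return (3 * k) / 4 ** k


def expected_chance_hits(k: int, target_bp: float) -> float:
    """Expected number of chance single-mismatch hits across target_bp positions."""
    return single_mismatch_probability(k) * target_bp


twenty_mer_genome = expected_chance_hits(20, GENOME_BP)
eighteen_mer_expressed = expected_chance_hits(18, EXPRESSED_BP)
```

Under these assumptions the twenty-mer gives about 0.16 expected hits in the genome (roughly one
in six) and the eighteen-mer about 0.12 in expressed sequences (roughly one in eight), the same
order of magnitude as the "approximately one in five" stated in the text.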

The term "open reading frame," ORF, means a series of nucleotide triplets coding for 
amino acids without any termination codons and is a sequence translatable into protein. 
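
The ORF definition above can be expressed as a simple check. The Python sketch below is an
illustration only: it assumes the standard genetic code stop codons (TAA, TAG, TGA), which the
definition does not name, and the function name is hypothetical.

```python
# Assumption: standard genetic code termination codons, written as DNA.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def is_open_reading_frame(seq: str) -> bool:
    """True if seq is a whole number of triplets with no in-frame termination codon."""
    seq = seq.upper().replace("U", "T")  # accept RNA by reading U as T
    if not seq or len(seq) % 3 != 0:
        return False
    if any(base not in "ACGT" for base in seq):
        return False  # ambiguous bases such as N are rejected in this sketch
    codons = (seq[i:i + 3] for i in range(0, len(seq), 3))
    return not any(codon in STOP_CODONS for codon in codons)
```

Only in-frame triplets are examined, so "ATGTTAAGC" (codons ATG, TTA, AGC) passes even though
the letters TAA occur out of frame, whereas "ATGTAAGCC" fails because TAA is its second codon.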

The terms "operably linked" or "operably associated" refer to functionally related
nucleic acid sequences. For example, a promoter is operably associated or operably linked
with a coding sequence if the promoter controls the transcription of the coding sequence.
While operably linked nucleic acid sequences can be contiguous and in the same reading
frame, certain genetic elements, e.g., repressor genes, are not contiguously linked to the coding
sequence but still control transcription/translation of the coding sequence.

The term "pluripotent" refers to the capability of a cell to differentiate into a number
of differentiated cell types that are present in an adult organism. A pluripotent cell is
restricted in its differentiation capability in comparison to a totipotent cell.

The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an
oligopeptide, peptide, polypeptide or protein sequence or fragment thereof and to naturally
occurring or synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a
stretch of amino acid residues of at least about 5 amino acids, preferably at least about 7
amino acids, more preferably at least about 9 amino acids and most preferably at least about
17 or more amino acids. The peptide preferably is not greater than about 200 amino acids,
more preferably less than 150 amino acids and most preferably less than 100 amino acids.
Preferably the peptide is from about 5 to about 200 amino acids. To be active, any
polypeptide must have sufficient length to display biological and/or immunological activity.

The term "naturally occurring polypeptide" refers to polypeptides produced by cells
that have not been genetically engineered and specifically contemplates various polypeptides
arising from post-translational modifications of the polypeptide including, but not limited to,
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation.

The term "translated protein coding portion" means a sequence which encodes the
full length protein which may include any leader sequence or any processing sequence.

The term "mature protein coding sequence" means a sequence which encodes a
peptide or protein without a signal or leader sequence. The "mature protein portion" means
that portion of the protein which does not include a signal or leader sequence. The peptide
may have been produced by processing in the cell which removes any leader/signal sequence.
The mature protein portion may or may not include the initial methionine residue. The
methionine residue may be removed from the protein during processing in the cell. The
peptide may be produced synthetically or the protein may have been produced using a
polynucleotide encoding only the mature protein coding sequence.

The term "derivative" refers to polypeptides chemically modified by such techniques
as ubiquitination, labeling (e.g., with radionuclides or various enzymes), covalent polymer
attachment such as pegylation (derivatization with polyethylene glycol) and insertion or
substitution by chemical synthesis of amino acids such as ornithine, which do not normally
occur in human proteins.

The term "variant" (or "analog") refers to any polypeptide differing from naturally
occurring polypeptides by amino acid insertions, deletions, and substitutions, created using,
e.g., recombinant DNA techniques. Guidance in determining which amino acid residues may
be replaced, added or deleted without abolishing activities of interest, may be found by
comparing the sequence of the particular polypeptide with that of homologous peptides and
minimizing the number of amino acid sequence changes made in regions of high homology
(conserved regions) or by replacing amino acids with consensus sequence.

Alternatively, recombinant variants encoding these same or similar polypeptides may
be synthesized or selected by making use of the "redundancy" in the genetic code. Various
codon substitutions, such as the silent changes which produce various restriction sites, may
be introduced to optimize cloning into a plasmid or viral vector or expression in a particular
prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be reflected
in the polypeptide or domains of other peptides added to the polypeptide to modify the
properties of any part of the polypeptide, to change characteristics such as ligand-binding
affinities, interchain affinities, or degradation/turnover rate.

Preferably, amino acid "substitutions" are the result of replacing one amino acid with
another amino acid having similar structural and/or chemical properties, i.e., conservative
amino acid replacements. "Conservative" amino acid substitutions may be made on the basis
of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the
amphipathic nature of the residues involved. For example, nonpolar (hydrophobic) amino
acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and
methionine; polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine,
asparagine, and glutamine; positively charged (basic) amino acids include arginine, lysine,
and histidine; and negatively charged (acidic) amino acids include aspartic acid and glutamic
acid. "Insertions" or "deletions" are preferably in the range of about 1 to 20 amino acids,
more preferably 1 to 10 amino acids. The variation allowed may be experimentally
determined by systematically making insertions, deletions, or substitutions of amino acids in
a polypeptide molecule using recombinant DNA techniques and assaying the resulting
recombinant variants for activity.
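
The four residue classes listed above can be turned into a simple classifier for substitutions.
The Python sketch below is an illustration only: it treats a substitution as "conservative"
when both residues fall in the same listed class, which is one reading of the paragraph above,
it compares sequences of equal length without alignment, and its function names are hypothetical.

```python
# Residue classes exactly as listed in the text, in one-letter code.
CLASSES = {
    "nonpolar": "ALIVPFWM",
    "polar neutral": "GSTCYNQ",
    "basic": "RKH",
    "acidic": "DE",
}
CLASS_OF = {aa: name for name, members in CLASSES.items() for aa in members}


def is_conservative(original: str, replacement: str) -> bool:
    """True if both residues belong to the same class listed in the text."""
    return CLASS_OF[original.upper()] == CLASS_OF[replacement.upper()]


def classify_substitutions(reference: str, variant: str):
    """List (position, reference residue, variant residue, label) for each difference.

    Assumption: the two sequences are already aligned and of equal length,
    so only substitutions (no insertions or deletions) are considered.
    Positions are 1-based.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must be of equal length")
    changes = []
    for position, (ref, var) in enumerate(zip(reference.upper(), variant.upper()), start=1):
        if ref != var:
            label = "conservative" if is_conservative(ref, var) else "non-conservative"
            changes.append((position, ref, var, label))
    return changes
```

For example, comparing "MKLD" with "MRLK" reports the lysine-to-arginine change at position 2 as
conservative and the aspartic acid-to-lysine change at position 4 as non-conservative.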

Alternatively, where alteration of function is desired, insertions, deletions or
non-conservative alterations can be engineered to produce altered polypeptides. Such
alterations can, for example, alter one or more of the biological functions or biochemical
characteristics of the polypeptides of the invention. For example, such alterations may
change polypeptide characteristics such as ligand-binding affinities, interchain affinities, or
degradation/turnover rate. Further, such alterations can be selected so as to generate
polypeptides that are better suited for expression, scale up and the like in the host cells chosen
for expression. For example, cysteine residues can be deleted or substituted with another
amino acid residue in order to eliminate disulfide bridges.

The terms "purified" or "substantially purified" as used herein denote that the
indicated nucleic acid or polypeptide is present in the substantial absence of other biological
macromolecules, e.g., polynucleotides, proteins, and the like. In one embodiment, the
polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more
preferably at least 99% by weight, of the indicated biological macromolecules present (but
water, buffers, and other small molecules, especially molecules having a molecular weight of
less than 1000 daltons, can be present).

The term "isolated" as used herein refers to a nucleic acid or polypeptide separated
from at least one other component (e.g., nucleic acid or polypeptide) present with the nucleic
acid or polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide
is found in the presence of (if anything) only a solvent, buffer, ion, or other component
normally present in a solution of the same. The terms "isolated" and "purified" do not
encompass nucleic acids or polypeptides present in their natural source.

The term "recombinant," when used herein to refer to a polypeptide or protein, means 
that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or 
mammalian) expression systems. "Microbial" refers to recombinant polypeptides or proteins 
made in bacterial or fungal (e.g., yeast) expression systems. As a product, "recombinant 
microbial" defines a polypeptide or protein essentially free of native endogenous substances 
and unaccompanied by associated native glycosylation. Polypeptides or proteins expressed in 
most bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; polypeptides 
or proteins expressed in yeast will have a glycosylation pattern in general different from those 
expressed in mammalian cells. 

The term "recombinant expression vehicle or vector" refers to a plasmid or phage or 
virus or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression 
vehicle can comprise a transcriptional unit comprising an assembly of (1) a genetic element 
or elements having a regulatory role in gene expression, for example, promoters or enhancers, 
(2) a structural or coding sequence which is transcribed into mRNA and translated into 
protein, and (3) appropriate transcription initiation and termination sequences. Structural 
units intended for use in yeast or eukaryotic expression systems preferably include a leader 
sequence enabling extracellular secretion of translated protein by a host cell. Alternatively, 
where recombinant protein is expressed without a leader or transport sequence, it may include 
an amino terminal methionine residue. This residue may or may not be subsequently cleaved 
from the expressed recombinant protein to provide a final product. 

The term "recombinant expression system" means host cells which have stably 
integrated a recombinant transcriptional unit into chromosomal DNA or carry the 
recombinant transcriptional unit extrachromosomally. Recombinant expression systems as 
defined herein will express heterologous polypeptides or proteins upon induction of the 
regulatory elements linked to the DNA segment or synthetic gene to be expressed. This term 
also means host cells which have stably integrated a recombinant genetic element or elements 
having a regulatory role in gene expression, for example, promoters or enhancers. 
Recombinant expression systems as defined herein will express polypeptides or proteins 
endogenous to the cell upon induction of the regulatory elements linked to the endogenous 
DNA segment or gene to be expressed. The cells can be prokaryotic or eukaryotic. 

The term "secreted" includes a protein that is transported across or through a 
membrane, including transport as a result of signal sequences in its amino acid sequence 
when it is expressed in a suitable host cell. "Secreted" proteins include without limitation 
proteins secreted wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell in 
which they are expressed. "Secreted" proteins also include without limitation proteins that 
are transported across the membrane of the endoplasmic reticulum. "Secreted" proteins are 
also intended to include proteins containing non-typical signal sequences (e.g., Interleukin-1 
Beta, see Krasney, P.A. and Young, P.R. (1992) Cytokine 4(2):134-143) and factors released 
from damaged cells (e.g., Interleukin-1 Receptor Antagonist, see Arend, W.P. et al. (1998) 
Annu. Rev. Immunol. 16:27-55). 

Where desired, an expression vector may be designed to contain a "signal or leader 
sequence" which will direct the polypeptide through the membrane of a cell. Such a 
sequence may be naturally present on the polypeptides of the present invention or provided 
from heterologous protein sources by recombinant DNA techniques. 

The term "stringent" is used to refer to conditions that are commonly understood in 
the art as stringent. Stringent conditions can include highly stringent conditions (i.e., 
hybridization to filter-bound DNA in 0.5 M NaHPO4, 7% sodium dodecyl sulfate (SDS), 1 mM 
EDTA at 65°C, and washing in 0.1X SSC/0.1% SDS at 68°C), and moderately stringent 
conditions (i.e., washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary hybridization 
conditions are described herein in the examples. 

In instances of hybridization of deoxyoligonucleotides, additional exemplary stringent 
hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate at 37°C 
(for 14-base oligonucleotides), 48°C (for 17-base oligonucleotides), 55°C (for 20-base 
oligonucleotides), and 60°C (for 23-base oligonucleotides). 
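
The oligonucleotide wash temperatures listed above can be captured as a simple lookup. The following is a minimal sketch only; the names WASH_TEMP_C and stringent_wash_temp are hypothetical, and no interpolation between the four listed lengths is attempted because the text does not describe one.

```python
# Wash temperatures (degrees C) for washes in 6X SSC/0.05% sodium pyrophosphate,
# keyed by oligonucleotide length in bases, exactly as listed in the text.
WASH_TEMP_C = {14: 37, 17: 48, 20: 55, 23: 60}


def stringent_wash_temp(oligo_length: int) -> int:
    """Return the listed wash temperature for an oligo of the given length.

    Only the four lengths named in the text are supported (an assumption of
    this sketch); any other length raises ValueError rather than guessing.
    """
    if oligo_length not in WASH_TEMP_C:
        raise ValueError(f"no wash temperature listed for {oligo_length}-base oligos")
    return WASH_TEMP_C[oligo_length]
```

For example, stringent_wash_temp(20) returns 55, matching the 20-base entry above.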

As used herein, "substantially equivalent" or "substantially similar" can refer both to 
nucleotide and amino acid sequences, for example a mutant sequence, that varies from a 
reference sequence by one or more substitutions, deletions, or additions, the net effect of 
which does not result in an adverse functional dissimilarity between the reference and subject 
sequences. Typically, such a substantially equivalent sequence varies from one of those 
listed herein by no more than about 35% (i.e., the number of individual residue substitutions, 
additions, and/or deletions in a substantially equivalent sequence, as compared to the 
corresponding reference sequence, divided by the total number of residues in the substantially 
equivalent sequence is about 0.35 or less). Such a sequence is said to have 65% sequence 
identity to the listed sequence. In one embodiment, a substantially equivalent, e.g., mutant, 
sequence of the invention varies from a listed sequence by no more than 30% (70% sequence 
identity); in a variation of this embodiment, by no more than 25% (75% sequence identity); 
in a further variation of this embodiment, by no more than 20% (80% sequence identity); 
in a further variation of this embodiment, by no more than 10% (90% sequence identity); 
and in a further variation of this embodiment, by no more than 5% (95% sequence identity). 
Substantially equivalent, e.g., mutant, amino acid sequences according to the invention 
preferably have at least 80% sequence identity with a listed amino acid sequence, more 
preferably at least 85% sequence identity, more preferably at least 90% sequence identity, 
more preferably at least 95% sequence identity, more preferably at least 98% sequence 
identity, and most preferably at least 99% sequence identity. Substantially equivalent 
nucleotide sequences of the invention can have lower percent sequence identities, taking into 
account, for example, the redundancy or degeneracy of the genetic code. Preferably, the 
nucleotide sequence has at least about 65% identity, more preferably at least about 75% 
identity, more preferably at least about 80% sequence identity, more preferably at least 85% 
sequence identity, more preferably at least 90% sequence identity, more preferably at least 
about 95% sequence identity, more preferably at least 98% sequence identity, and most 
preferably at least 99% sequence identity. For the purposes of the present invention, 
sequences having substantially equivalent biological activity and substantially equivalent 
expression characteristics are considered substantially equivalent. For the purposes of 
determining equivalence, truncation of the mature sequence (e.g., via a mutation which 
creates a spurious stop codon) should be disregarded. Sequence identity may be determined, 
e.g., using the Jotun Hein method (Hein, J. (1990) Methods Enzymol. 183:626-645). Identity 
between sequences can also be determined by other methods known in the art, e.g. by varying 
hybridization conditions. 
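
The variation ratio defined above (substitutions, additions, and deletions divided by the number of residues in the substantially equivalent sequence) can be illustrated as follows. This is a minimal sketch under stated assumptions: the two sequences are supplied already aligned, with "-" marking gaps, and the function names are hypothetical. It does not implement the Jotun Hein alignment method; it only applies the ratio to an alignment produced elsewhere.

```python
def fraction_varied(reference_aln: str, subject_aln: str) -> float:
    """Differences divided by the residue count of the subject sequence.

    Both arguments are rows of one pairwise alignment (equal length, '-' for
    a gap). The subject is the candidate "substantially equivalent" sequence.
    """
    if len(reference_aln) != len(subject_aln):
        raise ValueError("aligned sequences must have equal length")
    differences = 0
    subject_residues = 0
    for ref, sub in zip(reference_aln.upper(), subject_aln.upper()):
        if sub != "-":
            subject_residues += 1  # residue actually present in the subject
        if ref != sub:
            differences += 1       # substitution, addition, or deletion
    if subject_residues == 0:
        raise ValueError("subject sequence contains no residues")
    return differences / subject_residues


def percent_identity(reference_aln: str, subject_aln: str) -> float:
    """Percent identity as used in the text: 100 * (1 - fraction varied)."""
    return 100.0 * (1.0 - fraction_varied(reference_aln, subject_aln))


def within_variation_limit(reference_aln: str, subject_aln: str,
                           max_fraction: float = 0.35) -> bool:
    """True if the subject varies by no more than max_fraction (0.35 = 35%)."""
    return fraction_varied(reference_aln, subject_aln) <= max_fraction
```

With the aligned pair "ACGTACGTAC" and "ACGTTCGT-C" there is one substitution and one deletion against nine subject residues, so the fraction varied is 2/9 and the pair falls within the 35% limit.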

The term "totipotent" refers to the capability of a cell to differentiate into all of the 
cell types of an adult organism. 

The term "transformation" means introducing DNA into a suitable host cell so that the 
DNA is replicable, either as an extrachromosomal element, or by chromosomal integration. 
The term "transfection" refers to the taking up of an expression vector by a suitable host cell, 
whether or not any coding sequences are in fact expressed. The term "infection" refers to the 
introduction of nucleic acids into a suitable host cell by use of a virus or viral vector. 

As used herein, an "uptake modulating fragment," UMF, means a series of nucleotides 
which mediate the uptake of a linked DNA fragment into a cell. UMFs can be readily 
identified using known UMFs as a target sequence or target motif with the computer-based 
systems described below. The presence and activity of a UMF can be confirmed by attaching 
the suspected UMF to a marker sequence. The resulting nucleic acid molecule is then 
incubated with an appropriate host under appropriate conditions and the uptake of the marker 
sequence is determined. As described above, a UMF will increase the frequency of uptake of 
a linked marker sequence. 

Each of the above terms is meant to encompass all that is described for each, unless 
the context dictates otherwise. 

3.2 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 

The isolated polynucleotides of the invention include a polynucleotide comprising the 
nucleotide sequences of SEQ ID NO: 1 - 948; a polynucleotide encoding any one of the 
peptide sequences of SEQ ID NO: 949-1896; and a polynucleotide comprising the nucleotide 
sequence encoding the mature protein coding sequence of the polynucleotides of any one of 
SEQ ID NO: 1 - 948. The polynucleotides of the present invention also include, but are not 
limited to, a polynucleotide that hybridizes under stringent conditions to (a) the complement 
of any of the nucleotide sequences of SEQ ID NO: 1 - 948; (b) nucleotide sequences 
encoding any one of the amino acid sequences set forth in the Sequence Listing as SEQ ID 
NO: 949-1896; (c) a polynucleotide which is an allelic variant of any polynucleotide recited 
above; (d) a polynucleotide which encodes a species homolog of any of the proteins recited 
above; or (e) a polynucleotide that encodes a polypeptide comprising a specific domain or 
truncation of the polypeptides of SEQ ID NO: 949-1896. Domains of interest may depend on 
the nature of the encoded polypeptide; e.g., domains in receptor-like polypeptides include 
ligand-binding, extracellular, transmembrane, or cytoplasmic domains, or combinations 
thereof; domains in immunoglobulin-like proteins include the variable immunoglobulin-like 
domains; domains in enzyme-like polypeptides include catalytic and substrate binding 
domains; and domains in ligand polypeptides include receptor-binding domains. 

The polynucleotides of the invention include naturally occurring or wholly or partially 
synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The polynucleotides 
may include all of the coding region of the cDNA or may represent a portion of the coding 
region of the cDNA. 

The present invention also provides genes corresponding to the cDNA sequences 
disclosed herein. The corresponding genes can be isolated in accordance with known methods 
using the sequence information disclosed herein. Such methods include the preparation of 
probes or primers from the disclosed sequence information for identification and/or 
amplification of genes in appropriate genomic libraries or other sources of genomic materials. 
Further 5' and 3' sequence can be obtained using methods known in the art. For example, full 
length cDNA or genomic DNA that corresponds to any of the polynucleotides of SEQ ID NO: 1 
- 948 can be obtained by screening appropriate cDNA or genomic DNA libraries under suitable 
hybridization conditions using any of the polynucleotides of SEQ ID NO: 1 - 948 or a portion 
thereof as a probe. Alternatively, the polynucleotides of SEQ ID NO: 1 - 948 may be used as 
the basis for suitable primer(s) that allow identification and/or amplification of genes in 
appropriate genomic DNA or cDNA libraries. 

The nucleic acid sequences of the invention can be assembled from ESTs and sequences 
(including cDNA and genomic sequences) obtained from one or more public databases, such as 
dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence information, 
representative fragment or segment information, or novel segment information for the full-length 
gene. 

The polynucleotides of the invention also provide polynucleotides including 
nucleotide sequences that are substantially equivalent to the polynucleotides recited above. 
Polynucleotides according to the invention can have, e.g., at least about 65%, at least about 
70%, at least about 75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least 
about 85%, 86%, 87%, 88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, 
and even more typically at least about 95%, 96%, 97%, 98%, 99% sequence identity to a 
polynucleotide recited above. 

Included within the scope of the nucleic acid sequences of the invention are nucleic 
acid sequence fragments that hybridize under stringent conditions to any of the nucleotide 
sequences of SEQ ID NO: 1 - 948, or complements thereof, which fragment is greater than 
about 5 nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and 
most preferably greater than 17 nucleotides. Fragments of, e.g., 15, 17, or 20 nucleotides or 
more that are selective for (i.e., specifically hybridize to) any one of the polynucleotides of the 
invention are contemplated. Probes capable of specifically hybridizing to a polynucleotide 
can differentiate polynucleotide sequences of the invention from other polynucleotide 
sequences in the same family of genes or can differentiate human genes from genes of other 
species, and are preferably based on unique nucleotide sequences. 

The sequences falling within the scope of the present invention are not limited to these 
specific sequences, but also include allelic and species variations thereof. Allelic and species 
variations can be routinely determined by comparing the sequence provided in SEQ ID NO: 1 - 
948, a representative fragment thereof, or a nucleotide sequence at least 90% identical, 
preferably 95% identical, to SEQ ID NOs: 1 - 948 with a sequence from another isolate of the 
same species. Furthermore, to accommodate codon variability, the invention includes nucleic 
acid molecules coding for the same amino acid sequences as do the specific ORFs disclosed 
herein. In other words, in the coding region of an ORF, substitution of one codon for another 
codon that encodes the same amino acid is expressly contemplated. 
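
The codon-variability point above can be checked mechanically: two ORFs that differ only by synonymous codons translate to the same amino acid sequence. The sketch below assumes the standard genetic code and in-frame DNA (or RNA) input; the function names are hypothetical and are not part of the disclosure.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(orf: str) -> str:
    """Translate an in-frame ORF into one-letter amino acid codes."""
    orf = orf.upper().replace("U", "T")
    if len(orf) % 3 != 0:
        raise ValueError("ORF length must be a multiple of three")
    return "".join(CODON_TABLE[orf[i:i + 3]] for i in range(0, len(orf), 3))


def encode_same_protein(orf_a: str, orf_b: str) -> bool:
    """True if both ORFs encode the same amino acid sequence."""
    return translate(orf_a) == translate(orf_b)
```

For instance, "ATGCTGAAA" and "ATGTTAAAG" both translate to Met-Leu-Lys, so they code for the same amino acid sequence despite differing at two codons.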

The nearest neighbor or homology result for the nucleic acids of the present invention, 
including SEQ ID NOs: 1 - 948, can be obtained by searching a database using an algorithm or a 
program. Preferably, BLAST, which stands for Basic Local Alignment Search Tool, is used to 
search for local sequence alignments (Altschul, S.F., J. Mol. Evol. 36:290-300 (1993) and Altschul, 
S.F. et al., J. Mol. Biol. 215:403-410 (1990)). Alternatively, a FASTA version 3 search against 
Genpept, using the Fastxy algorithm, may be used. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are 
also provided by the present invention. Species homologs may be isolated and identified by 
making suitable probes or primers from the sequences provided herein and screening a 
suitable nucleic acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides or 
proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide which 
also encode proteins which are identical, homologous or related to that encoded by the 
polynucleotides. 

The nucleic acid sequences of the invention are further directed to sequences which 
encode variants of the described nucleic acids. These amino acid sequence variants may be 
prepared by methods known in the art by introducing appropriate nucleotide changes into a 
native or variant polynucleotide. There are two variables in the construction of amino acid 
sequence variants: the location of the mutation and the nature of the mutation. Nucleic acids 
encoding the amino acid sequence variants are preferably constructed by mutating the 
polynucleotide to encode an amino acid sequence that does not occur in nature. These 
nucleic acid alterations can be made at sites that differ in the nucleic acids from different 
species (variable positions) or in highly conserved regions (constant regions). Sites at such 
locations will typically be modified in series, e.g., by substituting first with conservative 
choices (e.g., hydrophobic amino acid to a different hydrophobic amino acid) and then with 
more distant choices (e.g., hydrophobic amino acid to a charged amino acid), and then 
deletions or insertions may be made at the target site. Amino acid sequence deletions 
generally range from about 1 to 30 residues, preferably about 1 to 10 residues, and are 
typically contiguous. Amino acid insertions include amino- and/or carboxyl-terminal fusions 
ranging in length from one to one hundred or more residues, as well as intrasequence 
insertions of single or multiple amino acid residues. Intrasequence insertions may range 
generally from about 1 to 10 amino residues, preferably from 1 to 5 residues. Examples of 
terminal insertions include the heterologous signal sequences necessary for secretion or for 
intracellular targeting in different host cells and sequences such as FLAG or poly-histidine 
sequences useful for purifying the expressed protein. 

In a preferred method, polynucleotides encoding the novel amino acid sequences are 
changed via site-directed mutagenesis. This method uses oligonucleotide sequences to alter a 
polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent 
nucleotides on both sides of the changed amino acid to form a stable duplex on either side of 
the site being changed. In general, the techniques of site-directed mutagenesis are well 
known to those of skill in the art and this technique is exemplified by publications such as 
Edelman et al., DNA 2:183 (1983). A versatile and efficient method for producing 
site-specific changes in a polynucleotide sequence was published by Zoller and Smith, 
Nucleic Acids Res. 10:6487-6500 (1982). PCR may also be used to create amino acid 
sequence variants of the novel nucleic acids. When small amounts of template DNA are used 
as starting material, primers that differ slightly in sequence from the corresponding region 
in the template DNA can generate the desired amino acid variant. PCR amplification results 
in a population of product DNA fragments that differ from the polynucleotide template 
encoding the polypeptide at the position specified by the primer. The product DNA 
fragments replace the corresponding region in the plasmid and this gives a polynucleotide 
encoding the desired amino acid variant. 
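
The oligonucleotide described above, a changed codon with adjacent unchanged nucleotides on both sides, can be sketched as follows. This is an illustration under stated assumptions: the template is the coding strand beginning in frame at codon 0, the function name mutagenic_oligo is hypothetical, and the default flank of 12 nucleotides is an arbitrary choice, since the text specifies only that the flanks be sufficient to form a stable duplex.

```python
def mutagenic_oligo(template: str, codon_index: int, new_codon: str,
                    flank: int = 12) -> str:
    """Build a coding-strand oligo that replaces one codon of the template.

    template    -- coding-strand DNA, in frame from position 0 (assumption)
    codon_index -- zero-based index of the codon to change
    new_codon   -- the three-base replacement codon
    flank       -- unchanged template bases kept on each side (assumed default)
    """
    template = template.upper()
    new_codon = new_codon.upper()
    if len(new_codon) != 3:
        raise ValueError("replacement codon must be three bases long")
    start = 3 * codon_index
    if start - flank < 0 or start + 3 + flank > len(template):
        raise ValueError("template too short for the requested flanks")
    left = template[start - flank:start]            # unchanged 5' flank
    right = template[start + 3:start + 3 + flank]   # unchanged 3' flank
    return left + new_codon + right
```

As a usage example, replacing the cysteine codon TGT (codon index 2) of "ATGGCTTGTGGAAAACTGCCC" with the serine codon TCT and six-base flanks gives "ATGGCTTCTGGAAAA", which differs from the template only at the targeted codon.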

A further technique for generating amino acid variants is the cassette mutagenesis 
technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques 
well known in the art, such as, for example, the techniques in Sambrook et al., supra, and 
Current Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of 
the genetic code, other DNA sequences which encode substantially the same or a functionally 
equivalent amino acid sequence may be used in the practice of the invention for the cloning 
and expression of these novel nucleic acids. Such DNA sequences include those which are 
capable of hybridizing to the appropriate novel nucleic acid sequence under stringent 
conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention can be 
used to generate polynucleotides encoding chimeric or fusion proteins comprising one or 
more domains of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any of 
the polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, 
amplified, or synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides 
are well known to those of skill in the art and can include, for example, methods for 
determining hybridization conditions that can routinely isolate polynucleotides of the desired 
sequence identities. 

In accordance with the invention, polynucleotide sequences comprising the mature 
protein coding sequences corresponding to any one of SEQ ID NO: 1-948, or functional 
equivalents thereof, may be used to generate recombinant DNA molecules that direct the 
expression of that nucleic acid, or a functional equivalent thereof, in appropriate host cells. 
Also included are the cDNA inserts of any of the clones identified herein. 

A polynucleotide according to the invention can be joined to any of a variety of other 
nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et 
al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). 
Useful nucleotide sequences for joining to polynucleotides include an assortment of vectors, 
e.g., plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well 
known in the art. Accordingly, the invention also provides a vector including a 
polynucleotide of the invention and a host cell containing the polynucleotide. In general, the 
vector contains an origin of replication functional in at least one organism, convenient 
restriction endonuclease sites, and a selectable marker for the host cell. Vectors according to 
the invention include expression vectors, replication vectors, probe generation vectors, and 
sequencing vectors. A host cell according to the invention can be a prokaryotic or eukaryotic 
cell and can be a unicellular organism or part of a multicellular organism. 

The present invention further provides recombinant constructs comprising a nucleic 
acid having any of the nucleotide sequences of SEQ ID NOs: 1 - 948 or a fragment thereof or 
any other polynucleotides of the invention. In one embodiment, the recombinant constructs of 
the present invention comprise a vector, such as a plasmid or viral vector, into which a 
nucleic acid having any of the nucleotide sequences of SEQ ID NOs: 1 - 948 or a fragment 
thereof is inserted, in a forward or reverse orientation. In the case of a vector comprising one 
of the ORFs of the present invention, the vector may further comprise regulatory sequences, 
including for example, a promoter, operably linked to the ORF. Large numbers of suitable 
vectors and promoters are known to those of skill in the art and are commercially available 
for generating the recombinant constructs of the present invention. The following vectors are 
provided by way of example. Bacterial: pBs, phagescript, PsiX174, pBluescript SK, pBs 
KS, pNH8a, pNH16a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, 
pDR540, pRIT5 (Pharmacia). Eukaryotic: pWLneo, pSV2cat, pOG44, PXTI, pSG 
(Stratagene); pSVK3, pBPV, pMSG, pSVL (Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an expression 
control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al., 
Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinantly. 
Many suitable expression control sequences are known in the art. General methods of 
expressing recombinant proteins are also known and are exemplified in R. Kaufman, Methods 
in Enzymology 185, 537-566 (1990). As defined herein "operably linked" means that the 
isolated polynucleotide of the invention and an expression control sequence are situated 
within a vector or cell in such a way that the protein is expressed by a host cell which has 
been transformed (transfected) with the ligated polynucleotide/expression control sequence. 

Promoter regions can be selected from any desired gene using CAT (chloramphenicol 
transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 
pKK232-8 and pCM7. Particular named bacterial promoters include lacI, lacZ, T3, T7, gpt, 
lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine 
kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection 
of the appropriate vector and promoter is well within the level of ordinary skill in the art. 
Generally, recombinant expression vectors will include origins of replication and selectable 
markers permitting transformation of the host cell, e.g., the ampicillin resistance gene of E. 
coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to 
direct transcription of a downstream structural sequence. Such promoters can be derived 
from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), 
α-factor, acid phosphatase, or heat shock proteins, among others. The heterologous structural 
sequence is assembled in appropriate phase with translation initiation and termination 
sequences, and preferably, a leader sequence capable of directing secretion of translated 
protein into the periplasmic space or extracellular medium. Optionally, the heterologous 
sequence can encode a fusion protein including an amino terminal identification peptide 
imparting desired characteristics, e.g., stabilization or simplified purification of expressed 
recombinant product. Useful expression vectors for bacterial use are constructed by inserting 
a structural DNA sequence encoding a desired protein together with suitable translation 
initiation and termination signals in operable reading phase with a functional promoter. The 
vector will comprise one or more phenotypic selectable markers and an origin of replication 
to ensure maintenance of the vector and to, if desirable, provide amplification within the host. 
Suitable prokaryotic hosts for transformation include E. coli, Bacillus subtilis, Salmonella 
typhimurium and various species within the genera Pseudomonas, Streptomyces, and 
Staphylococcus, although others may also be employed as a matter of choice. 

As a representative but non-limiting example, useful expression vectors for bacterial 
use can comprise a selectable marker and bacterial origin of replication derived from 
commercially available plasmids comprising genetic elements of the well known cloning 
vector pBR322 (ATCC 37017). Such commercial vectors include, for example, pKK223-3 
(Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, 
USA). These pBR322 "backbone" sections are combined with an appropriate promoter and 
the structural sequence to be expressed. Following transformation of a suitable host strain 
and growth of the host strain to an appropriate cell density, the selected promoter is induced 
or derepressed by appropriate means (e.g., temperature shift or chemical induction) and cells 
are cultured for an additional period. Cells are typically harvested by centrifugation, 
disrupted by physical or chemical means, and the resulting crude extract retained for further 
purification. 

Polynucleotides of the invention can also be used to induce immune responses. For 
example, as described in Fan et al., Nat Biotech. 17:870-872 (1999), incorporated herein by 
reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies 
against the encoded polypeptide following topical administration of naked plasmid DNA or 
following injection, and preferably intra-muscular injection of the DNA. The nucleic acid 
sequences are preferably inserted in a recombinant expression vector and may be in the form 
of naked DNA. 

3.3 ANTISENSE 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules 
that are hybridizable to or complementary to the nucleic acid molecule comprising the 
nucleotide sequence of SEQ ID NO: 1 - 948, or fragments, analogs or derivatives thereof. An 
"antisense" nucleic acid comprises a nucleotide sequence that is complementary to a "sense" 
nucleic acid encoding a protein, e.g., complementary to the coding strand of a 
double-stranded cDNA molecule or complementary to an mRNA sequence. In specific 
aspects, antisense nucleic acid molecules are provided that comprise a sequence 
complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire coding 
strand, or to only a portion thereof. Nucleic acid molecules encoding fragments, homologs, 
derivatives and analogs of a protein of any of SEQ ID NO: 949-1896 or antisense nucleic 
acids complementary to a nucleic acid sequence of SEQ ID NO: 1 - 948 are additionally 
provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence of the invention. The term "coding 
region" refers to the region of the nucleotide sequence comprising codons which are 
translated into amino acid residues. In another embodiment, the antisense nucleic acid 
molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence 
of the invention. The term "noncoding region" refers to 5' and 3' sequences that flank the 
coding region that are not translated into amino acids (i.e., also referred to as 5' and 3' 
untranslated regions). 

Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g., 
SEQ ID NO: 1 - 948), antisense nucleic acids of the invention can be designed according to 
the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid 
molecule can be complementary to the entire coding region of an mRNA, but more preferably 
is an oligonucleotide that is antisense to only a portion of the coding or noncoding region of 
an mRNA. For example, the antisense oligonucleotide can be complementary to the region 
surrounding the translation start site of an mRNA. An antisense oligonucleotide can be, for 
example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense 
nucleic acid of the invention can be constructed using chemical synthesis or enzymatic 
ligation reactions using procedures known in the art. For example, an antisense nucleic acid 
(e.g., an antisense oligonucleotide) can be chemically synthesized using naturally occurring 
nucleotides or variously modified nucleotides designed to increase the biological stability of 
the molecules or to increase the physical stability of the duplex formed between the antisense 
and sense nucleic acids, e.g., phosphorothioate derivatives and acridine substituted 
nucleotides can be used. 
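
Designing an antisense oligonucleotide by Watson and Crick base pairing against the region surrounding the translation start site, as described above, amounts to taking the reverse complement of a window of the coding strand. The following is a minimal sketch under stated assumptions: the function names, the window parameters, and their defaults are hypothetical, and the output is written as unmodified DNA; it does not model Hoogsteen pairing or any of the modified nucleotides.

```python
# Complement map for DNA input; U is accepted so that mRNA can be passed in.
_COMPLEMENT = str.maketrans("ACGTU", "TGCAA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a nucleotide sequence as DNA."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def antisense_oligo(coding_strand: str, start_codon_pos: int,
                    upstream: int = 10, downstream: int = 10) -> str:
    """Antisense oligo covering the region around the translation start site.

    coding_strand   -- sense-strand sequence, written 5' to 3'
    start_codon_pos -- zero-based index of the A of the start codon
    upstream        -- bases included 5' of the start codon (assumed default)
    downstream      -- bases included from the start codon onward (assumed default)
    The returned oligo is written 5' to 3' and has length upstream + downstream.
    """
    begin = start_codon_pos - upstream
    end = start_codon_pos + downstream
    if begin < 0 or end > len(coding_strand):
        raise ValueError("window extends beyond the supplied sequence")
    return reverse_complement(coding_strand[begin:end])
```

For the illustrative sense sequence "GGCCGCCACCATGGCTAGCAAAGGT", whose start codon is at index 10, a window of 5 upstream and 10 downstream bases yields the 15-base antisense oligonucleotide "TGCTAGCCATGGTGG", within the 5 to 50 nucleotide range mentioned above.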

Examples of modified nucleotides that can be used to generate the antisense nucleic 
acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, 
xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, 
inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5-methoxycarboxymethyluracil, 5-methoxyuracil, 
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 
5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 
2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically 
using an expression vector into which a nucleic acid has been subcloned in an antisense 
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding a protein according to the invention to thereby inhibit expression of 
the protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the case 
of an antisense nucleic acid molecule that binds to DNA duplexes, through specific 
interactions in the major groove of the double helix. An example of a route of administration 
of antisense nucleic acid molecules of the invention includes direct injection at a tissue site. 
Alternatively, antisense nucleic acid molecules can be modified to target selected cells and 
then administered systemically. For example, for systemic administration, antisense 
molecules can be modified such that they specifically bind to receptors or antigens expressed 
on a selected cell surface, e.g., by linking the antisense nucleic acid molecules to peptides or 
antibodies that bind to cell surface receptors or antigens. The antisense nucleic acid 
molecules can also be delivered to cells using the vectors described herein. To achieve 
sufficient intracellular concentrations of antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the control of a strong pol II or pol III 
promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual β-units, the 
strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res 15: 6625-6641). 
The antisense nucleic acid molecule can also comprise a 2'-O-methylribonucleotide (Inoue et 
al. (1987) Nucleic Acids Res 15: 6131-6148) or a chimeric RNA-DNA analogue (Inoue et al. 
(1987) FEBS Lett 215: 327-330). 

3.4 RIBOZYMES AND PNA MOIETIES 

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of 
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 
complementary region. Thus, ribozymes (e.g., hammerhead ribozymes (described in 
Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave 
mRNA transcripts to thereby inhibit translation of an mRNA. A ribozyme having specificity 
for a nucleic acid of the invention can be designed based upon the nucleotide sequence of a 
DNA disclosed herein (i.e., SEQ ID NO: 1-948). For example, a derivative of Tetrahymena 
L-19 IVS RNA can be constructed in which the nucleotide sequence of the active site is 
complementary to the nucleotide sequence to be cleaved in a SECX-encoding mRNA. See, 
e.g., Cech et al. U.S. Pat. No. 4,987,071; and Cech et al. U.S. Pat. No. 5,116,742. 
Alternatively, SECX mRNA can be used to select a catalytic RNA having a specific 
ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel et al., (1993) Science 
261:1411-1418. 
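
By way of a hedged illustration of how candidate cleavage sites for a hammerhead ribozyme might be located in a target mRNA, the sketch below scans for NUH triplets (N = any nucleotide, H = A, C or U). The NUH rule is a commonly cited convention for hammerhead targets and is an assumption here; it is not stated in this specification. The function name and test sequence are hypothetical.

```python
from typing import List


def nuh_sites(mrna: str) -> List[int]:
    """Return 0-based indices of the H nucleotide of every NUH triplet.

    Under the assumed NUH convention, cleavage would occur 3' of that nucleotide.
    """
    mrna = mrna.upper().replace("T", "U")
    sites = []
    for i in range(len(mrna) - 2):
        # Position i is N (any base); i+1 must be U; i+2 must be A, C or U.
        if mrna[i + 1] == "U" and mrna[i + 2] in "ACU":
            sites.append(i + 2)
    return sites


# Made-up fragment containing a single GUC triplet.
candidate_sites = nuh_sites("AAGUCAA")
```

A practical design would additionally choose flanking arms complementary to the sequence on either side of each site, which this sketch does not attempt.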

Alternatively, gene expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple 
helical structures that prevent transcription of the gene in target cells. See generally, Helene 
(1991) Anticancer Drug Des. 6: 569-84; Helene et al. (1992) Ann. N.Y. Acad. Sci. 660:27-36; 
and Maher (1992) Bioessays 14: 807-15. 

In various embodiments, the nucleic acids of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or 
solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic 
acids can be modified to generate peptide nucleic acids (see Hyrup et al. (1996) Bioorg Med 
Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic 
acid mimics, e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by 
a pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA 
under conditions of low ionic strength. The synthesis of PNA oligomers can be performed 
using standard solid phase peptide synthesis protocols as described in Hyrup et al (1996) 
above; Perry-O'Keefe et al (1996) PNAS 93: 14670-675. 

PNAs of the invention can be used in therapeutic and diagnostic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific modulation 
of gene expression by, e.g., inducing transcription or translation arrest or inhibiting 
replication. PNAs of the invention can also be used, e.g., in the analysis of single base pair 
mutations in a gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes 
when used in combination with other enzymes, e.g., S1 nucleases (Hyrup B. (1996) above); 
or as probes or primers for DNA sequence and hybridization (Hyrup et al. (1996), above; 
Perry-O'Keefe (1996), above). 

In another embodiment, PNAs of the invention can be modified, e.g., to enhance their 
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art. For example, PNA-DNA chimeras can be generated that may 
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion 
while the PNA portion would provide high binding affinity and specificity. PNA-DNA 
chimeras can be linked using linkers of appropriate lengths selected in terms of base stacking, 
number of bonds between the nucleobases, and orientation (Hyrup (1996) above). The 
synthesis of PNA-DNA chimeras can be performed as described in Hyrup (1996) above and 
Finn et al. (1996) Nucl Acids Res 24: 3357-63. For example, a DNA chain can be 
synthesized on a solid support using standard phosphoramidite coupling chemistry, and 
modified nucleoside analogs, e.g., 5'-(4-methoxytrityl)amino-5'-deoxy-thymidine 
phosphoramidite, can be used between the PNA and the 5' end of DNA (Mag et al. (1989) 
Nucl Acid Res 17: 5973-88). PNA monomers are then coupled in a stepwise manner to 
produce a chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al. 
(1996) above). Alternatively, chimeric molecules can be synthesized with a 5' DNA segment 
and a 3' PNA segment. See, Petersen et al. (1975) Bioorg Med Chem Lett 5:1119-11124. 

In other embodiments, the oligonucleotide may include other appended groups such 
as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport 
across the cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 
86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. 
WO88/09810) or the blood-brain barrier (see, e.g., PCT Publication No. WO89/10134). In 
addition, oligonucleotides can be modified with hybridization triggered cleavage agents (see, 
e.g., Krol et al., 1988, BioTechniques 6:958-976) or intercalating agents (see, e.g., Zon, 
1988, Pharm. Res. 5: 539-549). To this end, the oligonucleotide may be conjugated to 
another molecule, e.g., a peptide, a hybridization triggered cross-linking agent, a transport 
agent, a hybridization-triggered cleavage agent, etc. 

3.5 HOSTS 

The present invention further provides host cells genetically engineered to contain the 
polynucleotides of the invention. For example, such host cells may contain nucleic acids of 
the invention introduced into the host cell using known transformation, transfection or 
infection methods. The present invention still further provides host cells genetically 
engineered to express the polynucleotides of the invention, wherein such polynucleotides are 
in operative association with a regulatory sequence heterologous to the host cell which drives 
expression of the polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, or 
increase, expression of endogenous polypeptide. Cells can be modified (e.g., by homologous 
recombination) to provide increased polypeptide expression by replacing, in whole or in part, 
the naturally occurring promoter with all or part of a heterologous promoter so that the cells 
express the polypeptide at higher levels. The heterologous promoter is inserted in such a 
manner that it is operatively linked to the encoding sequences. See, for example, PCT 
International Publication No. WO94/12650, PCT International Publication No. WO92/20808, 
and PCT International Publication No. WO91/09955. It is also contemplated that, in addition 
to heterologous promoter DNA, amplifiable marker DNA (e.g., ada, dhfr, and the 
multifunctional CAD gene which encodes carbamyl phosphate synthase, aspartate 
transcarbamylase, and dihydroorotase) and/or intron DNA may be inserted along with the 
heterologous promoter DNA. If linked to the coding sequence, amplification of the marker 
DNA by standard selection methods results in co-amplification of the desired protein coding 
sequences in the cells. 

The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower 
eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the recombinant construct into the host cell can be effected by 
calcium phosphate transfection, DEAE-dextran mediated transfection, or electroporation 
(Davis, L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one 
of the polynucleotides of the invention can be used in conventional manners to produce the 
gene product encoded by the isolated fragment (in the case of an ORF) or can be used to 
produce a heterologous protein under the control of the EMF. 

Any host/vector system can be used to express one or more of the ORFs of the present 
invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, CV-1 
cells, COS cells, 293 cells, and Sf9 cells, as well as prokaryotic hosts such as E. coli and B. 
subtilis. The most preferred cells are those which do not normally express the particular 
polypeptide or protein or which express the polypeptide or protein at a low natural level. 
Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the 
control of appropriate promoters. Cell-free translation systems can also be employed to 
produce such proteins using RNAs derived from the DNA constructs of the present invention. 
Appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts are 
described by Sambrook, et al., in Molecular Cloning: A Laboratory Manual, Second Edition, 
Cold Spring Harbor, New York (1989), the disclosure of which is hereby incorporated by 
reference. 

Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS-7 lines 
of monkey kidney fibroblasts, described by Gluzman, Cell 23:175 (1981). Other cell lines 
capable of expressing a compatible vector are, for example, the C127, monkey COS cells, 
Chinese Hamster Ovary (CHO) cells, human kidney 293 cells, human epidermal A431 cells, 
human Colo205 cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal 
diploid cells, cell strains derived from in vitro culture of primary tissue, primary explants, 
HeLa cells, mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. Mammalian expression 
vectors will comprise an origin of replication, a suitable promoter and also any necessary 
ribosome binding sites, polyadenylation site, splice donor and acceptor sites, transcriptional 
termination sequences, and 5' flanking nontranscribed sequences. DNA sequences derived 
from the SV40 viral genome, for example, SV40 origin, early promoter, enhancer, splice, and 
polyadenylation sites may be used to provide the required nontranscribed genetic elements. 

Recombinant polypeptides and proteins produced in bacterial culture are usually isolated by 
initial extraction from cell pellets, followed by one or more salting-out, aqueous ion exchange 
or size exclusion chromatography steps. Protein refolding steps can be used, as necessary, in 
completing configuration of the mature protein. Finally, high performance liquid 
chromatography (HPLC) can be employed for final purification steps. Microbial cells 
employed in expression of proteins can be disrupted by any convenient method, including 
freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents. 

Alternatively, it may be possible to produce the protein in lower eukaryotes such as 
yeast or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or 
any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 
strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 
strain capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, 
it may be necessary to modify the protein produced therein, for example by phosphorylation 
or glycosylation of the appropriate sites, in order to obtain the functional protein. Such 
covalent attachments may be accomplished using known chemical or enzymatic methods. 

In another embodiment of the present invention, cells and tissues may be engineered 
to express an endogenous gene comprising the polynucleotides of the invention under the 
control of inducible regulatory elements, in which case the regulatory sequences of the 
endogenous gene may be replaced by homologous recombination. As described herein, gene 
targeting can be used to replace a gene's existing regulatory region with a regulatory 
sequence isolated from a different gene or a novel regulatory sequence synthesized by genetic 
engineering methods. Such regulatory sequences may be comprised of promoters, enhancers, 
scaffold-attachment regions, negative regulatory elements, transcriptional initiation sites, 
regulatory protein binding sites or combinations of said sequences. Alternatively, sequences 
which affect the structure or stability of the RNA or protein produced may be replaced, 
removed, added, or otherwise modified by targeting. These sequences include 
polyadenylation signals, mRNA stability elements, splice sites, leader sequences for 
enhancing or modifying transport or secretion properties of the protein, or other sequences 
which alter or improve the function or stability of protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the 
gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple 
deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory 
element. Alternatively, the targeting event may replace an existing element; for example, a 
tissue-specific enhancer can be replaced by an enhancer that has broader or different cell-type 
specificity than the naturally occurring elements. Here, the naturally occurring sequences are 
deleted and new sequences are added. In all cases, the identification of the targeting event 
may be facilitated by the use of one or more selectable marker genes that are contiguous with 
the targeting DNA, allowing for the selection of cells in which the exogenous DNA has 
integrated into the host cell genome. The identification of the targeting event may also be 
facilitated by the use of one or more marker genes exhibiting the property of negative 
selection, such that the negatively selectable marker is linked to the exogenous DNA, but 
configured such that the negatively selectable marker flanks the targeting sequence, and such 
that a correct homologous recombination event with sequences in the host cell genome does 
not result in the stable integration of the negatively selectable marker. Markers useful for this 
purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial 
xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance 
with this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 
to Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. 
PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No. 
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by 
reference herein in its entirety. 

3.6 POLYPEPTIDES OF THE INVENTION 

The isolated polypeptides of the invention include, but are not limited to, a 
polypeptide comprising: the amino acid sequences set forth as any one of SEQ ID NO: 
949-1896 or an amino acid sequence encoded by any one of the nucleotide sequences SEQ ID 
NOs: 1-948 or the corresponding full length or mature protein. Polypeptides of the invention 
also include polypeptides preferably with biological or immunological activity that are 
encoded by: (a) a polynucleotide having any one of the nucleotide sequences set forth in SEQ 
ID NOs: 1-948 or (b) polynucleotides encoding any one of the amino acid sequences set 
forth as SEQ ID NO: 949-1896 or (c) polynucleotides that hybridize to the complement of the 
polynucleotides of either (a) or (b) under stringent hybridization conditions. The invention 
also provides biologically active or immunologically active variants of any of the amino acid 
sequences set forth as SEQ ID NO: 949-1896 or the corresponding full length or mature 
protein; and "substantial equivalents" thereof (e.g., with at least about 65%, at least about 
70%, at least about 75%, at least about 80%, at least about 85%, 86%, 87%, 88%, 89%, at 
least about 90%, 91%, 92%, 93%, 94%, typically at least about 95%, 96%, 97%, more 
typically at least about 98%, or most typically at least about 99% amino acid identity) that 
retain biological activity. Polypeptides encoded by allelic variants may have a similar, 
increased, or decreased activity compared to polypeptides comprising SEQ ID NO: 949-1896. 
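
To make the "percent amino acid identity" thresholds above concrete, the following minimal Python sketch computes identity over two sequences that have already been aligned (gaps written as "-"). The choice of denominator (all alignment columns, including gapped ones) is one of several conventions in use and is an assumption here, as are the function names and example sequences; dedicated programs such as those named in Section 3.6.1 would normally perform both the alignment and the calculation.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns; columns gapped in both are ignored."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    # A column counts as a match only if both residues are identical non-gaps.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def is_substantial_equivalent(aligned_a: str, aligned_b: str,
                              threshold: float = 65.0) -> bool:
    """Check an identity threshold such as the 'at least about 65%' floor in the text."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Made-up ten-residue peptides; the first carries a one-residue gap.
identity = percent_identity("MKTA-IAKQR", "MKTAYIAKQR")
```

Retention of biological activity, which the text also requires, cannot be inferred from identity alone and is outside the scope of this sketch.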

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the protein 
may be in linear form or they may be cyclized using known methods, for example, as 
described in H. U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. 
McDowell, et al., J. Amer. Chem. Soc. 114, 9245-9253 (1992), both of which are 
incorporated herein by reference. Such fragments may be fused to carrier molecules such as 
immunoglobulins for many purposes, including increasing the valency of protein binding 
sites. 

The present invention also provides both full-length and mature forms (for example, 
without a signal sequence or precursor sequence) of the disclosed proteins. The protein 
coding sequence is identified in the sequence listing by translation of the disclosed nucleotide 
sequences. The mature form of such protein may be obtained by expression of a full-length 
polynucleotide in a suitable mammalian cell or other host cell. The sequence of the mature 
form of the protein is also determinable from the amino acid sequence of the full-length form. 
Where proteins of the present invention are membrane bound, soluble forms of the proteins 
are also provided. In such forms, part or all of the regions causing the proteins to be 
membrane bound are deleted so that the proteins are fully secreted from the cell in which they 
are expressed. 

Protein compositions of the present invention may further comprise an acceptable 
carrier, such as a hydrophilic, e.g., pharmaceutically acceptable, carrier. 

The present invention further provides isolated polypeptides encoded by the nucleic 
acid fragments of the present invention or by degenerate variants of the nucleic acid 
fragments of the present invention. By "degenerate variant" is intended nucleotide fragments 
which differ from a nucleic acid fragment of the present invention (e.g., an ORF) by 
nucleotide sequence but, due to the degeneracy of the genetic code, encode an identical 
polypeptide sequence. Preferred nucleic acid fragments of the present invention are the 
ORFs that encode proteins. 
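
The definition of a "degenerate variant" given above can be illustrated with a short Python sketch that translates two different nucleotide sequences using the standard genetic code and confirms that they encode the same polypeptide. The two ORFs are made-up examples, not entries from the sequence listing, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(orf: str) -> str:
    """Translate a DNA ORF codon by codon, stopping at the first stop codon."""
    orf = orf.upper()
    protein = []
    for i in range(0, len(orf) - 2, 3):
        aa = CODON_TABLE[orf[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def is_degenerate_variant(orf_a: str, orf_b: str) -> bool:
    """True if the nucleotide sequences differ but encode an identical polypeptide."""
    return orf_a.upper() != orf_b.upper() and translate(orf_a) == translate(orf_b)


orf_a = "ATGGCTTCGAACTTGTAA"  # hypothetical ORF
orf_b = "ATGGCCAGCAATCTCTGA"  # same peptide, different codons
same_protein = is_degenerate_variant(orf_a, orf_b)
```

Here every codon after the initiator ATG differs between the two ORFs, yet both translate to the same five-residue peptide, which is the situation the definition describes.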

A variety of methodologies known in the art can be utilized to obtain any one of the 
isolated polypeptides or proteins of the present invention. At the simplest level, the amino 
acid sequence can be synthesized using commercially available peptide synthesizers. The 
synthetically-constructed protein sequences, by virtue of sharing primary, secondary or 
tertiary structural and/or conformational characteristics with proteins may possess biological 
properties in common therewith, including protein activity. This technique is particularly 
useful in producing small peptides and fragments of larger polypeptides. Fragments are 
useful, for example, in generating antibodies against the native polypeptide. Thus, they may 
be employed as biologically active or immunological substitutes for natural, purified proteins 
in screening of therapeutic compounds and in immunological processes for the development 
of antibodies. 

WO 02/070539 PCT/US02/05095 

The polypeptides and proteins of the present invention can alternatively be purified 
from cells which have been altered to express the desired polypeptide or protein. As used 
herein, a cell is said to be altered to express a desired polypeptide or protein when the cell, 
through genetic manipulation, is made to produce a polypeptide or protein which it normally 
does not produce or which the cell normally produces at a lower level. One skilled in the art 
can readily adapt procedures for introducing and expressing either recombinant or synthetic 
sequences into eukaryotic or prokaryotic cells in order to generate a cell which produces one 
of the polypeptides or proteins of the present invention. 

The invention also relates to methods for producing a polypeptide comprising 
growing a culture of host cells of the invention in a suitable culture medium, and purifying 
the protein from the cells or the culture in which the cells are grown. For example, the 
methods of the invention include a process for producing a polypeptide in which a host cell 
containing a suitable expression vector that includes a polynucleotide of the invention is 
cultured under conditions that allow expression of the encoded polypeptide. The polypeptide 
can be recovered from the culture, conveniently from the culture medium, or from a lysate 
prepared from the host cells and further purified. Preferred embodiments include those in 
which the protein produced by such process is a full length or mature form of the protein. 

In an alternative method, the polypeptide or protein is purified from bacterial cells 
which naturally produce the polypeptide or protein. One skilled in the art can readily follow 
known methods for isolating polypeptides and proteins in order to obtain one of the isolated 
polypeptides or proteins of the present invention. These include, but are not limited to, 
immunochromatography, HPLC, size-exclusion chromatography, ion-exchange 
chromatography, and immuno-affinity chromatography. See, e.g., Scopes, Protein 
Purification: Principles and Practice, Springer-Verlag (1994); Sambrook, et al., in Molecular 
Cloning: A Laboratory Manual; Ausubel et al., Current Protocols in Molecular Biology. 
Polypeptide fragments that retain biological/immunological activity include fragments 
comprising greater than about 100 amino acids, or greater than about 200 amino acids, and 
fragments that encode specific protein domains. 

The purified polypeptides can be used in in vitro binding assays which are well 
known in the art to identify molecules which bind to the polypeptides. These molecules 
include, but are not limited to, small molecules, molecules from combinatorial 
libraries, antibodies or other proteins. The molecules identified in the binding assay are then 
tested for antagonist or agonist activity in in vivo tissue culture or animal models that are well 
known in the art. In brief, the molecules are titrated into a plurality of cell cultures or animals 
and then tested for either cell/animal death or prolonged survival of the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the 
peptides may be complexed with toxins, e.g., ricin or cholera, or with other compounds that 
are toxic to cells. The toxin-binding molecule complex is then targeted to a tumor or other 
cell by the specificity of the binding molecule for SEQ ID NO: 949-1896. 

The protein of the invention may also be expressed as a product of transgenic animals, 
e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are 
characterized by somatic or germ cells containing a nucleotide sequence encoding the protein. 

The proteins provided herein also include proteins characterized by amino acid 
sequences similar to those of purified proteins but into which modifications are naturally 
provided or deliberately engineered. For example, modifications, in the peptide or DNA 
sequence, can be made by those skilled in the art using known techniques. Modifications of 
interest in the protein sequences may include the alteration, substitution, replacement, 
insertion or deletion of a selected amino acid residue in the coding sequence. For example, 
one or more of the cysteine residues may be deleted or replaced with another amino acid to 
alter the conformation of the molecule. Techniques for such alteration, substitution, 
replacement, insertion or deletion are well known to those skilled in the art (see, e.g., U.S. 
Pat. No. 4,518,584). Preferably, such alteration, substitution, replacement, insertion or 
deletion retains the desired activity of the protein. Regions of the protein that are important 
for the protein function can be determined by various methods known in the art including the 
alanine-scanning method, which involves systematic substitution of single amino acids or 
strings of amino acids with alanine, followed by testing the resulting alanine-containing variant for biological 
activity. This type of analysis determines the importance of the substituted amino acid(s) in 
biological activity. Regions of the protein that are important for protein function may be 
determined by the eMATRIX program. 
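
The alanine-scanning procedure described above lends itself to a brief sketch. The Python generator below enumerates the variants that would be made and tested, replacing either single residues or short strings of residues with alanine. It covers only the enumeration step; the activity assay is experimental. The function name, the `window` parameter and the four-residue example are hypothetical.

```python
from typing import Iterator, Tuple


def alanine_scan(protein: str, window: int = 1) -> Iterator[Tuple[int, str]]:
    """Yield (1-based start position, variant) with `window` residues set to alanine."""
    for i in range(len(protein) - window + 1):
        segment = protein[i:i + window]
        if set(segment) == {"A"}:
            continue  # segment is already all alanine, so nothing would change
        yield i + 1, protein[:i] + "A" * window + protein[i + window:]


# Single-residue scan of a made-up peptide; position 3 is skipped (already alanine).
variants = list(alanine_scan("MKAC"))
```

Each variant would then be expressed and assayed; a loss of activity at a given position suggests that the substituted residue or residues contribute to function.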

Other fragments and derivatives of the sequences of proteins which would be 
expected to retain protein activity in whole or in part and are useful for screening or other 
immunological methodologies may also be easily made by those skilled in the art given the 
disclosures herein. Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of 
the invention to suitable control sequences in one or more insect expression vectors, and 
employing an insect expression system. Materials and methods for baculovirus/insect cell 
expression systems are commercially available in kit form from, e.g., Invitrogen, San Diego, 
Calif., U.S.A. (the MaxBac™ kit), and such methods are well known in the art, as described 
in Summers and Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), 
incorporated herein by reference. As used herein, an insect cell capable of expressing a 
polynucleotide of the present invention is "transformed." 

The protein of the invention may be prepared by culturing transformed host cells 
under culture conditions suitable to express the recombinant protein. The resulting expressed 
protein may then be purified from such culture (i.e., from culture medium or cell extracts) 
using known purification processes, such as gel filtration and ion exchange chromatography. 
The purification of the protein may also include an affinity column containing agents which 
will bind to the protein; one or more column steps over such affinity resins as concanavalin 
A-agarose, heparin-toyopearl™ or Cibacron blue 3GA Sepharose™; one or more steps 
involving hydrophobic interaction chromatography using such resins as phenyl ether, butyl 
ether, or propyl ether; or immunoaffinity chromatography. 

Alternatively, the protein of the invention may also be expressed in a form which will 
facilitate purification. For example, it may be expressed as a fusion protein, such as those of 
maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as 
a His tag. Kits for expression and purification of such fusion proteins are commercially 
available from New England BioLabs (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and 
Invitrogen, respectively. The protein can also be tagged with an epitope and subsequently 
purified by using a specific antibody directed to such epitope. One such epitope ("FLAG®") 
is commercially available from Kodak (New Haven, Conn.). 

Finally, one or more reverse-phase high performance liquid chromatography (RP-HPLC) 
steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl 
or other aliphatic groups, can be employed to further purify the protein. Some or all of the 
foregoing purification steps, in various combinations, can also be employed to provide a 
substantially homogeneous isolated recombinant protein. The protein thus purified is 
substantially free of other mammalian proteins and is defined in accordance with the present 
invention as an "isolated protein." 

The polypeptides of the invention include analogs (variants). This embraces 
30 fragments, as well as peptides in which one or more amino acids has been deleted, inserted, 
or substituted. Also, analogs of the polypeptides of the invention embrace fusions of the 
polypeptides or modifications of the polypeptides of the invention, wherein the polypeptide 
or analog is fused to another moiety or moieties, e.g., targeting moiety or another therapeutic 
agent. Such analogs may exhibit improved properties such as activity and/or stability. 
Examples of moieties which may be fused to the polypeptide or an analog include, for 
example, targeting moieties which provide for the delivery of polypeptide to pancreatic cells, 
e.g., antibodies to pancreatic cells, antibodies to immune cells such as T-cells, monocytes, 
dendritic cells, granulocytes, etc., as well as receptors and ligands expressed on pancreatic or 
immune cells. Other moieties which may be fused to the polypeptide include therapeutic 
agents which are used for treatment, for example, immunosuppressive drugs such as 
cyclosporin, FK506, azathioprine, CD3 antibodies and steroids. Also, polypeptides may be 
fused to immune modulators, and other cytokines such as alpha or beta interferon. 

3.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE IDENTITY 
AND SIMILARITY 

Preferred methods to determine identity and/or similarity are designed to give the largest 
match between the sequences tested. Methods to determine identity and similarity are codified 
in computer programs including, but not limited to, the GCG program package, including GAP 
(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, 
University of Wisconsin, Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, 
S.F. et al., J. Molec. Biol. 215:403-410 (1990)), PSI-BLAST (Altschul S.F. et al., Nucleic 
Acids Res. vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu 
et al., J. Comp. Biol., Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif 
software (Nevill-Manning et al., ISMB-97, Vol. 4, pp. 202-209, herein incorporated by 
reference), pFam software (Sonnhammer et al., Nucleic Acids Res., Vol. 26(1), pp. 320-322 
(1998), herein incorporated by reference) and the Kyte-Doolittle hydrophobicity prediction 
algorithm (J. Mol. Biol., 157, pp. 105-31 (1982), incorporated herein by reference). The 
BLAST programs are publicly available from the National Center for Biotechnology 
Information (NCBI) and other sources (BLAST Manual, Altschul, S., et al. NCBI NLM NIH 
Bethesda, MD 20894; Altschul, S., et al., J. Mol. Biol. 215:403-410 (1990)). 
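
By way of illustration only, the notion of a "largest match" between two sequences can be sketched as a global alignment followed by a count of identical aligned positions. The sketch below is not the GAP or BLAST implementation; the function names (global_align, percent_identity), the simple scoring values (match +1, mismatch -1, linear gap penalty -2) and the choice of alignment length as the denominator are assumptions made for this example.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment with a linear gap penalty.

    Scoring values are illustrative assumptions, not GAP/BLAST defaults.
    Returns the two aligned strings, padded with '-' at gap positions.
    """
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    def sub(i, j):
        # Substitution score for aligning a[i-1] with b[j-1].
        return match if a[i - 1] == b[j - 1] else mismatch

    # Fill the dynamic-programming matrix.
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # align the two residues
                score[i - 1][j] + gap,            # gap in b
                score[i][j - 1] + gap,            # gap in a
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Percent of alignment columns holding identical residues."""
    aln_a, aln_b = global_align(a, b)
    if not aln_a:
        return 0.0
    matches = sum(1 for x, y in zip(aln_a, aln_b) if x == y and x != "-")
    return 100.0 * matches / len(aln_a)
```

For example, under these assumed scores, percent_identity("ACGT", "AGT") aligns the sequences as ACGT over A-GT and reports 75.0. Published programs differ in scoring matrices, gap models and the denominator used, so values from this sketch are not expected to reproduce their output.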

3.7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a "chimeric 
protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to 
another polypeptide. Within a fusion protein the polypeptide according to the invention can 
correspond to all or a portion of a protein according to the invention. In one embodiment, a 
fusion protein comprises at least one biologically active portion of a protein according to the 
invention. In another embodiment, a fusion protein comprises at least two biologically active 
portions of a protein according to the invention. Within the fusion protein, the term 
"operatively linked" is intended to indicate that the polypeptide according to the invention 
and the other polypeptide are fused in-frame to each other. The polypeptide can be fused to 
the N-terminus or C-terminus, or to the middle. 

For example, in one embodiment a fusion protein comprises a polypeptide according 
to the invention operably linked to the extracellular domain of a second protein. 

In another embodiment, the fusion protein is a GST-fusion protein in which the 
polypeptide sequences of the invention are fused to the C-terminus of the GST (i.e., 
glutathione S-transferase) sequences. 

In another embodiment, the fusion protein is an immunoglobulin fusion protein in 
which the polypeptide sequences according to the invention comprise one or more domains 
fused to sequences derived from a member of the immunoglobulin protein family. The 
immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical 
compositions and administered to a subject to inhibit an interaction between a ligand and a 
protein of the invention on the surface of a cell, to thereby suppress signal transduction in 
vivo. The immunoglobulin fusion proteins can be used to affect the bioavailability of a 
cognate ligand. Inhibition of the ligand/protein interaction may be useful therapeutically both 
for the treatment of proliferative and differentiative disorders (e.g., cancer) and for 
modulating (e.g., promoting or inhibiting) cell survival. Moreover, the immunoglobulin 
fusion proteins of the invention can be used as immunogens to produce antibodies in a 
subject, to purify ligands, and in screening assays to identify molecules that inhibit the 
interaction of a polypeptide of the invention with a ligand. 

A chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 
techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction 
enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as 
appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic 
ligation. In another embodiment, the fusion gene can be synthesized by conventional 
techniques including automated DNA synthesizers. Alternatively, PCR amplification of gene 
fragments can be carried out using anchor primers that give rise to complementary overhangs 
between two consecutive gene fragments that can subsequently be annealed and reamplified 
to generate a chimeric gene sequence (see, for example, Ausubel et al. (eds.) Current 
Protocols in Molecular Biology, John Wiley & Sons, 1992). Moreover, many 
expression vectors are commercially available that already encode a fusion moiety (e.g., a 
GST polypeptide). A nucleic acid encoding a polypeptide of the invention can be cloned into 
such an expression vector such that the fusion moiety is linked in-frame to the protein of the 
invention. 
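
As a non-limiting sketch of what "in-frame" means at the sequence level, the following checks that two coding sequences (and an optional linker) can be joined without shifting the reading frame or leaving a stop codon inside the fusion. The function name fuse_in_frame, the example sequences and the use of the standard stop codons TAA, TAG and TGA are assumptions for illustration; the sketch does not model any particular vector or cloning site.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code, DNA alphabet


def fuse_in_frame(upstream_orf, downstream_orf, linker=""):
    """Join two coding sequences so that the downstream partner stays in frame.

    Hypothetical helper: removes a terminal stop codon from the upstream
    partner, requires every part to be a whole number of codons, and rejects
    fusions that contain a stop codon before the final codon.
    """
    upstream = upstream_orf.upper()
    downstream = downstream_orf.upper()
    linker = linker.upper()

    # A stop codon at the end of the upstream partner would terminate
    # translation before the downstream partner is reached.
    if upstream[-3:] in STOP_CODONS:
        upstream = upstream[:-3]

    for name, seq in (("upstream", upstream), ("linker", linker),
                      ("downstream", downstream)):
        if len(seq) % 3 != 0:
            raise ValueError(f"{name} length {len(seq)} is not a multiple of 3")

    fused = upstream + linker + downstream
    codons = [fused[i:i + 3] for i in range(0, len(fused), 3)]
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        raise ValueError("fusion contains an internal stop codon")
    return fused
```

For instance, fuse_in_frame("ATGGCCTAA", "ATGAAATGA") drops the upstream TAA and returns "ATGGCCATGAAATGA", whereas an upstream sequence of five nucleotides is rejected because it would shift the frame of the downstream partner.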

3.8 GENE THERAPY 

Mutations in a gene comprising the polynucleotides of the invention may result in loss of normal 
function of the encoded protein. The invention thus provides gene therapy to restore normal 
activity of the polypeptides of the invention; or to treat disease states involving polypeptides 
of the invention. Delivery of a functional gene encoding polypeptides of the invention to 
appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly 
viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of 
physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for example, 
Anderson, Nature, supplement to vol. 392, no. 6679, pp. 25-30 (1998). For additional reviews 
of gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989); Verma, 
Scientific American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). Introduction of 
any one of the nucleotides of the present invention or a gene encoding the polypeptides of the 
present invention can also be accomplished with extrachromosomal substrates (transient 
expression) or artificial chromosomes (stable expression). Cells may also be cultured ex vivo 
in the presence of proteins of the present invention in order to proliferate or to produce a 
desired effect on or activity in such cells. Treated cells can then be introduced in vivo for 
therapeutic purposes. Alternatively, it is contemplated that in other human disease states, 
preventing the expression of or inhibiting the activity of polypeptides of the invention will be 
useful in treating the disease states. It is contemplated that antisense therapy or gene therapy 
could be applied to negatively regulate the expression of polypeptides of the invention. 

Other methods inhibiting expression of a protein include the introduction of antisense 
molecules to the nucleic acids of the present invention, their complements, or their translated 
RNA sequences, by methods known in the art. Further, the polypeptides of the present 
invention can be inhibited by using targeted deletion methods, or the insertion of a negative 
regulatory element such as a silencer, which is tissue specific. 
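
For illustration, an antisense molecule is, at the sequence level, the reverse complement of its target. The sketch below derives a DNA antisense sequence from a target written as DNA or RNA; the function name antisense_dna and the example target are hypothetical, and practical antisense design (length, target accessibility, chemical modification) is outside the scope of this sketch.

```python
# Map each base to its Watson-Crick partner, written as DNA.
# U is accepted so that an RNA target can be supplied directly.
_COMPLEMENT = str.maketrans("ACGTU", "TGCAA")


def antisense_dna(target):
    """Return the DNA reverse complement of a DNA or RNA target, 5' to 3'."""
    seq = target.upper()
    unknown = set(seq) - set("ACGTU")
    if unknown:
        raise ValueError(f"unexpected characters in target: {sorted(unknown)}")
    return seq.translate(_COMPLEMENT)[::-1]
```

For example, antisense_dna("AUGGCC") returns "GGCCAT", which can base-pair with the target when the two strands are arranged antiparallel.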

The present invention still further provides cells genetically engineered in vivo to express 
the polynucleotides of the invention, wherein such polynucleotides are in operative association 
with a regulatory sequence heterologous to the host cell which drives expression of the 
polynucleotides in the cell. These methods can be used to increase or decrease the expression of 
the polynucleotides of the present invention. 

Knowledge of DNA sequences provided by the invention allows for modification of 
cells to permit, increase, or decrease, expression of endogenous polypeptide. Cells can be 
modified (e.g., by homologous recombination) to provide increased polypeptide expression by 
replacing, in whole or in part, the naturally occurring promoter with all or part of a heterologous 
promoter so that the cells express the protein at higher levels. The heterologous promoter is 
inserted in such a manner that it is operatively linked to the desired protein encoding sequences. 
See, for example, PCT International Publication No. WO 94/12650, PCT International 
Publication No. WO 92/20808, and PCT International Publication No. WO 91/09955. It is also 
contemplated that, in addition to heterologous promoter DNA, amplifiable marker DNA (e.g., 
ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate synthase, 
aspartate transcarbamylase, and dihydroorotase) and/or intron DNA may be inserted along with 
the heterologous promoter DNA. If linked to the desired protein coding sequence, amplification 
of the marker DNA by standard selection methods results in co-amplification of the desired 
protein coding sequences in the cells. 

In another embodiment of the present invention, cells and tissues may be engineered to 
express an endogenous gene comprising the polynucleotides of the invention under the control 
of inducible regulatory elements, in which case the regulatory sequences of the endogenous gene 
may be replaced by homologous recombination. As described herein, gene targeting can be 
used to replace a gene's existing regulatory region with a regulatory sequence isolated from a 
different gene or a novel regulatory sequence synthesized by genetic engineering methods. Such 
regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, 
negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or 
combinations of said sequences. Alternatively, sequences which affect the structure or stability 
of the RNA or protein produced may be replaced, removed, added, or otherwise modified by 
targeting. These sequences include polyadenylation signals, mRNA stability elements, splice 
sites, leader sequences for enhancing or modifying transport or secretion properties of the 
protein, or other sequences which alter or improve the function or stability of protein or RNA 
molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the 
gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion 
of a regulatory element, such as the deletion of a tissue-specific negative regulatory element. 
Alternatively, the targeting event may replace an existing element; for example, a tissue-specific 
enhancer can be replaced by an enhancer that has broader or different cell-type specificity than 
the naturally occurring elements. Here, the naturally occurring sequences are deleted and new 
sequences are added. In all cases, the identification of the targeting event may be facilitated by 
the use of one or more selectable marker genes that are contiguous with the targeting DNA, 
allowing for the selection of cells in which the exogenous DNA has integrated into the cell 
genome. The identification of the targeting event may also be facilitated by the use of one or 
more marker genes exhibiting the property of negative selection, such that the negatively 
selectable marker is linked to the exogenous DNA, but configured such that the negatively 
selectable marker flanks the targeting sequence, and such that a correct homologous 
recombination event with sequences in the host cell genome does not result in the stable 
integration of the negatively selectable marker. Markers useful for this purpose include the 
Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyl-transferase (gpt) gene. 
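
The logic of combined positive and negative selection described above can be summarized in a short sketch. It is an idealized model made for this example: it assumes that random integration carries both markers into the genome, that correct homologous recombination carries only the positive marker, and that selection is fully efficient. The names survives_double_selection and OUTCOMES are hypothetical.

```python
def survives_double_selection(has_positive_marker, has_negative_marker):
    """A cell survives only if it carries the positively selectable marker
    and has not stably integrated the negatively selectable marker."""
    return has_positive_marker and not has_negative_marker


# Idealized integration outcomes as (positive marker, negative marker).
OUTCOMES = {
    "no integration": (False, False),
    "random integration": (True, True),
    "homologous recombination": (True, False),
}


def surviving_outcomes():
    """List the outcomes that pass both selections in this idealized model."""
    return [name for name, (pos, neg) in OUTCOMES.items()
            if survives_double_selection(pos, neg)]
```

In this model only the homologous recombination outcome survives, which is why the negatively selectable marker is placed so that it flanks the targeting sequence.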

The gene targeting or gene activation techniques which can be used in accordance with 
this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to 
Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. 
PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No. 
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by reference 
herein in its entirety. 

3.9 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed or 
inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination 
are referred to as "knockout" animals. Knockout animals, preferably non-human mammals, 
can be prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. 
Transgenic animals are useful to determine the roles polypeptides of the invention play in 
biological processes, and preferably in disease states. Transgenic animals are useful as model 
systems to identify compounds that modulate lipid metabolism. Transgenic animals, 
preferably non-human mammals, are produced using methods as described in U.S. Patent No. 
5,489,743 and PCT Publication No. WO94/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of a promoter of the 
polynucleotides of the invention is either activated or inactivated to alter the level of 
expression of the polypeptides of the invention. Inactivation can be carried out using 
homologous recombination methods described above. Activation can be achieved by 
supplementing or even replacing the homologous promoter to provide for increased protein 
expression. The homologous promoter can be supplemented by insertion of one or more 
heterologous enhancer elements known to confer promoter activation in a particular tissue. 

The polynucleotides of the present invention also make possible the development, 
through, e.g., homologous recombination or knock out strategies, of animals that fail to 
express polypeptides of the invention or that express a variant polypeptide. Such animals are 
useful as models for studying the in vivo activities of polypeptide as well as for studying 
modulators of the polypeptides of the invention. 

3.10 USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit one 
or more of the uses or biological activities (including those associated with assays cited 
herein) identified herein. Uses or activities described for proteins of the present invention 
may be provided by administration or use of such proteins or of polynucleotides encoding 
such proteins (such as, for example, in gene therapies or vectors suitable for introduction of 
DNA). The mechanism underlying the particular condition or pathology will dictate whether 
the polypeptides of the invention, the polynucleotides of the invention or modulators 
(activators or inhibitors) thereof would be beneficial to the subject in need of treatment. 
Thus, "therapeutic compositions of the invention" include compositions comprising isolated 
polynucleotides (including recombinant DNA molecules, cloned genes and degenerate 
variants thereof) or polypeptides of the invention (including full length protein, mature 
protein and truncations or domains thereof), or compounds and other substances that 
modulate the overall activity of the target gene products, either at the level of target 
gene/protein expression or target protein activity. Such modulators include polypeptides, 
analogs (variants), including fragments and fusion proteins, antibodies and other binding 
proteins; chemical compounds that directly or indirectly activate or inhibit the polypeptides 
of the invention (identified, e.g., via drug screening assays as described herein); antisense 
polynucleotides and polynucleotides suitable for triple helix formation; and in particular 
antibodies or other binding partners that specifically recognize one or more epitopes of the 
polypeptides of the invention. 

The polypeptides of the present invention may likewise be involved in cellular 
activation or in one of the other physiological pathways described herein. 

3.10.1 RESEARCH USES AND UTILITIES 

The polynucleotides provided by the present invention can be used by the research 
community for various purposes. The polynucleotides can be used to express recombinant 
protein for analysis, characterization or therapeutic use; as markers for tissues in which the 
corresponding protein is preferentially expressed (either constitutively or at a particular stage 
of tissue differentiation or development or in disease states); as molecular weight markers on 
gels; as chromosome markers or tags (when labeled) to identify chromosomes or to map 
related gene positions; to compare with endogenous DNA sequences in patients to identify 
potential genetic disorders; as probes to hybridize and thus discover novel, related DNA 
sequences; as a source of information to derive PCR primers for genetic fingerprinting; as a 
probe to "subtract-out" known sequences in the process of discovering other novel 
polynucleotides; for selecting and making oligomers for attachment to a "gene chip" or other 
support, including for examination of expression patterns; to raise anti-protein antibodies 
using DNA immunization techniques; and as an antigen to raise anti-DNA antibodies or elicit 
another immune response. Where the polynucleotide encodes a protein which binds or 
potentially binds to another protein (such as, for example, in a receptor-ligand interaction), 
the polynucleotide can also be used in interaction trap assays (such as, for example, that 
described in Gyuris et al., Cell 75:791-803 (1993)) to identify polynucleotides encoding the 
other protein with which binding occurs or to identify inhibitors of the binding interaction. 

The polypeptides provided by the present invention can similarly be used in assays to 
determine biological activity, including in a panel of multiple proteins for high-throughput 
screening; to raise antibodies or to elicit another immune response; as a reagent (including the 
labeled reagent) in assays designed to quantitatively determine levels of the protein (or its 
receptor) in biological fluids; as markers for tissues in which the corresponding polypeptide 
is preferentially expressed (either constitutively or at a particular stage of tissue 
differentiation or development or in a disease state); and, of course, to isolate correlative 
receptors or ligands. Proteins involved in these binding interactions can also be used to 
screen for peptide or small molecule inhibitors or agonists of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent grade 
or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in the 
art. References disclosing such methods include without limitation "Molecular Cloning: A 
Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. 
Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular 
Cloning Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987. 

3.10.2 NUTRITIONAL USES 

Polynucleotides and polypeptides of the present invention can also be used as nutritional 
sources or supplements. Such uses include without limitation use as a protein or amino acid 
supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. 
In such cases the polypeptide or polynucleotide of the invention can be added to the feed of a 
particular organism or can be administered as a separate solid or liquid preparation, such as in 
the form of powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the 
polypeptide or polynucleotide of the invention can be added to the medium in or on which the 
microorganism is cultured. 

3.10.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 
ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, cell 
proliferation (either inducing or inhibiting) or cell differentiation (either inducing or 
inhibiting) activity or may induce production of other cytokines in certain cell populations. A 
polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Many 
protein factors discovered to date, including all known cytokines, have exhibited activity in 
one or more factor-dependent cell proliferation assays, and hence the assays serve as a 
convenient confirmation of cytokine activity. The activity of therapeutic compositions of the 
present invention is evidenced by any one of a number of routine factor dependent cell 
proliferation assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, 
B9/11, BaF3, MC9/G, M+(preB M+), 2E8, RB5, DA1, 123, T1165, HT2, CTLL2, TF-1, 
Mo7e, CMK, HUVEC, and Caco. Therapeutic compositions of the invention can be used in 
the following: 

Assays for T-cell or thymocyte proliferation include without limitation those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. 
Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; 
Bertagnolli et al., J. Immunol. 145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 
133:327-341, 1991; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992; Bowman et al., J. 
Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node cells 
or thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, 
Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan 
eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of 
mouse and human interleukin-γ, Schreiber, R. D. In Current Protocols in Immunology. J. E. 
e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells 
include, without limitation, those described in: Measurement of Human and Murine 
Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current 
Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and 
Sons, Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., Nature 
336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; 
Measurement of mouse and human interleukin 6 - Nordan, R. In Current Protocols in 
Immunology. J. E. Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; 
Smith et al., Proc. Natl. Acad. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human 
Interleukin 11 - Bennett, F., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols 
in Immunology. J. E. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991; 
Measurement of mouse and human Interleukin 9 - Ciarletta, A., Giannotti, J., Clark, S. C. and 
Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.13.1, John 
Wiley and Sons, Toronto. 1991. 

Assays for T-cell clone responses to antigens (which will identify, among others, 
proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring 
proliferation and cytokine production) include, without limitation, those described in: Current 
Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. 
Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and their cellular 
receptors; Chapter 7, Immunologic studies in Humans); Weinberger et al., Proc. Natl. Acad. 
Sci. USA 77:6091-6095, 1980; Weinberger et al., Eur. J. Immun. 11:405-411, 1981; Takai et 
al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988. 

3.10.4 STEM CELL GROWTH FACTOR ACTIVITY 

A polypeptide of the present invention may exhibit stem cell growth factor activity 
and be involved in the proliferation, differentiation and survival of pluripotent and totipotent 
stem cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells 
and/or germ line stem cells. Administration of the polypeptide of the invention to stem cells 
in vivo or ex vivo is expected to maintain and expand cell populations in a totipotential or 
pluripotential state which would be useful for re-engineering damaged or diseased tissues, 
transplantation, manufacture of bio-pharmaceuticals and the development of bio-sensors. 
The ability to produce large quantities of human cells has important working applications for 
the production of human proteins which currently must be obtained from non-human sources 
or donors; implantation of cells to treat diseases such as Parkinson's, Alzheimer's and other 
neurodegenerative diseases; tissues for grafting such as bone marrow, skin, cartilage, tendons, 
bone, muscle (including cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal 
cells and others; and organs for transplantation such as kidney, liver, pancreas (including islet 
cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or cytokines 
may be administered in combination with the polypeptide of the invention to achieve the 
desired effect, including any of the growth factors listed herein, other stem cell maintenance 
factors, and specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), 
Flt-3 ligand (Flt-3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, 
macrophage inflammatory protein 1-alpha (MIP-1-alpha), G-CSF, GM-CSF, thrombopoietin 
(TPO), platelet factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors 
and basic fibroblast growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, expansion of 
these cells in culture will facilitate the production of large quantities of mature cells. 
Techniques for culturing stem cells are known in the art and administration of polypeptides of 
the invention, optionally with other growth factors and/or cytokines, is expected to enhance 
the survival and proliferation of the stem cell populations. This can be accomplished by 
direct administration of the polypeptide of the invention to the culture medium. 
Alternatively, stromal cells transfected with a polynucleotide that encodes the polypeptide 
of the invention can be used as a feeder layer for the stem cell populations in culture or in 
vivo. Stromal support cells for feeder layers may include embryonic bone marrow 
fibroblasts, bone marrow stromal cells, fetal liver cells, or cultured embryonic fibroblasts (see 
U.S. Patent No. 5,690,926). 

Stem cells themselves can be transfected with a polynucleotide of the invention to 
induce autocrine expression of the polypeptide of the invention. This will allow for 
generation of undifferentiated totipotential/pluripotential stem cell lines that are useful as is 
or that can then be differentiated into the desired mature cell types. These stable cell lines 
can also serve as a source of undifferentiated totipotential/pluripotential mRNA to create 
cDNA libraries and templates for polymerase chain reaction experiments. These studies 
would allow for the isolation and identification of differentially expressed genes in stem cell 
populations that regulate stem cell proliferation and/or maintenance. 

Expansion and maintenance of totipotent stem cell populations will be useful in the 
treatment of many pathological conditions. For example, polypeptides of the present 
invention may be used to manipulate stem cells in culture to give rise to neuroepithelial cells 
that can be used to augment or replace cells damaged by illness, autoimmune disease, 
accidental damage or genetic disorders. The polypeptide of the invention may be useful for 
inducing the proliferation of neural cells and for the regeneration of nerve and brain tissue, 
i.e. for the treatment of central and peripheral nervous system diseases and neuropathies, as 
well as mechanical and traumatic disorders which involve degeneration, death or trauma to 
neural cells or nerve tissue. In addition, the expanded stem cell populations can also be 

5 genetically altered for gene therapy purposes and to decrease host rejection of replacement 
tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can also be 
manipulated to achieve controlled differentiation of the stem cells into more differentiated 
cell types. A broadly applicable method of obtaining pure populations of a specific 
differentiated cell type from undifferentiated stem cell populations involves the use of a 
cell-type specific promoter driving a selectable marker. The selectable marker allows only cells 
of the desired type to survive. For example, stem cells can be induced to differentiate into 
cardiomyocytes (Wobus et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. 
Invest., 98(1): 216-224, (1998)) or skeletal muscle cells (Browder, L. W. In: Principles of 
Tissue Engineering eds. Lanza et al., Academic Press (1997)). Alternatively, directed 
differentiation of stem cells can be accomplished by culturing the stem cells in the presence 
of a differentiation factor such as retinoic acid and an antagonist of the polypeptide of the 
invention which would inhibit the effects of endogenous stem cell factor activity and allow 
differentiation to proceed. 

In vitro cultures of stem cells can be used to determine if the polypeptide of the 
invention exhibits stem cell growth factor activity. Stem cells are isolated from any one of 
various cell sources (including hematopoietic stem cells and embryonic stem cells) and 
cultured on a feeder layer, as described by Thompson et al. Proc. Natl. Acad. Sci. U.S.A., 92: 
7844-7848 (1995), in the presence of the polypeptide of the invention alone or in 
combination with other growth factors or cytokines. The ability of the polypeptide of the 
invention to induce stem cell proliferation is determined by colony formation on semi-solid 
support e.g. as described by Bernstein et al., Blood, 77: 2316-2321 (1991). 

3.10.5 HEMATOPOIESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of 
hematopoiesis and, consequently, in the treatment of myeloid or lymphoid cell disorders. 
Even marginal biological activity in support of colony forming cells or of factor-dependent 
cell lines indicates involvement in regulating hematopoiesis, e.g. in supporting the growth 
and proliferation of erythroid progenitor cells alone or in combination with other cytokines, 
thereby indicating utility, for example, in treating various anemias or for use in conjunction 
with irradiation/chemotherapy to stimulate the production of erythroid precursors and/or 
erythroid cells; in supporting the growth and proliferation of myeloid cells such as 
granulocytes and monocytes/macrophages (i.e., traditional CSF activity) useful, for example, 
in conjunction with chemotherapy to prevent or treat consequent myelo-suppression; in 
supporting the growth and proliferation of megakaryocytes and consequently of platelets 
thereby allowing prevention or treatment of various platelet disorders such as 
thrombocytopenia, and generally for use in place of or complementary to platelet transfusions; 
and/or in supporting the growth and proliferation of hematopoietic stem cells which are 
capable of maturing to any and all of the above-mentioned hematopoietic cells and therefore 
find therapeutic utility in various stem cell disorders (such as those usually treated with 
transplantation, including, without limitation, aplastic anemia and paroxysmal nocturnal 
hemoglobinuria), as well as in repopulating the stem cell compartment post 
irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow 
transplantation or with peripheral progenitor cell transplantation (homologous or 
heterologous)) as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 

Suitable assays for proliferation and differentiation of various hematopoietic lines are 
cited above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 
proteins that influence embryonic differentiation hematopoiesis) include, without limitation, 
those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., 
Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 81:2903-2915, 
1993. 

Assays for stem cell survival and differentiation (which will identify, among others, 
proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: 
Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. 
R. I. Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; Hirayama 
et al., Proc. Natl. Acad. Sci. USA 89:5907-5911, 1992; Primitive hematopoietic colony 
forming cells with high proliferative potential, McNiece, I. K. and Briddell, R. A. In Culture 
of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, 
N.Y. 1994; Neben et al., Experimental Hematology 22:353-359, 1994; Cobblestone area 
forming cell assay, Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. 
eds. Vol pp. 1-21, Wiley-Liss, Inc., New York, N.Y. 1994; Long term bone marrow cultures 
in the presence of stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of 
Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, 
N.Y. 1994; Long term culture initiating cell assay, Sutherland, H. J. In Culture of 
Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, 
N.Y. 1994. 

3.10.6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, 
tendon, ligament and/or nerve tissue growth or regeneration, as well as in wound healing and 
tissue repair and replacement, and in healing of burns, incisions and ulcers. 

A polypeptide of the present invention which induces cartilage and/or bone growth in 
circumstances where bone is not normally formed has application in the healing of bone 
fractures and cartilage damage or defects in humans and other animals. Compositions of a 
polypeptide, antibody, binding partner, or other modulator of the invention may have 
prophylactic use in closed as well as open fracture reduction and also in the improved fixation 
of artificial joints. De novo bone formation induced by an osteogenic agent contributes to the 
repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and 
also is useful in cosmetic plastic surgery. 

A polypeptide of this invention may also be involved in attracting bone-forming cells, 
stimulating growth of bone-forming cells, or inducing differentiation of progenitors of 
bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 
periodontal disease, such as through stimulation of bone and/or cartilage repair or by 
blocking inflammation or processes of tissue destruction (collagenase activity, osteoclast 
activity, etc.) mediated by inflammatory processes may also be possible using the 
composition of the invention. 

Another category of tissue regeneration activity that may involve the polypeptide of 
the present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue 
or other tissue formation in circumstances where such tissue is not normally formed, has 
application in the healing of tendon or ligament tears, deformities and other tendon or 
ligament defects in humans and other animals. Such a preparation employing a 
tendon/ligament-like tissue inducing protein may have prophylactic use in preventing damage 
to tendon or ligament tissue, as well as use in the improved fixation of tendon or ligament to 
bone or other tissues, and in repairing defects to tendon or ligament tissue. De novo 
tendon/ligament-like tissue formation induced by a composition of the present invention 
contributes to the repair of congenital, trauma induced, or other tendon or ligament defects of 
other origin, and is also useful in cosmetic plastic surgery for attachment or repair of tendons 
or ligaments. The compositions of the present invention may provide an environment to attract 
tendon- or ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, 
induce differentiation of progenitors of tendon- or ligament-forming cells, or induce growth 
of tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue repair. The 
compositions of the invention may also be useful in the treatment of tendinitis, carpal tunnel 
syndrome and other tendon or ligament defects. The compositions may also include an 
appropriate matrix and/or sequestering agent as a carrier as is well known in the art. 

The compositions of the present invention may also be useful for proliferation of 
neural cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and 
peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic 
disorders, which involve degeneration, death or trauma to neural cells or nerve tissue. More 
specifically, a composition may be used in the treatment of diseases of the peripheral nervous 
system, such as peripheral nerve injuries, peripheral neuropathy and localized neuropathies, 
and central nervous system diseases, such as Alzheimer's, Parkinson's disease, Huntington's 
disease, amyotrophic lateral sclerosis, and Shy-Drager syndrome. Further conditions which 
may be treated in accordance with the present invention include mechanical and traumatic 
disorders, such as spinal cord disorders, head trauma and cerebrovascular diseases such as 
stroke. Peripheral neuropathies resulting from chemotherapy or other medical therapies may 
also be treatable using a composition of the invention. 

Compositions of the invention may also be useful to promote better or faster closure 
of non-healing wounds, including without limitation pressure ulcers, ulcers associated with 
vascular insufficiency, surgical and traumatic wounds, and the like. 

Compositions of the present invention may also be involved in the generation or 
regeneration of other tissues, such as organs (including, for example, pancreas, liver, 
intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular 
(including vascular endothelium) tissue, or for promoting the growth of cells comprising such 
tissues. Part of the desired effects may be by inhibition or modulation of fibrotic scarring 
to allow normal tissue to regenerate. A polypeptide of the present invention may also 
exhibit angiogenic activity. 

A composition of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and 
conditions resulting from systemic cytokine damage. 

A composition of the present invention may also be useful for promoting or inhibiting 
differentiation of tissues described above from precursor tissues or cells; or for inhibiting the 
growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 

Assays for tissue generation activity include, without limitation, those described in: 
International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International 
Patent Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. 
WO91/07491 (skin, endothelium). 

Assays for wound healing activity include, without limitation, those described in: 
Winter, Epidermal Wound Healing, pp. 71-112 (Maibach, H. I. and Rovee, D. T., eds.), Year 
Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. 
Dermatol 71:382-84 (1978). 

3.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or 
immune suppressing activity, including without limitation the activities for which assays are 
described herein. A polynucleotide of the invention can encode a polypeptide exhibiting such 
activities. A protein may be useful in the treatment of various immune deficiencies and 
disorders (including severe combined immunodeficiency (SCID)), e.g., in regulating (up or 
down) growth and proliferation of T and/or B lymphocytes, as well as affecting the cytolytic 
activity of NK cells and other cell populations. These immune deficiencies may be genetic or 
be caused by viral (e.g., HIV) as well as bacterial or fungal infections, or may result from 
autoimmune disorders. More specifically, infectious diseases caused by viral, bacterial, fungal 
or other infection may be treatable using a protein of the present invention, including 
infections by HIV, hepatitis viruses, herpes viruses, mycobacteria, Leishmania spp., malaria 
spp. and various fungal infections such as candidiasis. Of course, in this regard, proteins of 
the present invention may also be useful where a boost to the immune system generally may 
be desirable, i.e., in the treatment of cancer. 

Autoimmune disorders which may be treated using a protein of the present invention 
include, for example, connective tissue disease, multiple sclerosis, systemic lupus 
erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre 
syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitus, myasthenia gravis, 
graft-versus-host disease and autoimmune inflammatory eye disease. Such a protein (or 
antagonists thereof, including antibodies) of the present invention may also be useful in the 
treatment of allergic reactions and conditions (e.g., anaphylaxis, serum sickness, drug 
reactions, food allergies, insect venom allergies, mastocytosis, allergic rhinitis, 
hypersensitivity pneumonitis, urticaria, angioedema, eczema, atopic dermatitis, allergic 
contact dermatitis, erythema multiforme, Stevens-Johnson syndrome, allergic conjunctivitis, 
atopic keratoconjunctivitis, vernal keratoconjunctivitis, giant papillary conjunctivitis and 
contact allergies), such as asthma (particularly allergic asthma) or other respiratory problems. 
Other conditions, in which immune suppression is desired (including, for example, organ 
transplantation), may also be treatable using a protein (or antagonists thereof) of the present 
invention. The therapeutic effects of the polypeptides or antagonists thereof on allergic 
reactions can be evaluated by in vivo animal models such as the cumulative contact 
enhancement test (Lastbom et al., Toxicology 125: 59-66, 1998), skin prick test (Hoffmann et 
al., Allergy 54: 446-54, 1999), guinea pig skin sensitization test (Vohr et al., Arch. Toxicol. 
73: 501-9), and murine local lymph node assay (Kimber et al., J. Toxicol. Environ. Health 
53: 563-79). 

Using the proteins of the invention it may also be possible to modulate immune 
responses in a number of ways. Down regulation may be in the form of inhibiting or 
blocking an immune response already in progress or may involve preventing the induction of 
an immune response. The functions of activated T cells may be inhibited by suppressing T 
cell responses or by inducing specific tolerance in T cells, or both. Immunosuppression of T 
cell responses is generally an active, non-antigen-specific, process which requires continuous 
exposure of the T cells to the suppressive agent. Tolerance, which involves inducing 
non-responsiveness or anergy in T cells, is distinguishable from immunosuppression in that it 
is generally antigen-specific and persists after exposure to the tolerizing agent has ceased. 
Operationally, tolerance can be demonstrated by the lack of a T cell response upon 
reexposure to specific antigen in the absence of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 
limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high 
level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and 
organ transplantation and in graft-versus-host disease (GVHD). For example, blockage of T 
cell function should result in reduced tissue destruction in tissue transplantation. Typically, in 
tissue transplants, rejection of the transplant is initiated through its recognition as foreign by 
T cells, followed by an immune reaction that destroys the transplant. The administration of a 
therapeutic composition of the invention may prevent cytokine synthesis by immune cells, 
such as T cells, and thus act as an immunosuppressant. Moreover, a lack of costimulation 
may also be sufficient to anergize the T cells, thereby inducing tolerance in a subject. 
Induction of long-term tolerance by B lymphocyte antigen-blocking reagents may avoid the 
necessity of repeated administration of these blocking reagents. To achieve sufficient 
immunosuppression or tolerance in a subject, it may also be necessary to block the function 
of a combination of B lymphocyte antigens. 

The efficacy of particular therapeutic compositions in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used 
to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described 
in Lenschow et al., Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. Sci. USA, 
89:11102-11105 (1992). In addition, murine models of GVHD (see Paul ed., Fundamental 
Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to determine the 
effect of therapeutic compositions of the invention on the development of that disease. 

Blocking antigen function may also be therapeutically useful for treating autoimmune 
diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that 
are reactive against self tissue and which promote the production of cytokines and 
autoantibodies involved in the pathology of the diseases. Preventing the activation of 
autoreactive T cells may reduce or eliminate disease symptoms. Administration of reagents 
which block stimulation of T cells can be used to inhibit T cell activation and prevent 
production of autoantibodies or T cell-derived cytokines which may be involved in the 
disease process. Additionally, blocking reagents may induce antigen-specific tolerance of 
autoreactive T cells which could lead to long-term relief from the disease. The efficacy of 
blocking reagents in preventing or alleviating autoimmune disorders can be determined using 
a number of well-characterized animal models of human autoimmune diseases. Examples 
include murine experimental autoimmune encephalitis, systemic lupus erythematosus in 
MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune collagen arthritis, diabetes 
mellitus in NOD mice and BB rats, and murine experimental myasthenia gravis (see Paul ed., 
Fundamental Immunology, Raven Press, New York, 1989, pp. 840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a 
means of upregulating immune responses, may also be useful in therapy. Upregulation of 
immune responses may be in the form of enhancing an existing immune response or eliciting 
an initial immune response. For example, enhancing an immune response may be useful in 
cases of viral infection, including systemic viral diseases such as influenza, the common cold, 
and encephalitis. 

Alternatively, anti-viral immune responses may be enhanced in an infected patient by 
removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed 
APCs either expressing a peptide of the present invention or together with a stimulatory form 
of a soluble peptide of the present invention and reintroducing the in vitro activated T cells 
into the patient. Another method of enhancing anti-viral immune responses would be to 
isolate infected cells from a patient, transfect them with a nucleic acid encoding a protein of 
the present invention as described herein such that the cells express all or a portion of the 
protein on their surface, and reintroduce the transfected cells into the patient. The infected 
cells would now be capable of delivering a costimulatory signal to, and thereby activate, T 
cells in vivo. 

A polypeptide of the present invention may provide the necessary stimulation signal 
to T cells to induce a T cell mediated immune response against the transfected tumor cells. In 
addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to 
reexpress sufficient amounts of MHC class I or MHC class II molecules, can be transfected 
with nucleic acid encoding all or a portion (e.g., a cytoplasmic-domain truncated portion) 
of an MHC class I alpha chain protein and β2 microglobulin protein or an MHC class II alpha 
chain protein and an MHC class II beta chain protein to thereby express MHC class I or 
MHC class II proteins on the cell surface. Expression of the appropriate class I or class II 
MHC in conjunction with a peptide having the activity of a B lymphocyte antigen (e.g., B7-1, 
B7-2, B7-3) induces a T cell mediated immune response against the transfected tumor cell. 
Optionally, a gene encoding an antisense construct which blocks expression of an MHC class 
II associated protein, such as the invariant chain, can also be cotransfected with a DNA 
encoding a peptide having the activity of a B lymphocyte antigen to promote presentation of 
tumor associated antigens and induce tumor specific immunity. Thus, the induction of a T 
cell mediated immune response in a human subject may be sufficient to overcome 
tumor-specific tolerance in the subject. 

The activity of a protein of the invention may, among other means, be measured by 
the following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, 
D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 
78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. 
Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., 
J. Immunol. 140:508-512, 1988; Bowman et al., J. Virology 61:1992-1998; Bertagnolli et al., 
Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 1994. 

Assays for T-cell-dependent immunoglobulin responses and isotype switching (which 
will identify, among others, proteins that modulate T-cell dependent antibody responses and 
that affect Th1/Th2 profiles) include, without limitation, those described in: Maliszewski, J. 
Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, 
Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. 
Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, 
proteins that generate predominantly Th1 and CTL responses) include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, 
D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; 
Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 
1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins 
expressed by dendritic cells that activate naive T-cells) include, without limitation, those 
described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of 
Experimental Medicine 173:549-559, 1991; Macatonia et al., Journal of Immunology 
154:5071-5079, 1995; Porgador et al., Journal of Experimental Medicine 182:255-260, 1995; 
Nair et al., Journal of Virology 67:4062-4069, 1993; Huang et al., Science 264:961-965, 
1994; Macatonia et al., Journal of Experimental Medicine 169:1255-1264, 1989; Bhardwaj et 
al., Journal of Clinical Investigation 94:797-807, 1994; and Inaba et al., Journal of 
Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins 
that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte 
homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry 
13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer 
Research 53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, Journal of 
Immunology 145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et 
al., International Journal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and development 
include, without limitation, those described in: Antica et al., Blood 84:111-117, 1994; Fine et 
al., Cellular Immunology 155:111-122, 1994; Galy et al., Blood 85:2770-2778, 1995; Toki et 
al., Proc. Nat. Acad. Sci. USA 88:7548-7551, 1991. 

3.10.8 ACTIVIN/INHIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related 
activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins are characterized by their ability to stimulate 
the release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present 
invention, alone or in heterodimers with a member of the inhibin family, may be useful as a 
contraceptive based on the ability of inhibins to decrease fertility in female mammals and 
decrease spermatogenesis in male mammals. Administration of sufficient amounts of other 
inhibins can induce infertility in these mammals. Alternatively, the polypeptide of the 
invention, as a homodimer or as a heterodimer with other protein subunits of the inhibin 
group, may be useful as a fertility inducing therapeutic, based upon the ability of activin 
molecules to stimulate FSH release from cells of the anterior pituitary. See, for example, 
U.S. Pat. No. 4,798,885. A polypeptide of the invention may also be useful for advancement 
of the onset of fertility in sexually immature mammals, so as to increase the lifetime 
reproductive performance of domestic animals such as, but not limited to, cows, sheep and 
pigs. 

The activity of a polypeptide of the invention may, among other means, be measured 
by the following methods. 

Assays for activin/inhibin activity include, without limitation, those described in: Vale 
et al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et al., 
Nature 321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. Natl. 
Acad. Sci. USA 83:3091-3095, 1986. 

3.10.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY 

A polypeptide of the present invention may be involved in chemotactic or 
chemokinetic activity for mammalian cells, including, for example, monocytes, fibroblasts, 
neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A 
polynucleotide of the invention can encode a polypeptide exhibiting such attributes. 
Chemotactic and chemokinetic receptor activation can be used to mobilize or attract a desired 
cell population to a desired site of action. Chemotactic or chemokinetic compositions (e.g. 
proteins, antibodies, binding partners, or modulators of the invention) provide particular 
advantages in treatment of wounds and other trauma to tissues, as well as in treatment of 
localized infections. For example, attraction of lymphocytes, monocytes or neutrophils to 
tumors or sites of infection may result in improved immune responses against the tumor or 
infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it can 
stimulate, directly or indirectly, the directed orientation or movement of such cell population. 
Preferably, the protein or peptide has the ability to directly stimulate directed movement of 
cells. Whether a particular protein has chemotactic activity for a population of cells can be 
readily determined by employing such protein or peptide in any known assay for cell 
chemotaxis. 

Therapeutic compositions of the invention can be used in the following: 

Assays for chemotactic activity (which will identify proteins that induce or prevent 
chemotaxis) consist of assays that measure the ability of a protein to induce the migration of 
cells across a membrane as well as the ability of a protein to induce the adhesion of one cell 
population to another cell population. Suitable assays for movement and adhesion include, 
without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, 
A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing 
Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta 
Chemokines 6.12.1-6.12.28); Taub et al., J. Clin. Invest. 95:1370-1376, 1995; Lind et al., 
APMIS 103:140-146, 1995; Muller et al., Eur. J. Immunol. 25:1744-1748; Gruber et al., J. of 
Immunol. 152:5860-5867, 1994; Johnston et al., J. of Immunol. 153:1762-1768, 1994. 

3.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostasis or thrombolysis or 
thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such 
attributes. Compositions may be useful in treatment of various coagulation disorders 
(including hereditary disorders, such as hemophilias) or to enhance coagulation and other 
hemostatic events in treating wounds resulting from trauma, surgery or other causes. A 
composition of the invention may also be useful for dissolving or inhibiting formation of 
thromboses and for treatment and prevention of conditions resulting therefrom (such as, for 
example, infarction of cardiac and central nervous system vessels (e.g., stroke)). 

Therapeutic compositions of the invention can be used in the following: 

Assays for hemostatic and thrombolytic activity include, without limitation, those 
described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis 
Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 
35:467-474, 1988. 

3.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, proliferation 
or metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the 
invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. 
For example, the presence or increased expression of a polynucleotide/polypeptide of the 
invention may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing 
malignancy. Conversely, a defect in the gene or absence of the polypeptide may be 
associated with a cancer condition. Identification of single nucleotide polymorphisms 
associated with cancer or a predisposition to cancer may also be useful for diagnosis or 
prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 
inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor 
growth) and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. 
Therapeutic compositions of the invention may be effective in adult and pediatric oncology 
including in solid phase tumors/malignancies, locally advanced tumors, human soft tissue 
sarcomas, metastatic cancer, including lymphatic metastases, blood cell malignancies 
including multiple myeloma, acute and chronic leukemias, and lymphomas, head and neck 
cancers including mouth cancer, larynx cancer and thyroid cancer, lung cancers including 
small cell carcinoma and non-small cell cancers, breast cancers including small cell 
carcinoma and ductal carcinoma, gastrointestinal cancers including esophageal cancer, 
stomach cancer, colon cancer, colorectal cancer and polyps associated with colorectal 
neoplasia, pancreatic cancers, liver cancer, urologic cancers including bladder cancer and 
prostate cancer, malignancies of the female genital tract including ovarian carcinoma, uterine 
(including endometrial) cancers, and solid tumor in the ovarian follicle, kidney cancers 
including renal cell carcinoma, brain cancers including intrinsic brain tumors, neuroblastoma, 
astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central nervous system, 
bone cancers including osteomas, skin cancers including malignant melanoma, tumor 
progression of human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, 
hemangiopericytoma and Kaposi's sarcoma. 

Polypeptides, polynucleotides, or modulators of polypeptides of the invention 
(including inhibitors and stimulators of the biological activity of the polypeptide of the 
invention) may be administered to treat cancer. Therapeutic compositions can be 
administered in therapeutically effective dosages alone or in combination with adjuvant 
cancer therapy such as surgery, chemotherapy, radiotherapy, thermotherapy, and laser 
therapy, and may provide a beneficial effect, e.g. reducing tumor size, slowing rate of tumor 
growth, inhibiting metastasis, or otherwise improving overall clinical condition, without 
necessarily eradicating the cancer. 

The composition can also be administered in therapeutically effective amounts as a 
portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or 
modulator of the invention with one or more anti-cancer drugs in addition to a 
pharmaceutically acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer 
treatment is routine. Anti-cancer drugs that are well known in the art and can be used as a 
treatment in combination with the polypeptide or modulator of the invention include: 
Actinomycin D, Aminoglutethimide, Asparaginase, Bleomycin, Busulfan, Carboplatin, 
Carmustine, Chlorambucil, Cisplatin (cis-DDP), Cyclophosphamide, Cytarabine HCl 
(Cytosine arabinoside), Dacarbazine, Dactinomycin, Daunorubicin HCl, Doxorubicin HCl, 
Estramustine phosphate sodium, Etoposide (VP16-213), Floxuridine, 5-Fluorouracil (5-FU), 
Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, Interferon Alpha-2a, Interferon 
Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), Lomustine, Mechlorethamine 
HCl (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, Methotrexate (MTX), 
Mitomycin, Mitoxantrone HCl, Octreotide, Plicamycin, Procarbazine HCl, Streptozocin, 
Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate, 
Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin, 
Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for prophylactic 
treatment of cancer. There are hereditary conditions and/or environmental situations (e.g. 
exposure to carcinogens) known in the art that predispose an individual to developing 
cancers. Under these circumstances, it may be beneficial to treat these individuals with 
therapeutically effective doses of the polypeptide of the invention to reduce the risk of 
developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of the 
invention as a potential cancer treatment. These in vitro models include proliferation assays 
of cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1987) 
Culture of Animal Cells: A Manual of Basic Technique, Wiley-Liss, New York, NY Ch 18 
and Ch 21), tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst., 
52: 921-30 (1974), mobility and invasive potential of tumor cells in Boyden Chamber assays 
as described in Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays 
such as induction of vascularization of the chick chorioallantoic membrane or induction of 
vascular endothelial cell migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1189-97 
(1999) and Li et al., Clin. Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor 
cell lines are available, e.g. from American Type Tissue Culture Collection catalogs. 

3.10.12 RECEPTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, 
receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of the 
invention can encode a polypeptide exhibiting such characteristics. Examples of such 
receptors and ligands include, without limitation, cytokine receptors and their ligands, 
receptor kinases and their ligands, receptor phosphatases and their ligands, receptors involved 
in cell-cell interactions and their ligands (including without limitation, cellular adhesion 
molecules (such as selectins, integrins and their ligands) and receptor/ligand pairs involved in 
antigen presentation, antigen recognition and development of cellular and humoral immune 
responses). Receptors and ligands are also useful for screening of potential peptide or small 
molecule inhibitors of the relevant receptor/ligand interaction. A protein of the present 
invention (including, without limitation, fragments of receptors and ligands) may itself 
be useful as an inhibitor of receptor/ligand interactions. 

The activity of a polypeptide of the invention may, among other means, be measured 
by the following methods: 

Suitable assays for receptor-ligand activity include without limitation those described 
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. 
Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static conditions 7.28.1- 
7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., J. Exp. 
Med. 168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 169:149-160, 1989; Stoltenborg et 
al., J. Immunol. Methods 175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995. 

By way of example, the polypeptides of the invention may be used as a receptor for a 
ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be 
identified through binding assays, affinity chromatography, dihybrid screening assays, 
BIAcore assays, gel overlay assays, or other methods known in the art. 

Studies characterizing drugs or proteins as agonists, antagonists, partial agonists or 
partial antagonists require the use of other proteins as competing ligands. The polypeptides 
of the present invention or ligand(s) thereof may be labeled by being coupled to 
radioisotopes, colorimetric molecules or toxin molecules by conventional methods ("Guide 
to Protein Purification" Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 (1990) 
Academic Press, Inc. San Diego). Examples of radioisotopes include, but are not limited to, 
tritium and carbon-14. Examples of colorimetric molecules include, but are not limited to, 
fluorescent molecules such as fluorescamine, or rhodamine or other colorimetric molecules. 
Examples of toxins include, but are not limited to, ricin. 

3.10.13 DRUG SCREENING 

This invention is particularly useful for screening chemical compounds by using the 
novel polypeptides or binding fragments thereof in any of a variety of drug screening 
techniques. The polypeptides or fragments employed in such a test may either be free in 
solution, affixed to a solid support, borne on a cell surface or located intracellularly. One 
method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the polypeptide or a fragment 
thereof. Drugs are screened against such transformed cells in competitive binding assays. 
Such cells, either in viable or fixed form, can be used for standard binding assays. One may 
measure, for example, the formation of complexes between polypeptides of the invention or 
fragments and the agent being tested or examine the diminution in complex formation 
between the novel polypeptides and an appropriate cell line, which are well known in the art. 

Sources for test compounds that may be screened for ability to bind to or modulate 
(i.e., increase or decrease) the activity of polypeptides of the invention include (1) inorganic 
and organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of either random or mimetic peptides, oligonucleotides or organic molecules. 

Chemical libraries may be readily synthesized or purchased from a number of 
commercial sources, and may include structural analogs of known compounds or compounds 
that are identified as "hits" or "leads" via natural product screening. 

The sources of natural product libraries are microorganisms (including bacteria and 
fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for 
screening may be created by: (1) fermentation and extraction of broths from soil, plant or 
marine microorganisms or (2) extraction of the organisms themselves. Natural product 
libraries include polyketides, non-ribosomal peptides, and (non-naturally occurring) variants 
thereof. For a review, see Science 282:63-68 (1998). 

Combinatorial libraries are composed of large numbers of peptides, oligonucleotides 
or organic compounds and can be readily prepared by traditional automated synthesis 
methods, PCR, cloning or proprietary synthetic methods. Of particular interest are peptide 
and oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, 
protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide 
libraries. For a review of combinatorial chemistry and libraries created therefrom, see Myers, 
Curr. Opin. Biotechnol. 8:701-707 (1997). For reviews and examples of peptidomimetic 
libraries, see Al-Obeidi et al., Mol. Biotechnol, 9(3):205-23 (1998); Hruby et al., Curr Opin 
Chem Biol, 1(1):114-19 (1997); Dorner et al., Bioorg Med Chem, 4(5):709-15 (1996) 
(alkylated dipeptides). 

Identification of modulators through use of the various libraries described herein 
permits modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to 
bind a polypeptide of the invention. The molecules identified in the binding assay are then 
tested for antagonist or agonist activity in in vivo tissue culture or animal models that are well 
known in the art. In brief, the molecules are titrated into a plurality of cell cultures or animals 
and then tested for either cell/animal death or prolonged survival of the animal/cells. 

The binding molecules thus identified may be complexed with toxins, e.g., ricin or 
cholera, or with other compounds that are toxic to cells such as radioisotopes. The 
toxin-binding molecule complex is then targeted to a tumor or other cell by the specificity of 
the binding molecule for a polypeptide of the invention. Alternatively, the binding molecules 
may be complexed with imaging agents for targeting and imaging purposes. 

3.10.14 ASSAY FOR RECEPTOR ACTIVITY 

The invention also provides methods to detect specific binding of a polypeptide, e.g., a 
ligand or a receptor. The art provides numerous assays particularly useful for identifying 
previously unknown binding partners for receptor polypeptides of the invention. For 
example, expression cloning using mammalian or bacterial cells, or dihybrid screening assays 
can be used to identify polynucleotides encoding binding partners. As another example, 
affinity chromatography with the appropriate immobilized polypeptide of the invention can 
be used to isolate polypeptides that recognize and bind polypeptides of the invention. There 
are a number of different libraries used for the identification of compounds, and in particular 
small molecules, that modulate (i.e., increase or decrease) biological activity of a polypeptide 
of the invention. Ligands for receptor polypeptides of the invention can also be identified by 
adding exogenous ligands, or cocktails of ligands, to two cell populations that are genetically 
identical except for the expression of the receptor of the invention: one cell population 
expresses the receptor of the invention whereas the other does not. The responses of the two 
cell populations to the addition of ligand(s) are then compared. Alternatively, an expression 
library can be co-expressed with the polypeptide of the invention in cells and assayed for an 
autocrine response to identify potential ligand(s). As still another example, BIAcore assays, 
gel overlay assays, or other methods known in the art can be used to identify binding partner 
polypeptides, including (1) organic and inorganic chemical libraries, (2) natural product 
libraries, and (3) combinatorial libraries comprised of random peptides, oligonucleotides or 
organic molecules. 
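
The comparison of two otherwise identical cell populations described above can be sketched in a few lines of Python. This is only an illustrative sketch: the function name `differential_ligands`, the fold-change threshold `min_fold`, and the readout values are assumptions introduced here, not part of the described method, and the numbers are placeholders rather than experimental data.

```python
from statistics import mean


def differential_ligands(receptor_cells, control_cells, min_fold=2.0):
    """Return ligands whose mean response in receptor-expressing cells is
    at least `min_fold` times the mean response in the control cells.

    Both arguments map a ligand name to a list of replicate readouts.
    The threshold is a hypothetical choice for illustration only.
    """
    hits = []
    for ligand, readings in receptor_cells.items():
        control = mean(control_cells[ligand])
        test = mean(readings)
        # Skip ligands with no measurable control response to avoid dividing by zero
        if control > 0 and test / control >= min_fold:
            hits.append(ligand)
    return sorted(hits)


# Placeholder readouts in arbitrary units (not experimental data)
receptor_cells = {"ligand_A": [10.0, 12.0], "ligand_B": [3.0, 3.2]}
control_cells = {"ligand_A": [2.0, 2.0], "ligand_B": [3.0, 3.0]}

candidates = differential_ligands(receptor_cells, control_cells)
print(candidates)
```

A ligand that elicits a response only in the receptor-expressing population is flagged as a candidate; a ligand that affects both populations equally is not.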

The role of downstream intracellular signaling molecules in the signaling cascade of 
the polypeptide of the invention can be determined. For example, a chimeric protein in which 
the cytoplasmic domain of the polypeptide of the invention is fused to the extracellular 
portion of a protein, whose ligand has been identified, is produced in a host cell. The cell is 
then incubated with the ligand specific for the extracellular portion of the chimeric protein, 
thereby activating the chimeric receptor. Known downstream proteins involved in 
intracellular signaling can then be assayed for expected modifications, e.g., phosphorylation. 
Other methods known to those in the art can also be used to identify signaling molecules 
involved in receptor activity. 

3.10.15 ANTI-INFLAMMATORY ACTIVITY 

Compositions of the present invention may also exhibit anti-inflammatory activity. 
The anti-inflammatory activity may be achieved by providing a stimulus to cells involved in 
the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for 
example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the 
inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or 
suppressing production of other factors which more directly inhibit or promote an 
inflammatory response. Compositions with such activities can be used to treat inflammatory 
conditions (including chronic or acute conditions), including without limitation inflammation 
associated with infection (such as septic shock, sepsis or systemic inflammatory response 
syndrome (SIRS)), ischemia-reperfusion injury, endotoxin lethality, arthritis, 
complement-mediated hyperacute rejection, nephritis, cytokine or chemokine-induced lung 
injury, inflammatory bowel disease, Crohn's disease or resulting from over production of 
cytokines such as TNF or IL-1. Compositions of the invention may also be useful to treat 
anaphylaxis and hypersensitivity to an antigenic substance or material. Compositions of this 
invention may be utilized to prevent or treat conditions such as, but not limited to, sepsis, 
acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid arthritis, chronic 
inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, graft versus host 
disease, inflammatory bowel disease, inflammation associated with pulmonary disease, other 
autoimmune disease or inflammatory disease, or as an antiproliferative agent such as for acute or 
chronic myelogenous leukemia or in the prevention of premature labor secondary to 
intrauterine infections. 

3.10.16 LEUKEMIAS 

Leukemias and related disorders may be treated or prevented by administration of a 
therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of 
the invention. Such leukemias and related disorders include but are not limited to acute 
leukemia, acute lymphocytic leukemia, acute myelocytic leukemia, myeloblastic, 
promyelocytic, myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic 
myelocytic (granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such 
disorders, see Fishman et al., 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 

3.10.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efficacy of 
intervention with compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an indication of 
therapeutic utility, include but are not limited to nervous system injuries, and diseases or 
disorders which result in either a disconnection of axons, a diminution or degeneration of 
neurons, or demyelination. Nervous system lesions which may be treated in a patient 
(including human and non-human mammalian patients) according to the invention include 
but are not limited to the following lesions of either the central (including spinal cord, brain) 
or peripheral nervous systems: 

(i) traumatic lesions, including lesions caused by physical injury or associated 
with surgery, for example, lesions which sever a portion of the nervous system, or 
compression injuries; 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system 
results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 
infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 
injured as a result of infection, for example, by an abscess or associated with infection by 
human immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, 
tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or 
injured as a result of a degenerative process including but not limited to degeneration 
associated with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or 
amyotrophic lateral sclerosis; 

(v) lesions associated with nutritional diseases or disorders, in which a portion of 
the nervous system is destroyed or injured by a nutritional disorder or disorder of metabolism 
including but not limited to, vitamin B12 deficiency, folic acid deficiency, Wernicke disease, 
tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary degeneration of the 
corpus callosum), and alcoholic cerebellar degeneration; 

(vi) neurological lesions associated with systemic diseases including but not 
limited to diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, 
carcinoma, or sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 
neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is destroyed or 
injured by a demyelinating disease including but not limited to multiple sclerosis, human 
immunodeficiency virus-associated myelopathy, transverse myelopathy of various etiologies, 
progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 

Therapeutics which are useful according to the invention for treatment of a nervous 
system disorder may be selected by testing for biological activity in promoting the survival or 
differentiation of neurons. For example, and not by way of limitation, therapeutics which 
elicit any of the following effects may be useful according to the invention: 

(i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, 
e.g., choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 

(iv) decreased symptoms of neuron dysfunction in vivo. 

Such effects may be measured by any method known in the art. In preferred, 
non-limiting embodiments, increased survival of neurons may be measured by the method set 
forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons 
may be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or 
Brown et al. (1981, Ann. Rev. Neurosci. 4:17-42); increased production of neuron-associated 
molecules may be measured by bioassay, enzymatic assay, antibody binding, Northern blot 
assay, etc., depending on the molecule to be measured; and motor neuron dysfunction may be 
measured by assessing the physical manifestation of motor neuron disorder, e.g., weakness, 
motor neuron conduction velocity, or functional disability. 

In specific embodiments, motor neuron disorders that may be treated according to the 
invention include but are not limited to disorders such as infarction, infection, exposure to 
toxin, trauma, surgical damage, degenerative disease or malignancy that may affect motor 
neurons as well as other components of the nervous system, as well as disorders that 
selectively affect neurons such as amyotrophic lateral sclerosis, and including but not limited 
to progressive spinal muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, 
infantile and juvenile muscular atrophy, progressive bulbar paralysis of childhood 
(Fazio-Londe syndrome), poliomyelitis and the post polio syndrome, and Hereditary 
Motorsensory Neuropathy (Charcot-Marie-Tooth Disease). 

3.10.18 OTHER ACTIVITIES 

A polypeptide of the invention may also exhibit one or more of the following 
additional activities or effects: inhibiting the growth, infection or function of, or killing, 
infectious agents, including, without limitation, bacteria, viruses, fungi and other parasites; 
affecting (suppressing or enhancing) bodily characteristics, including, without limitation, 
height, weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or 
organ or body part size or shape (such as, for example, breast augmentation or diminution, 
change in bone form or shape); affecting biorhythms or circadian cycles or rhythms; affecting 
the fertility of male or female subjects; affecting the metabolism, catabolism, anabolism, 
processing, utilization, storage or elimination of dietary fat, lipid, protein, carbohydrate, 
vitamins, minerals, co-factors or other nutritional factors or components; affecting 
behavioral characteristics, including, without limitation, appetite, libido, stress, cognition 
(including cognitive disorders), depression (including depressive disorders) and violent 
behaviors; providing analgesic effects or other pain reducing effects; promoting 
differentiation and growth of embryonic stem cells in lineages other than hematopoietic 
lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies of 
the enzyme and treating deficiency-related diseases; treatment of hyperproliferative disorders 
(such as, for example, psoriasis); immunoglobulin-like activity (such as, for example, the 
ability to bind antigens or complement); and the ability to act as an antigen in a vaccine 
composition to raise an immune response against such protein or another material or entity 
which is cross-reactive with such protein. 

3.10.19 IDENTIFICATION OF POLYMORPHISMS 

The demonstration of polymorphisms makes possible the identification of such 
polymorphisms in human subjects and the pharmacogenetic use of this information for 
diagnosis and treatment. Such polymorphisms may be associated with, e.g., differential 
predisposition or susceptibility to various disease states (such as disorders involving 
inflammation or immune response) or a differential response to drug administration, and this 
genetic information can be used to tailor preventive or therapeutic treatment appropriately. 
For example, the existence of a polymorphism associated with a predisposition to 
inflammation or autoimmune disease makes possible the diagnosis of this condition in 
humans by identifying the presence of the polymorphism. 

Polymorphisms can be identified in a variety of ways known in the art which all 
generally involve obtaining a sample from a patient, analyzing DNA from the sample, 
optionally involving isolation or amplification of the DNA, and identifying the presence of 
the polymorphism in the DNA. For example, PCR may be used to amplify an appropriate 
fragment of genomic DNA which may then be sequenced. Alternatively, the DNA may be 
subjected to allele-specific oligonucleotide hybridization (in which appropriate 
oligonucleotides are hybridized to the DNA under conditions permitting detection of a single 
base mismatch) or to a single nucleotide extension assay (in which an oligonucleotide that 
hybridizes immediately adjacent to the position of the polymorphism is extended with one or 
more labeled nucleotides). In addition, traditional restriction fragment length polymorphism 
analysis (using restriction enzymes that provide differential digestion of the genomic DNA 
depending on the presence or absence of the polymorphism) may be performed. Arrays with 
nucleotide sequences of the present invention can be used to detect polymorphisms. The 
array can comprise modified nucleotide sequences of the present invention in order to detect 
the nucleotide sequences of the present invention. In the alternative, any one of the 
nucleotide sequences of the present invention can be placed on the array to detect changes 
from those sequences. 

Alternatively, a polymorphism resulting in a change in the amino acid sequence could 
also be detected by detecting a corresponding change in amino acid sequence of the protein, 
e.g., by an antibody specific to the variant sequence. 
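
As a rough illustration of two of the approaches mentioned above (sequence comparison after amplification, and restriction fragment length polymorphism analysis), the following Python sketch compares a reference sequence with a sample sequence and checks whether a restriction recognition site is gained or lost. The sequences are made-up placeholders, the function names are hypothetical, and the sketch assumes the two sequences are already aligned and of equal length; EcoRI (recognition sequence GAATTC) is used only as a familiar example enzyme.

```python
def find_substitutions(reference, sample):
    """Return (1-based position, reference base, sample base) for each
    single-base difference between two pre-aligned, equal-length sequences."""
    if len(reference) != len(sample):
        raise ValueError("sequences must be pre-aligned and of equal length")
    return [
        (i + 1, r, s)
        for i, (r, s) in enumerate(zip(reference.upper(), sample.upper()))
        if r != s
    ]


def recognition_sites(sequence, site):
    """Return 0-based start positions at which a restriction enzyme
    recognition sequence occurs in `sequence`."""
    sequence, site = sequence.upper(), site.upper()
    return [
        i
        for i in range(len(sequence) - len(site) + 1)
        if sequence[i:i + len(site)] == site
    ]


# Hypothetical sequences for illustration only
reference = "ATCGGAATTCTTAGC"
sample = "ATCGGAGTTCTTAGC"

variants = find_substitutions(reference, sample)
ref_sites = recognition_sites(reference, "GAATTC")  # EcoRI recognition sequence
sample_sites = recognition_sites(sample, "GAATTC")

print(variants)      # single-base differences
print(ref_sites)     # sites present in the reference
print(sample_sites)  # sites present in the sample
```

In this toy case the single substitution removes the recognition site from the sample, which is the kind of difference that would alter the digestion pattern in a restriction fragment analysis.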

3.10.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against 
rheumatoid arthritis are determined in an experimental animal model system. The experimental 
model system is adjuvant induced arthritis in rats, and the protocol is described by J. 
Holoshitz, et al., 1983, Science, 219:56, or by B. Waksman et al., 1963, Int. Arch. Allergy 
Appl. Immunol., 23:129. Induction of the disease can be caused by a single injection, 
generally intradermally, of a suspension of killed Mycobacterium tuberculosis in complete 
Freund's adjuvant (CFA). The route of injection can vary, but rats may be injected at the base 
of the tail with an adjuvant mixture. The polypeptide is administered in phosphate buffered 
saline (PBS) at a dose of about 1-5 mg/kg. The control consists of administering PBS only. 

The procedure for testing the effects of the test compound would consist of 
intradermally injecting killed Mycobacterium tuberculosis in CFA followed by immediately 
administering the test compound and subsequent treatment every other day until day 24. At 
14, 15, 18, 20, 22, and 24 days after injection of Mycobacterium in CFA, an overall arthritis 
score may be obtained as described by J. Holoshitz above. An analysis of the data would 
reveal that the test compound would have a dramatic effect on the swelling of the joints as 
measured by a decrease of the arthritis score. 
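
One way the score comparison described above might be tabulated is sketched below in Python. The data layout, the function names, and the score values are assumptions made purely for illustration (the values are synthetic placeholders, not results); only the scoring days are taken from the text.

```python
from statistics import mean

# Scoring days after injection, as listed in the text
SCORING_DAYS = (14, 15, 18, 20, 22, 24)


def mean_scores_by_day(scores):
    """`scores` maps an animal id to {day: arthritis score}.
    Returns {day: mean score across animals}."""
    return {
        day: mean(animal[day] for animal in scores.values())
        for day in SCORING_DAYS
    }


def score_reduction(treated, control):
    """Return {day: control mean minus treated mean}.
    A positive value means a lower score in the treated group."""
    t = mean_scores_by_day(treated)
    c = mean_scores_by_day(control)
    return {day: c[day] - t[day] for day in SCORING_DAYS}


# Synthetic placeholder scores (not experimental data)
control = {f"rat_c{i}": {d: 4 for d in SCORING_DAYS} for i in range(3)}
treated = {f"rat_t{i}": {d: 2 for d in SCORING_DAYS} for i in range(3)}

reduction = score_reduction(treated, control)
print(reduction)
```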



3.11 THERAPEUTIC METHODS 

The compositions (including polypeptide fragments, analogs, variants and antibodies 
or other binding partners or modulators including antisense polynucleotides) of the invention 
have numerous applications in a variety of therapeutic methods. Examples of therapeutic 
applications include, but are not limited to, those exemplified herein. 

3.11.1 EXAMPLE 

One embodiment of the invention is the administration of an effective amount of the 
polypeptides or other composition of the invention to individuals affected by a disease or 
disorder that can be modulated by regulating the peptides of the invention. While the mode 
of administration is not particularly important, parenteral administration is preferred. An 
exemplary mode of administration is to deliver an intravenous bolus. The dosage of the 
polypeptides or other composition of the invention will normally be determined by the 
prescribing physician. It is to be expected that the dosage will vary according to the age, 
weight, condition and response of the individual patient. Typically, the amount of 
polypeptide administered per dose will be in the range of about 0.01 µg/kg to 100 mg/kg of 
body weight, with the preferred dose being about 0.1 µg/kg to 10 mg/kg of patient body 
weight. For parenteral administration, polypeptides of the invention will be formulated in an 
injectable form combined with a pharmaceutically acceptable parenteral vehicle. Such 
vehicles are well known in the art and examples include water, saline, Ringer's solution, 
dextrose solution, and solutions consisting of small amounts of the human serum albumin. 
The vehicle may contain minor amounts of additives that maintain the isotonicity and 
stability of the polypeptide or other active ingredient. The preparation of such solutions is 
within the skill of the art. 
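
The per-dose ranges stated above scale linearly with body weight. The short Python sketch below works through that arithmetic, reading the ranges as about 0.01 µg/kg to 100 mg/kg (typical) and about 0.1 µg/kg to 10 mg/kg (preferred). The 70 kg body weight and the function name are assumptions chosen only for illustration; as the text notes, the actual dosage is determined by the prescribing physician.

```python
UG_PER_MG = 1000.0

# Ranges from the text, expressed in micrograms per kilogram of body weight
TYPICAL_RANGE_UG_PER_KG = (0.01, 100 * UG_PER_MG)
PREFERRED_RANGE_UG_PER_KG = (0.1, 10 * UG_PER_MG)


def dose_range_ug(weight_kg, range_ug_per_kg):
    """Return the (low, high) per-dose amounts in micrograms for a given
    body weight, by scaling a per-kilogram range linearly."""
    if weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = range_ug_per_kg
    return (low * weight_kg, high * weight_kg)


# Hypothetical 70 kg patient, for arithmetic illustration only
typical = dose_range_ug(70.0, TYPICAL_RANGE_UG_PER_KG)
preferred = dose_range_ug(70.0, PREFERRED_RANGE_UG_PER_KG)

print(typical)    # roughly 0.7 µg up to 7,000,000 µg (7 g)
print(preferred)  # roughly 7 µg up to 700,000 µg (700 mg)
```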

3.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF ADMINISTRATION 

A protein or other composition of the present invention (from whatever source 
derived, including without limitation from recombinant and non-recombinant sources and 
including antibodies and other binding partners of the polypeptides of the invention) may be 
administered to a patient in need, by itself, or in pharmaceutical compositions where it is 
mixed with suitable carriers or excipient(s) at doses to treat or ameliorate a variety of 
disorders. Such a composition may optionally contain (in addition to protein or other active 
ingredient and a carrier) diluents, fillers, salts, buffers, stabilizers, solubilizers, and other 
materials well known in the art. The term "pharmaceutically acceptable" means a non-toxic 
material that does not interfere with the effectiveness of the biological activity of the active 
ingredient(s). The characteristics of the carrier will depend on the route of administration. 
The pharmaceutical composition of the invention may also contain cytokines, lymphokines, 
or other hematopoietic factors such as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, 
IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IFN, TNF0, TNF1, TNF2, 
G-CSF, Meg-CSF, thrombopoietin, stem cell factor, and erythropoietin. In further 
compositions, proteins of the invention may be combined with other agents beneficial to the 
treatment of the disease or disorder in question. These agents include various growth factors 
such as epidermal growth factor (EGF), platelet-derived growth factor (PDGF), transforming 
growth factors (TGF-α and TGF-β), insulin-like growth factor (IGF), as well as cytokines 
described herein. 

The pharmaceutical composition may further contain other agents which either 
enhance the activity of the protein or other active ingredient or complement its activity or use 
in treatment. Such additional factors and/or agents may be included in the pharmaceutical 
composition to produce a synergistic effect with protein or other active ingredient of the 
invention, or to minimize side effects. Conversely, protein or other active ingredient of the
present invention may be included in formulations of the particular clotting factor, cytokine,
lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or
anti-inflammatory agent to minimize side effects of the clotting factor, cytokine, lymphokine,
other hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-inflammatory agent
(such as IL-1Ra, IL-1 Hy1, IL-1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents).
A protein of the present invention may be active in multimers (e.g., heterodimers or 
homodimers) or complexes with itself or other proteins. As a result, pharmaceutical 
compositions of the invention may comprise a protein of the invention in such multimeric or 
complexed form. 

As an alternative to being included in a pharmaceutical composition of the invention 
including a first protein, a second protein or a therapeutic agent may be concurrently 
administered with the first protein (e.g., at the same time, or at differing times provided that 
therapeutic concentrations of the combination of agents are achieved at the treatment site).
Techniques for formulation and administration of the compounds of the instant application 
may be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, 
latest edition. A therapeutically effective dose further refers to that amount of the compound 
sufficient to result in amelioration of symptoms, e.g., treatment, healing, prevention or 
amelioration of the relevant medical condition, or an increase in rate of treatment, healing, 
prevention or amelioration of such conditions. When applied to an individual active 
ingredient, administered alone, a therapeutically effective dose refers to that ingredient alone. 
When applied to a combination, a therapeutically effective dose refers to combined amounts 
of the active ingredients that result in the therapeutic effect, whether administered in 
combination, serially or simultaneously. 
In practicing the method of treatment or use of the present invention, a therapeutically
effective amount of protein or other active ingredient of the present invention is administered
to a mammal having a condition to be treated. Protein or other active ingredient of the
present invention may be administered in accordance with the method of the invention either
alone or in combination with other therapies such as treatments employing cytokines,
lymphokines or other hematopoietic factors. When co-administered with one or more
cytokines, lymphokines or other hematopoietic factors, protein or other active ingredient of
the present invention may be administered either simultaneously with the cytokine(s),
lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic factors, or
sequentially. If administered sequentially, the attending physician will decide on the
appropriate sequence of administering protein or other active ingredient of the present
invention in combination with cytokine(s), lymphokine(s), other hematopoietic factor(s),
thrombolytic or anti-thrombotic factors.

3.12.1 ROUTES OF ADMINISTRATION

Suitable routes of administration may, for example, include oral, rectal, transmucosal, 
or intestinal administration; parenteral delivery, including intramuscular, subcutaneous, 
intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, 
intraperitoneal, intranasal, or intraocular injections. Administration of protein or other active
ingredient of the present invention used in the pharmaceutical composition or to practice the
method of the present invention can be carried out in a variety of conventional ways, such as 
oral ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, 
parenteral or intravenous injection. Intravenous administration to the patient is preferred. 

Alternately, one may administer the compound in a local rather than systemic manner,
for example, via injection of the compound directly into arthritic joints or fibrotic tissue,
often in a depot or sustained release formulation. In order to prevent the scarring process
frequently occurring as a complication of glaucoma surgery, the compounds may be
administered topically, for example, as eye drops. Furthermore, one may administer the drug
in a targeted drug delivery system, for example, in a liposome coated with a specific
antibody, targeting, for example, arthritic or fibrotic tissue. The liposomes will be targeted to
and taken up selectively by the afflicted tissue.

The polypeptides of the invention are administered by any route that delivers an 
effective dosage to the desired site of action. The determination of a suitable route of 
administration and an effective dosage for a particular indication is within the level of skill in
the art. Preferably for wound treatment, one administers the therapeutic compound directly to
the site. Suitable dosage ranges for the polypeptides of the invention can be extrapolated from 
these dosages or from similar studies in appropriate animal models. Dosages can then be 
adjusted as necessary by the clinician to provide maximal therapeutic benefit. 

3.12.2 COMPOSITIONS/FORMULATIONS

Pharmaceutical compositions for use in accordance with the present invention thus 
may be formulated in a conventional manner using one or more physiologically acceptable 
carriers comprising excipients and auxiliaries which facilitate processing of the active 
compounds into preparations which can be used pharmaceutically. These pharmaceutical
compositions may be manufactured in a manner that is itself known, e.g., by means of 
conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, 
encapsulating, entrapping or lyophilizing processes. Proper formulation is dependent upon 
the route of administration chosen. When a therapeutically effective amount of protein or 
other active ingredient of the present invention is administered orally, protein or other active 
ingredient of the present invention will be in the form of a tablet, capsule, powder, solution or 
elixir. When administered in tablet form, the pharmaceutical composition of the invention 
may additionally contain a solid carrier such as a gelatin or an adjuvant. The tablet, capsule, 
and powder contain from about 5 to 95% protein or other active ingredient of the present 
invention, and preferably from about 25 to 90% protein or other active ingredient of the 
present invention. When administered in liquid form, a liquid carrier such as water, 
petroleum, oils of animal or plant origin such as peanut oil, mineral oil, soybean oil, or 
sesame oil, or synthetic oils may be added. The liquid form of the pharmaceutical 
composition may further contain physiological saline solution, dextrose or other saccharide 
solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. When 
administered in liquid form, the pharmaceutical composition contains from about 0.5 to 90% 
by weight of protein or other active ingredient of the present invention, and preferably from 
about 1 to 50% protein or other active ingredient of the present invention. 

When a therapeutically effective amount of protein or other active ingredient of the
present invention is administered by intravenous, cutaneous or subcutaneous injection,
protein or other active ingredient of the present invention will be in the form of a
pyrogen-free, parenterally acceptable aqueous solution. The preparation of such parenterally
acceptable protein or other active ingredient solutions, having due regard to pH, isotonicity,
stability, and the like, is within the skill in the art. A preferred pharmaceutical composition
for intravenous, cutaneous, or subcutaneous injection should contain, in addition to protein or
other active ingredient of the present invention, an isotonic vehicle such as Sodium Chloride
Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride Injection,
Lactated Ringer's Injection, or other vehicle as known in the art. The pharmaceutical
composition of the present invention may also contain stabilizers, preservatives, buffers,
antioxidants, or other additives known to those of skill in the art. For injection, the agents of
the invention may be formulated in aqueous solutions, preferably in physiologically
compatible buffers such as Hanks' solution, Ringer's solution, or physiological saline buffer.
For transmucosal administration, penetrants appropriate to the barrier to be permeated are
used in the formulation. Such penetrants are generally known in the art.

For oral administration, the compounds can be formulated readily by combining the
active compounds with pharmaceutically acceptable carriers well known in the art. Such
carriers enable the compounds of the invention to be formulated as tablets, pills, dragees,
capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a
patient to be treated. Pharmaceutical preparations for oral use can be obtained by combining
the active compounds with a solid excipient, optionally grinding a resulting mixture, and
processing the mixture of granules, after adding suitable auxiliaries, if desired, to obtain
tablets or dragee cores. Suitable
excipients are, in particular, fillers such as sugars, including lactose, sucrose, mannitol, or
sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch,
potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose,
sodium carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired,
disintegrating agents may be added, such as cross-linked polyvinyl pyrrolidone, agar, or
alginic acid or a salt thereof such as sodium alginate. Dragee cores are provided with suitable
coatings. For this purpose, concentrated sugar solutions may be used, which may optionally
contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or
titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures.
Dyestuffs or pigments may be added to the tablets or dragee coatings for identification or to
characterize different combinations of active compound doses.

Pharmaceutical preparations which can be used orally include push-fit capsules made
of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol
or sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler
such as lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate
and, optionally, stabilizers. In soft capsules, the active compounds may be dissolved or
suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene
glycols. In addition, stabilizers may be added. All formulations for oral administration
should be in dosages suitable for such administration. For buccal administration, the 
compositions may take the form of tablets or lozenges formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
invention are conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., 
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide 
or other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined 
by providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin for 
use in an inhaler or insufflator may be formulated containing a powder mix of the compound 
and a suitable powder base such as lactose or starch. The compounds may be formulated for 
parenteral administration by injection, e.g., by bolus injection or continuous infusion. 
Formulations for injection may be presented in unit dosage form, e.g., in ampules or in 
multi-dose containers, with an added preservative. The compositions may take such forms as 
suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain formulatory 
agents such as suspending, stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions 
of the active compounds in water-soluble form. Additionally, suspensions of the active 
compounds may be prepared as appropriate oily injection suspensions. Suitable lipophilic 
solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as 
ethyl oleate or triglycerides, or liposomes. Aqueous injection suspensions may contain 
substances which increase the viscosity of the suspension, such as sodium carboxymethyl 
cellulose, sorbitol, or dextran. Optionally, the suspension may also contain suitable 
stabilizers or agents which increase the solubility of the compounds to allow for the 
preparation of highly concentrated solutions. Alternatively, the active ingredient may be in 
powder form for constitution with a suitable vehicle, e.g., sterile pyrogen-free water, before 
use. 

The compounds may also be formulated in rectal compositions such as suppositories 
or retention enemas, e.g., containing conventional suppository bases such as cocoa butter or 
other glycerides. In addition to the formulations described previously, the compounds may 
also be formulated as a depot preparation. Such long acting formulations may be 
administered by implantation (for example subcutaneously or intramuscularly) or by 
intramuscular injection. Thus, for example, the compounds may be formulated with suitable 
polymeric or hydrophobic materials (for example as an emulsion in an acceptable oil) or ion
exchange resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt. 

A pharmaceutical carrier for the hydrophobic compounds of the invention is a
co-solvent system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic
polymer, and an aqueous phase. The co-solvent system may be the VPD co-solvent system.
VPD is a solution of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate
80, and 65% w/v polyethylene glycol 300, made up to volume in absolute ethanol. The VPD
co-solvent system (VPD:5W) consists of VPD diluted 1:1 with a 5% dextrose in water
solution. This co-solvent system dissolves hydrophobic compounds well, and itself produces
low toxicity upon systemic administration. Naturally, the proportions of a co-solvent system
may be varied considerably without destroying its solubility and toxicity characteristics.
Furthermore, the identity of the co-solvent components may be varied: for example, other
low-toxicity nonpolar surfactants may be used instead of polysorbate 80; the fraction size of
polyethylene glycol may be varied; other biocompatible polymers may replace polyethylene
glycol, e.g. polyvinyl pyrrolidone; and other sugars or polysaccharides may substitute for
dextrose. Alternatively, other delivery systems for hydrophobic pharmaceutical compounds
may be employed. Liposomes and emulsions are well known examples of delivery vehicles
or carriers for hydrophobic drugs. Certain organic solvents such as dimethylsulfoxide also
may be employed, although usually at the cost of greater toxicity. Additionally, the
compounds may be delivered using a sustained-release system, such as semipermeable
matrices of solid hydrophobic polymers containing the therapeutic agent. Various types of
sustained-release materials have been established and are well known by those skilled in the
art. Sustained-release capsules may, depending on their chemical nature, release the
compounds for a few weeks up to over 100 days. Depending on the chemical nature and the
biological stability of the therapeutic reagent, additional strategies for protein or other active
ingredient stabilization may be employed.
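
The 1:1 dilution of VPD with 5% dextrose in water described above can be worked through numerically. The following is a minimal sketch only: the function name dilute_vpd and the assumption that the two volumes are simply additive are illustrative and are not part of the disclosure.

```python
# Composition of the VPD stock as stated in the text (% w/v).
VPD_STOCK = {
    "benzyl alcohol": 3.0,
    "polysorbate 80": 8.0,
    "polyethylene glycol 300": 65.0,
}
DEXTROSE_DILUENT_PCT = 5.0  # the diluent is 5% dextrose in water


def dilute_vpd(stock_parts: float = 1.0, diluent_parts: float = 1.0) -> dict:
    """Return the % w/v of each component after mixing stock and diluent.

    Assumption (not from the source): volumes are additive on mixing.
    """
    total = stock_parts + diluent_parts
    final = {name: pct * stock_parts / total for name, pct in VPD_STOCK.items()}
    final["dextrose"] = DEXTROSE_DILUENT_PCT * diluent_parts / total
    return final


if __name__ == "__main__":
    for name, pct in dilute_vpd().items():
        print(f"{name}: {pct:.2f}% w/v")
```

Under the additive-volume assumption, a 1:1 dilution halves each stock concentration and contributes dextrose at half the diluent concentration; other ratios can be explored by changing the two arguments.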

The pharmaceutical compositions also may comprise suitable solid or gel phase
carriers or excipients. Examples of such carriers or excipients include but are not limited to
calcium carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin,
and polymers such as polyethylene glycols. Many of the active ingredients of the invention
may be provided as salts with pharmaceutically compatible counter ions. Such
pharmaceutically acceptable base addition salts are those salts which retain the biological
effectiveness and properties of the free acids and which are obtained by reaction with
inorganic or organic bases such as sodium hydroxide, magnesium hydroxide, ammonia,
trialkylamine, dialkylamine, monoalkylamine, dibasic amino acids, sodium acetate,
potassium benzoate, triethanol amine and the like.

The pharmaceutical composition of the invention may be in the form of a complex of 
the protein(s) or other active ingredient(s) of the present invention along with protein or peptide
antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T 
lymphocytes. B lymphocytes will respond to antigen through their surface immunoglobulin 
receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins 
including those encoded by class I and class II MHC genes on host cells will serve to present
the peptide antigen(s) to T lymphocytes. The antigen components could also be supplied as 
purified MHC-peptide complexes alone or with co-stimulatory molecules that can directly 
signal T cells. Alternatively antibodies able to bind surface immunoglobulin and other 
molecules on B cells as well as antibodies able to bind the TCR and other molecules on T 
cells can be combined with the pharmaceutical composition of the invention. 

The pharmaceutical composition of the invention may be in the form of a liposome in
which protein of the present invention is combined, in addition to other pharmaceutically 
acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as 
micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. 
Suitable lipids for liposomal formulation include, without limitation, monoglycerides, 
diglycerides, sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like.
Preparation of such liposomal formulations is within the level of skill in the art, as disclosed, 
for example, in U.S. Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of 
which are incorporated herein by reference. 

The amount of protein or other active ingredient of the present invention in the 
pharmaceutical composition of the present invention will depend upon the nature and severity 
of the condition being treated, and on the nature of prior treatments which the patient has 
undergone. Ultimately, the attending physician will decide the amount of protein or other 
active ingredient of the present invention with which to treat each individual patient. 
Initially, the attending physician will administer low doses of protein or other active 
ingredient of the present invention and observe the patient's response. Larger doses of 
protein or other active ingredient of the present invention may be administered until the 
optimal therapeutic effect is obtained for the patient, and at that point the dosage is not 
increased further. It is contemplated that the various pharmaceutical compositions used to 
practice the method of the present invention should contain about 0.01 µg to about 100 mg
(preferably about 0.1 µg to about 10 mg, more preferably about 0.1 µg to about 1 mg) of
protein or other active ingredient of the present invention per kg body weight. For
compositions of the present invention which are useful for bone, cartilage, tendon or ligament
regeneration, the therapeutic method includes administering the composition topically,
systemically, or locally as an implant or device. When administered, the therapeutic
composition for use in this invention is, of course, in a pyrogen-free, physiologically
acceptable form. Further, the composition may desirably be encapsulated or injected in a
viscous form for delivery to the site of bone, cartilage or tissue damage. Topical
administration may be suitable for wound healing and tissue repair. Therapeutically useful
agents other than a protein or other active ingredient of the invention, which may also
optionally be included in the composition as described above, may alternatively or
additionally be administered simultaneously or sequentially with the composition in the
methods of the invention. Preferably for bone and/or cartilage formation, the composition
would include a matrix capable of delivering the protein-containing or other active
ingredient-containing composition to the site of bone and/or cartilage damage, providing a
structure for the developing bone and cartilage and optimally capable of being resorbed into
the body. Such matrices may be formed of materials presently in use for other implanted
medical applications.

The choice of matrix material is based on biocompatibility, biodegradability,
mechanical properties, cosmetic appearance and interface properties. The particular
application of the compositions will define the appropriate formulation. Potential matrices
for the compositions may be biodegradable and chemically defined calcium sulfate,
tricalcium phosphate, hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides.
Other potential materials are biodegradable and biologically well-defined, such as bone or
dermal collagen. Further matrices are comprised of pure proteins or extracellular matrix
components. Other potential matrices are nonbiodegradable and chemically defined, such as
sintered hydroxyapatite, bioglass, aluminates, or other ceramics. Matrices may be comprised
of combinations of any of the above mentioned types of material, such as polylactic acid and
hydroxyapatite or collagen and tricalcium phosphate. The bioceramics may be altered in
composition, such as in calcium-aluminate-phosphate and processing to alter pore size,
particle size, particle shape, and biodegradability. Presently preferred is a 50:50 (mole
weight) copolymer of lactic acid and glycolic acid in the form of porous particles having
diameters ranging from 150 to 800 microns. In some applications, it will be useful to utilize
a sequestering agent, such as carboxymethyl cellulose or autologous blood clot, to prevent the
protein compositions from disassociating from the matrix.

A preferred family of sequestering agents is cellulosic materials such as
alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, ethylcellulose,
hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl-methylcellulose, and
carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose
(CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate,
poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and poly(vinyl alcohol).
The amount of sequestering agent useful herein is 0.5-20 wt %, preferably 1-10 wt % based
on total formulation weight, which represents the amount necessary to prevent desorption of
the protein from the polymer matrix and to provide appropriate handling of the composition,
yet not so much that the progenitor cells are prevented from infiltrating the matrix, thereby
providing the protein the opportunity to assist the osteogenic activity of the progenitor cells.
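
The weight-percent ranges stated above translate directly into a mass of sequestering agent for a given batch. The sketch below is illustrative arithmetic only; the function name sequestering_agent_mass_g and the 10 g batch size are hypothetical.

```python
# Weight-percent ranges taken from the text, based on total formulation weight.
USEFUL_RANGE_WT_PCT = (0.5, 20.0)
PREFERRED_RANGE_WT_PCT = (1.0, 10.0)


def sequestering_agent_mass_g(total_formulation_g: float, preferred: bool = False) -> tuple:
    """Return (minimum, maximum) grams of sequestering agent for a batch."""
    if total_formulation_g <= 0:
        raise ValueError("total formulation weight must be positive")
    low, high = PREFERRED_RANGE_WT_PCT if preferred else USEFUL_RANGE_WT_PCT
    return (total_formulation_g * low / 100.0, total_formulation_g * high / 100.0)


if __name__ == "__main__":
    # Hypothetical 10 g batch of total formulation.
    print(sequestering_agent_mass_g(10.0))
    print(sequestering_agent_mass_g(10.0, preferred=True))
```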

In further compositions, proteins or other active ingredients of the invention may be
combined with other agents beneficial to the treatment of the bone and/or cartilage defect,
wound, or tissue in question. These agents include various growth factors such as epidermal
growth factor (EGF), platelet-derived growth factor (PDGF), transforming growth factors
(TGF-α and TGF-β), and insulin-like growth factor (IGF).

The therapeutic compositions are also presently valuable for veterinary applications.
In particular, domestic animals and thoroughbred horses, in addition to humans, are desired
patients for such treatment with proteins or other active ingredients of the present invention.
The dosage regimen of a protein-containing pharmaceutical composition to be used in tissue
regeneration will be determined by the attending physician considering various factors which
modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site
of damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue
(e.g., bone), the patient's age, sex, and diet, the severity of any infection, time of
administration and other clinical factors. The dosage may vary with the type of matrix used
in the reconstitution and with inclusion of other proteins in the pharmaceutical composition.
For example, the addition of other known growth factors, such as IGF-I (insulin-like growth
factor I), to the final composition may also affect the dosage. Progress can be monitored by
periodic assessment of tissue/bone growth and/or repair, for example, by X-rays,
histomorphometric determinations and tetracycline labeling.

Polynucleotides of the present invention can also be used for gene therapy. Such
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a
mammalian subject. Polynucleotides of the invention may also be administered by other
known methods for introduction of nucleic acid into a cell or organism (including, without
limitation, in the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in
the presence of proteins of the present invention in order to proliferate or to produce a desired
effect on or activity in such cells. Treated cells can then be introduced in vivo for therapeutic
purposes.

3.12.3 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include
compositions wherein the active ingredients are contained in an effective amount to achieve
their intended purpose. More specifically, a therapeutically effective amount means an amount
effective to prevent development of or to alleviate the existing symptoms of the subject being
treated. Determination of the effective amount is well within the capability of those skilled in
the art, especially in light of the detailed disclosure provided herein. For any compound used
in the method of the invention, the therapeutically effective dose can be estimated initially
from appropriate in vitro assays. For example, a dose can be formulated in animal models to
achieve a circulating concentration range that includes the IC50 as determined in cell culture
(i.e., the concentration of the test compound which achieves a half-maximal inhibition of the
protein's biological activity). Such information can be used to more accurately determine
useful doses in humans.

A therapeutically effective dose refers to that amount of the compound that results in
amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic
efficacy of such compounds can be determined by standard pharmaceutical procedures in cell
cultures or experimental animals, e.g., for determining the LD50 (the dose lethal to 50% of the
population) and the ED50 (the dose therapeutically effective in 50% of the population). The
dose ratio between toxic and therapeutic effects is the therapeutic index and it can be
expressed as the ratio between LD50 and ED50. Compounds which exhibit high therapeutic
indices are preferred. The data obtained from these cell culture assays and animal studies can
be used in formulating a range of dosage for use in humans. The dosage of such compounds
lies preferably within a range of circulating concentrations that include the ED50 with little or
no toxicity. The dosage may vary within this range depending upon the dosage form
employed and the route of administration utilized. The exact formulation, route of
administration and dosage can be chosen by the individual physician in view of the patient's
condition. See, e.g., Fingl et al., 1975, in "The Pharmacological Basis of Therapeutics", Ch.
1, p. 1. Dosage amount and interval may be adjusted individually to provide plasma levels of
the active moiety which are sufficient to maintain the desired effects, or minimal effective 
concentration (MEC). The MEC will vary for each compound but can be estimated from in 
vitro data. Dosages necessary to achieve the MEC will depend on individual characteristics 
and route of administration. However, HPLC assays or bioassays can be used to determine 
plasma concentrations. 
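
The therapeutic index defined above, the ratio between LD50 and ED50, reduces to a single division. The sketch below assumes both values are expressed in the same units; the function name and the numeric inputs are hypothetical placeholders and are not data from this disclosure.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Therapeutic index expressed as LD50 / ED50 (both in the same units)."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


if __name__ == "__main__":
    # Hypothetical values in mg/kg, chosen only to illustrate the arithmetic.
    print(therapeutic_index(ld50=500.0, ed50=5.0))
```

A larger ratio indicates a wider separation between the toxic and the therapeutically effective dose, which is why compounds with high therapeutic indices are preferred.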

Dosage intervals can also be determined using the MEC value. Compounds should be
administered using a regimen which maintains plasma levels above the MEC for 10-90% of 
the time, preferably between 30-90% and most preferably between 50-90%. In cases of local 
administration or selective uptake, the effective local concentration of the drug may not be 
related to plasma concentration. 
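
Whether a regimen keeps plasma levels above the MEC for a given share of the time can be estimated from sampled plasma concentrations. The following is a minimal sketch, assuming linear interpolation between consecutive samples; the function name, the sampling times, and the concentration values are hypothetical and serve only to illustrate the calculation.

```python
def fraction_time_above_mec(times, concs, mec):
    """Fraction of the sampled interval during which concentration >= MEC.

    Assumption (not from the source): concentration varies linearly
    between consecutive samples.
    """
    if len(times) != len(concs) or len(times) < 2:
        raise ValueError("need at least two matching time/concentration samples")
    above = 0.0
    for i in range(len(times) - 1):
        t0, t1 = times[i], times[i + 1]
        c0, c1 = concs[i], concs[i + 1]
        dt = t1 - t0
        if dt <= 0:
            raise ValueError("times must be strictly increasing")
        if c0 >= mec and c1 >= mec:
            above += dt                      # whole segment is above the MEC
        elif c0 < mec and c1 < mec:
            continue                         # whole segment is below the MEC
        else:
            # Position of the crossing within the segment, as a fraction of dt.
            crossing = (mec - c0) / (c1 - c0)
            above += dt * (1.0 - crossing) if c1 >= mec else dt * crossing
    return above / (times[-1] - times[0])


if __name__ == "__main__":
    # Hypothetical samples: hours after dosing and plasma concentration.
    hours = [0.0, 2.0, 4.0, 8.0]
    levels = [0.0, 4.0, 4.0, 0.0]
    print(fraction_time_above_mec(hours, levels, mec=2.0))  # 0.625
```

In this hypothetical profile the concentration is at or above the MEC for 62.5% of the dosing interval, which would fall within the most preferred 50-90% window stated above.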

An exemplary dosage regimen for polypeptides or other compositions of the invention 
will be in the range of about 0.01 µg/kg to 100 mg/kg of body weight daily, with the 
preferred dose being about 0.1 µg/kg to 25 mg/kg of patient body weight daily, varying in 
adults and children. Dosing may be once daily, or equivalent doses may be delivered at 
longer or shorter intervals. 

The amount of composition administered will, of course, be dependent on the subject 
being treated, on the subject's age and weight, the severity of the affliction, the manner of 
administration and the judgment of the prescribing physician. 

3.12.4 PACKAGING 

The compositions may, if desired, be presented in a pack or dispenser device which 
may contain one or more unit dosage forms containing the active ingredient. The pack may, 
for example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser 
device may be accompanied by instructions for administration. Compositions comprising a 
compound of the invention formulated in a compatible pharmaceutical carrier may also be 
prepared, placed in an appropriate container, and labeled for treatment of an indicated 
condition. 

3.13 ANTIBODIES 

Also included in the invention are antibodies to proteins, or fragments of proteins of 
the invention. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that 
contain an antigen-binding site that specifically binds (immunoreacts with) an antigen. Such 
antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, Fab, 
Fab' and F(ab')2 fragments, and an Fab expression library. In general, an antibody molecule 
obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ 
from one another by the nature of the heavy chain present in the molecule. Certain classes 
have subclasses as well, such as IgG1, IgG2, and others. Furthermore, in humans, the light 
chain may be a kappa chain or a lambda chain. Reference herein to antibodies includes a 
reference to all such classes, subclasses and types of human antibody species. 

An isolated related protein of the invention may be intended to serve as an antigen, or 
a portion or fragment thereof, and additionally can be used as an immunogen to generate 
antibodies that immunospecifically bind the antigen, using standard techniques for polyclonal 
and monoclonal antibody preparation. The full-length protein can be used or, alternatively, 
the invention provides antigenic peptide fragments of the antigen for use as immunogens. An 
antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid 
sequence of the full length protein, such as an amino acid sequence shown in SEQ ID NO: 
949-1896, and encompasses an epitope thereof such that an antibody raised against the 
peptide forms a specific immune complex with the full length protein or with any fragment 
that contains the epitope. Preferably, the antigenic peptide comprises at least 10 amino acid 
residues, or at least 15 amino acid residues, or at least 20 amino acid residues, or at least 30 
amino acid residues. Preferred epitopes encompassed by the antigenic peptide are regions of 
the protein that are located on its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of alpha-2-macroglobulin-like protein that is located on the 
surface of the protein, e.g., a hydrophilic region. A hydrophobicity analysis of the human 
related protein sequence will indicate which regions of a related protein are particularly 
hydrophilic and, therefore, are likely to encode surface residues useful for targeting antibody 
production. As a means for targeting antibody production, hydropathy plots showing regions 
of hydrophilicity and hydrophobicity may be generated by any method well known in the art, 
including, for example, the Kyte-Doolittle or the Hopp-Woods methods, either with or 
without Fourier transformation. See, e.g., Hopp and Woods, 1981, Proc. Nat. Acad. Sci. USA 
78: 3824-3828; Kyte and Doolittle 1982, J. Mol. Biol. 157: 105-142, each of which is 
incorporated herein by reference in its entirety. Antibodies that are specific for one or more 
domains within an antigenic protein, or derivatives, fragments, analogs or homologs thereof, 
are also provided herein. 
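
By way of illustration only, a sliding-window hydropathy profile of the kind referred to above might be computed as sketched below. The per-residue values are the published Kyte-Doolittle scale; the window length of 7, the function names and the toy sequence are assumptions made for this example and are not part of the invention or of any sequence disclosed herein.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic, negative = hydrophilic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence: str, window: int = 7) -> list[float]:
    """Mean Kyte-Doolittle score of every window of the given length."""
    residues = sequence.upper()
    if not 0 < window <= len(residues):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[r] for r in residues]
    return [sum(scores[i:i + window]) / window for i in range(len(scores) - window + 1)]


def most_hydrophilic_window(sequence: str, window: int = 7) -> tuple[int, str, float]:
    """Return (start index, peptide, mean score) of the lowest-scoring window."""
    profile = hydropathy_profile(sequence, window)
    start = min(range(len(profile)), key=profile.__getitem__)
    return start, sequence[start:start + window], profile[start]


# Toy sequence (hypothetical): a lysine-rich stretch flanked by leucines.
toy_sequence = "LLLLLLLKKKKKKKLLLLLLL"
start, peptide, score = most_hydrophilic_window(toy_sequence)
print(start, peptide, round(score, 2))
```

The lowest-scoring windows are candidate surface regions; in practice such a prediction would be only a first filter before experimental confirmation of antigenicity.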

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 

The term "specific for" indicates that the variable regions of the antibodies of the 
invention recognize and bind polypeptides of the invention exclusively (i.e., able to 
distinguish the polypeptide of the invention from other similar polypeptides despite sequence 
identity, homology, or similarity found in the family of polypeptides), but may also interact 
with other proteins (for example, S. aureus protein A or other antibodies in ELISA 
techniques) through interactions with sequences outside the variable region of the antibodies, 
and in particular, in the constant region of the molecule. Screening assays to determine 
binding specificity of an antibody of the invention are well known and routinely practiced in 
the art. For a comprehensive discussion of such assays, see Harlow et al. (Eds), Antibodies: A 
Laboratory Manual; Cold Spring Harbor Laboratory; Cold Spring Harbor, NY (1988), 
Chapter 6. Antibodies that recognize and bind fragments of the polypeptides of the invention 
are also contemplated, provided that the antibodies are first and foremost specific for, as 
defined above, full-length polypeptides of the invention. As with antibodies that are specific 
for full length polypeptides of the invention, antibodies of the invention that recognize 
fragments are those which can distinguish polypeptides from the same family of polypeptides 
despite inherent sequence identity, homology, or similarity found in the family of proteins. 

Antibodies of the invention are useful for, for example, therapeutic purposes (by 
modulating activity of a polypeptide of the invention), diagnostic purposes to detect or 
quantitate a polypeptide of the invention, as well as purification of a polypeptide of the 
invention. Kits comprising an antibody of the invention for any of the purposes described 
herein are also comprehended. In general, a kit of the invention also includes a control 
antigen for which the antibody is immunospecific. The invention further provides a 
hybridoma that produces an antibody according to the invention. Antibodies of the invention 
are useful for detection and/or purification of the polypeptides of the invention. 

Monoclonal antibodies binding to the protein of the invention may be useful 
diagnostic agents for the immunodetection of the protein. Neutralizing monoclonal 
antibodies binding to the protein may also be useful therapeutics for both conditions 
associated with the protein and also in the treatment of some forms of cancer where abnormal 
expression of the protein is involved. In the case of cancerous cells or leukemic cells, 
neutralizing monoclonal antibodies against the protein may be useful in detecting and 
preventing the metastatic spread of the cancerous cells, which may be mediated by the 
protein. 

The labeled antibodies of the present invention can be used for in vitro, in vivo, and in 
situ assays to identify cells or tissues in which a fragment of the polypeptide of interest is 
expressed. The antibodies may also be used directly in therapies or other diagnostics. The 
present invention further provides the above-described antibodies immobilized on a solid 
support. Examples of such solid supports include plastics such as polycarbonate, complex 
carbohydrates such as agarose and Sepharose®, acrylic resins such as polyacrylamide 
and latex beads. Techniques for coupling antibodies to such solid supports are well known in 
the art (Weir, D.M. et al., "Handbook of Experimental Immunology" 4th Ed, Blackwell 
Scientific Publications, Oxford, England, Chapter 10 (1986); Jacoby, W.D. et al., Meth. 
Enzym. 34 Academic Press, N.Y. (1974)). The immobilized antibodies of the present 
invention can be used for in vitro, in vivo, and in situ assays as well as for immuno-affinity 
purification of the proteins of the present invention. 

Various procedures known within the art may be used for the production of 
polyclonal or monoclonal antibodies directed against a protein of the invention, or against 
derivatives, fragments, analogs, homologs or orthologs thereof (see, for example, Antibodies: 
A Laboratory Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY, incorporated herein by reference). Some of these antibodies are 
discussed below. 

3.13.1 POLYCLONAL ANTIBODIES 

For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit, 
goat, mouse or other mammal) may be immunized by one or more injections with the native 
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 
recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated 
to a second protein known to be immunogenic in the mammal being immunized. Examples 
of such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, 
serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can 
further include an adjuvant. Various adjuvants used to increase the immunological response 
include, but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., 
aluminum hydroxide), surface-active substances (e.g., lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, dinitrophenol, etc.), adjuvants usable in humans such as 
Bacille Calmette-Guerin and Corynebacterium parvum, or similar immunostimulatory agents. 
Additional examples of adjuvants that can be employed include MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 

The polyclonal antibody molecules directed against the immunogenic protein can be 
isolated from the mammal (e.g., from the blood) and further purified by well known 
techniques, such as affinity chromatography using protein A or protein G, which provide 
primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific 
antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be 
immobilized on a column to purify the immune specific antibody by immunoaffinity 
chromatography. Purification of immunoglobulins is discussed, for example, by D. 
Wilkinson (The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 
(April 17, 2000), pp. 25-28). 

3.13.2 MONOCLONAL ANTIBODIES 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as 
used herein, refers to a population of antibody molecules that contain only one molecular 
species of antibody molecule consisting of a unique light chain gene product and a unique 
heavy chain gene product. In particular, the complementarity determining regions (CDRs) of 
the monoclonal antibody are identical in all the molecules of the population. MAbs thus 
contain an antigen-binding site capable of immunoreacting with a particular epitope of the 
antigen characterized by a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature, 256:495 (1975). In a hybridoma method, a mouse, 
hamster, or other appropriate host animal, is typically immunized with an immunizing agent 
to elicit lymphocytes that produce or are capable of producing antibodies that will specifically 
bind to the immunizing agent. Alternatively, the lymphocytes can be immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof or 
a fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of 
human origin are desired, or spleen cells or lymph node cells are used if non-human 
mammalian sources are desired. The lymphocytes are then fused with an immortalized cell 
line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell 
(Goding, Monoclonal Antibodies: Principles and Practice, Academic Press, (1986) pp. 59- 
103). Immortalized cell lines are usually transformed mammalian cells, particularly myeloma 
cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are 
employed. The hybridoma cells can be cultured in a suitable culture medium that preferably 
contains one or more substances that inhibit the growth or survival of the unfused, 
immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas 
typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which 
substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high 
level expression of antibody by the selected antibody-producing cells, and are sensitive to a 
medium such as HAT medium. More preferred immortalized cell lines are murine myeloma 
lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, 
San Diego, California and the American Type Culture Collection, Manassas, Virginia. 
Human myeloma and mouse-human heteromyeloma cell lines also have been described for 
the production of human monoclonal antibodies (Kozbor, J. Immunol., 133:3001 (1984); 
Brodeur et al., Monoclonal Antibody Production Techniques and Applications, Marcel 
Dekker, Inc., New York, (1987) pp. 51-63). 

The culture medium in which the hybridoma cells are cultured can then be assayed for 
the presence of monoclonal antibodies directed against the antigen. Preferably, the binding 
specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunosorbent assay (ELISA). Such techniques and assays are known in 
the art. The binding affinity of the monoclonal antibody can, for example, be determined by 
the Scatchard analysis of Munson and Pollard, Anal. Biochem., 107:220 (1980). Preferably, 
antibodies having a high degree of specificity and a high binding affinity for the target 
antigen are isolated. 
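
For illustration, the classical linear form of a Scatchard analysis can be sketched as below. It assumes a single class of independent binding sites, for which bound/free = (Bmax - bound)/Kd, and fits a straight line by ordinary least squares. This is a simplified sketch, not the fitting procedure of the cited reference; the function name and the synthetic data are hypothetical.

```python
def scatchard_fit(bound: list[float], free: list[float]) -> tuple[float, float]:
    """Estimate (Kd, Bmax) from a straight-line fit of bound/free against bound."""
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired measurements")
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx                 # equals -1/Kd for a single site class
    intercept = mean_y - slope * mean_x  # equals Bmax/Kd
    kd = -1.0 / slope
    bmax = -intercept / slope         # x-intercept of the fitted line
    return kd, bmax


# Synthetic, noise-free data generated from hypothetical Kd = 2 nM and Bmax = 10 nM.
true_kd, true_bmax = 2.0, 10.0
free_nm = [0.5, 1.0, 2.0, 4.0, 8.0]
bound_nm = [true_bmax * f / (true_kd + f) for f in free_nm]
kd, bmax = scatchard_fit(bound_nm, free_nm)
print(f"Kd = {kd:.2f} nM, Bmax = {bmax:.2f} nM")
```

A smaller Kd corresponds to a higher binding affinity, so clones would be ranked by ascending Kd when selecting high-affinity antibodies.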

After the desired hybridoma cells are identified, the clones can be subcloned by 
limiting dilution procedures and grown by standard methods. Suitable culture media for this 
purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 
medium. Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 
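
The reasoning behind limiting dilution can be illustrated with a short calculation. The sketch assumes that the number of cells seeded per well follows a Poisson distribution and that every seeded cell grows; both are simplifying assumptions introduced here, and the function name and seeding densities are hypothetical.

```python
import math


def clonal_fraction(mean_cells_per_well: float) -> float:
    """Among wells showing growth, the fraction expected to derive from exactly one cell."""
    lam = mean_cells_per_well
    if lam <= 0:
        raise ValueError("mean cells per well must be positive")
    p_exactly_one = lam * math.exp(-lam)   # Poisson probability of one cell
    p_at_least_one = 1.0 - math.exp(-lam)  # probability that the well is not empty
    return p_exactly_one / p_at_least_one


for density in (1.0, 0.5, 0.3):
    print(f"{density} cells/well -> {clonal_fraction(density):.2f} of growing wells are clonal")
```

Lower seeding densities leave more wells empty but raise the chance that a growing well is monoclonal, which is why subcloning is often repeated at well below one cell per well.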

The monoclonal antibodies secreted by the subclones can be isolated or purified from 
the culture medium or ascites fluid by conventional immunoglobulin purification procedures 
such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel 
electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 
those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of 
the invention can be readily isolated and sequenced using conventional procedures (e.g., by 
using oligonucleotide probes that are capable of binding specifically to genes encoding the 
heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as a 
preferred source of such DNA. Once isolated, the DNA can be placed into expression 
vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster 
ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, 
to obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA 
also can be modified, for example, by substituting the coding sequence for human heavy and 
light chain constant domains in place of the homologous murine sequences (U.S. Patent No. 
4,816,567; Morrison, Nature 368, 812-13 (1994)) or by covalently joining to the 
immunoglobulin coding sequence all or part of the coding sequence for a non- 
immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can be substituted 
for the constant domains of an antibody of the invention, or can be substituted for the variable 
domains of one antigen-combining site of an antibody of the invention to create a chimeric 
bivalent antibody. 



3.13.3 HUMANIZED ANTIBODIES 

The antibodies directed against the protein antigens of the invention can further 
comprise humanized antibodies or human antibodies. These antibodies are suitable for 
administration to humans without engendering an immune response by the human against the 
administered immunoglobulin. Humanized forms of antibodies are chimeric 
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 
or other antigen-binding subsequences of antibodies) that are principally comprised of the 
sequence of a human immunoglobulin, and contain minimal sequence derived from a non- 
human immunoglobulin. Humanization can be performed following the method of Winter 
and co-workers (Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323- 
327 (1988); Verhoeyen et al., Science, 239:1534-1536 (1988)), by substituting rodent CDRs 
or CDR sequences for the corresponding sequences of a human antibody. (See also U.S. 
Patent No. 5,225,539). In some instances, Fv framework residues of the human 
immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
can also comprise residues that are found neither in the recipient antibody nor in the imported 
CDR or framework sequences. In general, the humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 
substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the framework regions are those of a human immunoglobulin 
consensus sequence. The humanized antibody optimally also will comprise at least a portion 
of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin (Jones 
et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. Biol., 2:593-596 (1992)). 

3.13.4 HUMAN ANTIBODIES 

Fully human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from human 
genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein. 
Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell 
hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV 
hybridoma technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human 
monoclonal antibodies may be utilized in the practice of the present invention and may be 
produced by using human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 
2026-2030) or by transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et 
al., 1985 In: Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 
77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991); 
Marks et al., J. Mol. Biol., 222:581 (1991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 
humans in all respects, including gene rearrangement, assembly, and antibody repertoire. 
This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 
5,569,825; 5,625,126; 5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 779- 
783 (1992)); Lonberg et al. (Nature 368 856-859 (1994)); Morrison (Nature 368, 812-13 
(1994)); Fishwild et al. (Nature Biotechnology 14, 845-51 (1996)); Neuberger (Nature 
Biotechnology 14, 826 (1996)); and Lonberg and Huszar (Intern. Rev. Immunol. 13 65-93 
(1995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
that are modified so as to produce fully human antibodies rather than the animal's 
endogenous antibodies in response to challenge by an antigen. (See PCT publication 
WO94/02602). The endogenous genes encoding the heavy and light immunoglobulin chains 
in the nonhuman host have been incapacitated, and active loci encoding human heavy and 
light chain immunoglobulins are inserted into the host's genome. The human genes are 
incorporated, for example, using yeast artificial chromosomes containing the requisite human 
DNA segments. An animal which provides all the desired modifications is then obtained as 
progeny by crossbreeding intermediate transgenic animals containing fewer than the full 
complement of the modifications. The preferred embodiment of such a nonhuman animal is a 
mouse, and is termed the Xenomouse™ as disclosed in PCT publications WO 96/33735 and 
WO 96/34096. This animal produces B cells that secrete fully human immunoglobulins. The 
antibodies can be obtained directly from the animal after immunization with an immunogen 
of interest, as, for example, a preparation of a polyclonal antibody, or alternatively from 
immortalized B cells derived from the animal, such as hybridomas producing monoclonal 
antibodies. Additionally, the genes encoding the immunoglobulins with human variable 
regions can be recovered and expressed to obtain the antibodies directly, or can be further 
modified to obtain analogs of antibodies such as, for example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, 
lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent 
No. 5,939,598. It can be obtained by a method including deleting the J segment genes from 
at least one endogenous heavy chain locus in an embryonic stem cell to prevent 
rearrangement of the locus and to prevent formation of a transcript of a rearranged 
immunoglobulin heavy chain locus, the deletion being effected by a targeting vector 
containing a gene encoding a selectable marker; and producing from the embryonic stem cell 
a transgenic mouse whose somatic and germ cells contain the gene encoding the selectable 
marker. 

A method for producing an antibody of interest, such as a human antibody, is 
disclosed in U.S. Patent No. 5,916,771. It includes introducing an expression vector that 
contains a nucleotide sequence encoding a heavy chain into one mammalian host cell in 
culture, introducing an expression vector containing a nucleotide sequence encoding a light 
chain into another mammalian host cell, and fusing the two cells to form a hybrid cell. The 
hybrid cell expresses an antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically 
relevant epitope on an immunogen, and a correlative method for selecting an antibody that 
binds immunospecifically to the relevant epitope with high affinity, are disclosed in PCT 
publication WO 99/53049. 

3.13.5 FAB FRAGMENTS AND SINGLE CHAIN ANTIBODIES 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent 
No. 4,946,778). In addition, methods can be adapted for the construction of Fab expression 
libraries (see e.g., Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective 
identification of monoclonal Fab fragments with the desired specificity for a protein or 
derivatives, fragments, analogs or homologs thereof. Antibody fragments that contain the 
idiotypes to a protein antigen may be produced by techniques known in the art including, but 
not limited to: (i) an F(ab')2 fragment produced by pepsin digestion of an antibody molecule; 
(ii) an Fab fragment generated by reducing the disulfide bridges of an F(ab')2 fragment; (iii) an 
Fab fragment generated by the treatment of the antibody molecule with papain and a reducing 
agent and (iv) Fv fragments. 

3.13.6 BISPECIFIC ANTIBODIES 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 
have binding specificities for at least two different antigens. In the present case, one of the 
binding specificities is for an antigenic protein of the invention. The second binding target is 
any other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit. 
Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature, 305:537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) 
produce a potential mixture of ten different antibody molecules, of which only one has the 
correct bispecific structure. The purification of the correct molecule is usually accomplished 
by affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, 
published 13 May 1993, and in Traunecker et al., 1991 EMBO J. 10:3655-3659. 
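
The count of ten possible molecules from a quadroma can be checked by enumeration. The sketch below models an antibody, as a simplification, as an unordered pair of heavy-chain/light-chain arms drawn from two heavy chains and two light chains; the chain labels are hypothetical and any pairing preferences between chains are ignored.

```python
from itertools import combinations_with_replacement, product

heavy_chains = ["H1", "H2"]  # hypothetical labels for the two heavy chains
light_chains = ["L1", "L2"]  # hypothetical labels for the two light chains

# Each arm is one heavy chain paired with one light chain: 4 possible arms.
arms = [h + l for h, l in product(heavy_chains, light_chains)]

# A molecule is an unordered pair of arms, and the two arms may be the same.
molecules = list(combinations_with_replacement(arms, 2))

# Only the molecule with both cognate arms has the desired bispecific structure.
correct = [m for m in molecules if set(m) == {"H1L1", "H2L2"}]

print(len(molecules), "possible molecules;", len(correct), "correctly paired bispecific")
```

Because only one of the enumerated species is the desired product, the yield of random assortment is low, which motivates the affinity purification steps and engineered approaches described in this section.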

Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part 
of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant 
region (CH1) containing the site necessary for light-chain binding present in at least one of 
the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the 
immunoglobulin light chain, are inserted into separate expression vectors, and are co- 
transfected into a suitable host organism. For further details of generating bispecific 
antibodies see, for example, Suresh et al., Methods in Enzymology, 121:210 (1986). 

According to another approach described in WO 96/27011, the interface between a 
pair of antibody molecules can be engineered to maximize the percentage of heterodimers 
that are recovered from recombinant cell culture. The preferred interface comprises at least a 
part of the CH3 region of an antibody constant domain. In this method, one or more small 
amino acid side chains from the interface of the first antibody molecule are replaced with 
larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or 
similar size to the large side chain(s) are created on the interface of the second antibody 
molecule by replacing large amino acid side chains with smaller ones (e.g. alanine or 
threonine). This provides a mechanism for increasing the yield of the heterodimer over other 
unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments 
(e.g., F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies from 
antibody fragments have been described in the literature. For example, bispecific antibodies 
can be prepared using chemical linkage. Brennan et al. Science 229:81 (1985) describe a 
procedure wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments. 
These fragments are reduced in the presence of the dithiol complexing agent sodium arsenite 
to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' 
fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of the 
Fab'-TNB derivatives is then reconverted to the Fab'-thiol by reduction with 
mercaptoethylamine and is mixed with an equimolar amount of the other Fab'-TNB 
derivative to form the bispecific antibody. The bispecific antibodies produced can be used as 
agents for the selective immobilization of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) 
describe the production of a fully humanized bispecific antibody F(ab')2 molecule. Each 
Fab' fragment was separately secreted from E. coli and subjected to directed chemical 
coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was 
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as 
trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor targets. 

Various techniques for making and isolating bispecific antibody fragments directly 
from recombinant cell culture have also been described. For example, bispecific antibodies 
have been produced using leucine zippers. Kostelny et al., J. Immunol. 148(5):1547-1553 
(1992). The leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' 
portions of two different antibodies by gene fusion. The antibody homodimers were reduced 
at the hinge region to form monomers and then re-oxidized to form the antibody 
heterodimers. This method can also be utilized for the production of antibody homodimers. 

The "diabody" technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 
90:6444-6448 (1993) has provided an alternative mechanism for making bispecific antibody 
fragments. The fragments comprise a heavy-chain variable domain (VH) connected to a 
light-chain variable domain (VL) by a linker which is too short to allow pairing between the 
two domains on the same chain. Accordingly, the VH and VL domains of one fragment are 
forced to pair with the complementary VL and VH domains of another fragment, thereby 
forming two antigen-binding sites. Another strategy for making bispecific antibody 
fragments by the use of single-chain Fv (sFv) dimers has also been reported. See, Gruber et 
al., J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 
antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1991). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of 
which originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm 
of an immunoglobulin molecule can be combined with an arm which binds to a triggering 
molecule on a leukocyte such as a T-cell receptor molecule (e.g., CD2, CD3, CD28, or B7), or
Fc receptors for IgG (FcγR), such as FcγRI (CD64), FcγRII (CD32) and FcγRIII (CD16), so
as to focus cellular defense mechanisms to the cell expressing the particular antigen.
Bispecific antibodies can also be used to direct cytotoxic agents to cells which express a
particular antigen. These antibodies possess an antigen-binding arm and an arm which binds
a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DTPA, DOTA, or TETA.
Another bispecific antibody of interest binds the protein antigen described herein and further
binds tissue factor (TF). 

3.13.7 HETEROCONJUGATE ANTIBODIES

Heteroconjugate antibodies are also within the scope of the present invention.
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such
antibodies have, for example, been proposed to target immune system cells to unwanted cells
(U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO
92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro using
known methods in synthetic protein chemistry, including those involving crosslinking agents. 
For example, immunotoxins can be constructed using a disulfide exchange reaction or by 
forming a thioether bond. Examples of suitable reagents for this purpose include 
iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. 
Patent No. 4,676,980. 

3.13.8 EFFECTOR FUNCTION ENGINEERING 

It can be desirable to modify the antibody of the invention with respect to effector 
function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing
interchain disulfide bond formation in this region. The homodimeric antibody thus
generated can have improved internalization capability and/or increased complement-
mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et
al., J. Exp. Med., 176:1191-1195 (1992) and Shopes, J. Immunol., 148:2918-2922 (1992).
Homodimeric antibodies with enhanced anti-tumor activity can also be prepared using
heterobifunctional cross-linkers as described in Wolff et al., Cancer Research, 53:2560-2565
(1993). Alternatively, an antibody can be engineered that has dual Fc regions and can
thereby have enhanced complement lysis and ADCC capabilities. See Stevenson et al.,
Anti-Cancer Drug Design, 3:219-230 (1989).

3.13.9 IMMUNOCONJUGATES 

The invention also pertains to immunoconjugates comprising an antibody conjugated 
to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active 
toxin of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive 
isotope (i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have 
been described above. Enzymatically active toxins and fragments thereof that can be used 
include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A 
chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain,
alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolacca americana proteins (PAPI,
PAPII, and PAP-S), Momordica charantia inhibitor, curcin, crotin, Saponaria officinalis
inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the trichothecenes. A
variety of radionuclides are available for the production of radioconjugated antibodies.
Examples include 212Bi, 131I, 131In, 90Y, and 186Re.

Conjugates of the antibody and cytotoxic agent are made using a variety of 
bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate 
(SPDP), iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl 
adipimidate HCl), active esters (such as disuccinimidyl suberate), aldehydes (such as
glutaraldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-
diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates
(such as toluene 2,6-diisocyanate), and bis-active fluorine compounds (such as 1,5-difluoro-
2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in
Vitetta et al., Science, 238:1098 (1987). Carbon-14-labeled 1-isothiocyanatobenzyl-3-
methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for
conjugation of radionuclide to the antibody. See WO 94/11026.

In another embodiment, the antibody can be conjugated to a "receptor" (such as
streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is
administered to the patient, followed by removal of unbound conjugate from the circulation
using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn
conjugated to a cytotoxic agent. 

3.14 COMPUTER READABLE SEQUENCES 

In one application of this embodiment, a nucleotide sequence of the present invention
can be recorded on computer readable media. As used herein, "computer readable media"
refers to any medium which can be read and accessed directly by a computer. Such media
include, but are not limited to: magnetic storage media, such as floppy discs, hard disc
storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical
storage media such as RAM and ROM; and hybrids of these categories such as
magnetic/optical storage media. A skilled artisan can readily appreciate how any of the
presently known computer readable media can be used to create a manufacture comprising
computer readable medium having recorded thereon a nucleotide sequence of the present
invention. As used herein, "recorded" refers to a process for storing information on computer
readable medium. A skilled artisan can readily adopt any of the presently known methods for
recording information on computer readable medium to generate manufactures comprising 
the nucleotide sequence information of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating a 
computer readable medium having recorded thereon a nucleotide sequence of the present 
invention. The choice of the data storage structure will generally be based on the means 
chosen to access the stored information. In addition, a variety of data processor programs and 
formats can be used to store the nucleotide sequence information of the present invention on 
computer readable medium. The sequence information can be represented in a word 
processing text file, formatted in commercially-available software such as WordPerfect and 
Microsoft Word, or represented in the form of an ASCII file, stored in a database application, 
such as DB2, Sybase, Oracle, or the like. A skilled artisan can readily adapt any number of 
data processor structuring formats (e.g. text file or database) in order to obtain computer 
readable medium having recorded thereon the nucleotide sequence information of the present 
invention. 
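
The two storage options named above (an ASCII text file and a database application) might be sketched as follows. This is a minimal illustration only: the record name and residues are hypothetical placeholders rather than any of the sequences of the invention, the FASTA layout is one assumed choice of ASCII format, and SQLite from the Python standard library stands in for database applications such as DB2, Sybase or Oracle.

```python
import os
import sqlite3
import tempfile

# Hypothetical placeholder record; NOT one of the sequences of the invention.
RECORDS = {"example_seq_1": "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG"}


def write_fasta(records, path, width=60):
    """Write records to an ASCII text file in FASTA layout."""
    with open(path, "w", encoding="ascii") as handle:
        for name, seq in records.items():
            handle.write(f">{name}\n")
            for i in range(0, len(seq), width):
                handle.write(seq[i:i + width] + "\n")


def read_fasta(path):
    """Read a FASTA file back into a {name: sequence} dictionary."""
    records, name = {}, None
    with open(path, encoding="ascii") as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                name = line[1:]
                records[name] = ""
            elif line and name is not None:
                records[name] += line
    return records


def store_in_database(records, connection):
    """Store records in a relational table (SQLite is an assumed stand-in)."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS sequences (name TEXT PRIMARY KEY, residues TEXT)"
    )
    connection.executemany(
        "INSERT OR REPLACE INTO sequences VALUES (?, ?)", list(records.items())
    )
    connection.commit()


def fetch_from_database(connection, name):
    """Return the stored residues for a record name, or None if absent."""
    row = connection.execute(
        "SELECT residues FROM sequences WHERE name = ?", (name,)
    ).fetchone()
    return row[0] if row else None


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        fasta_path = os.path.join(tmp, "sequences.fasta")
        write_fasta(RECORDS, fasta_path)
        print(read_fasta(fasta_path) == RECORDS)
    db = sqlite3.connect(":memory:")
    store_in_database(RECORDS, db)
    print(fetch_from_database(db, "example_seq_1"))
```

Which structure is preferable depends, as stated above, on the means chosen to access the stored information: a flat file suits sequential reading, while a table suits lookup by name.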

By providing any of the nucleotide sequences SEQ ID NOs: 1 - 948 or a
representative fragment thereof, or a nucleotide sequence at least 95% identical to any of the
nucleotide sequences of SEQ ID NOs: 1 - 948 in computer readable form, a skilled artisan 
can routinely access the sequence information for a variety of purposes. Computer software 
is publicly available which allows a skilled artisan to access sequence information provided 
in a computer readable medium. The examples which follow demonstrate how software 
which implements the BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and 
BLAZE (Brutlag et al., Comp. Chem. 17:203-207 (1993)) search algorithms on a Sybase 
system is used to identify open reading frames (ORFs) within a nucleic acid sequence. Such 
ORFs may be protein encoding fragments and may be useful in producing commercially 
important proteins such as enzymes used in fermentation reactions and in the production of 
commercially useful metabolites. 
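
The identification of ORFs within a nucleic acid sequence can be illustrated with a simple frame scan. The sketch below is not the BLAST or BLAZE software named above; it assumes an ORF is defined as an ATG start codon followed in frame by a stop codon, and the function names and the min_codons threshold are hypothetical choices made for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq, min_codons=3):
    """Return (strand, start, end, orf) tuples for ATG...stop reading frames.

    start and end are 0-based, end-exclusive coordinates on the scanned strand;
    min_codons counts the start codon but not the stop codon.
    """
    seq = seq.upper()
    orfs = []
    for strand, strand_seq in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(strand_seq) - 2, 3):
                codon = strand_seq[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first in-frame start codon opens an ORF
                elif start is not None and codon in STOP_CODONS:
                    if (i - start) // 3 >= min_codons:
                        orfs.append((strand, start, i + 3, strand_seq[start:i + 3]))
                    start = None  # look for the next ORF in this frame
    return orfs


if __name__ == "__main__":
    # Hypothetical test sequence, not a sequence of the invention.
    for orf in find_orfs("CCATGGCCAAATAGTT"):
        print(orf)
```

All six reading frames are scanned (three on each strand); an ORF left open at the end of the sequence is not reported.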

As used herein, "a computer-based system" refers to the hardware means, software 
means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware means of the computer-based systems of the 
present invention comprises a central processing unit (CPU), input means, output means, and 
data storage means. A skilled artisan can readily appreciate that any one of the currently
available computer-based systems is suitable for use in the present invention. As stated
above, the computer-based systems of the present invention comprise a data storage means
having stored therein a nucleotide sequence of the present invention and the necessary
hardware means and software means for supporting and implementing a search means. As
used herein, "data storage means" refers to memory which can store nucleotide sequence
information of the present invention, or a memory access means which can access
manufactures having recorded thereon the nucleotide sequence information of the present
invention. 

As used herein, "search means" refers to one or more programs which are 
implemented on the computer-based system to compare a target sequence or target structural 
motif with the sequence information stored within the data storage means. Search means are
used to identify fragments or regions of a known sequence which match a particular target
sequence or target motif. A variety of known algorithms are disclosed publicly and a variety
of commercially available software for conducting search means are available and can be
used in the computer-based systems of the present invention. Examples of such software
include, but are not limited to, Smith-Waterman, MacPattern (EMBL), BLASTN and BLASTX
(NCBI). A skilled artisan can readily recognize that any one of the available
algorithms or implementing software packages for conducting homology searches can be
adapted for use in the present computer-based systems. As used herein, a "target sequence"
can be any nucleic acid or amino acid sequence of six or more nucleotides or two or more
amino acids. A skilled artisan can readily recognize that the longer a target sequence is, the
less likely a target sequence will be present as a random occurrence in the database. The
most preferred sequence length of a target sequence is from about 10 to 300 amino acids, 
more preferably from about 30 to 100 nucleotide residues. However, it is well recognized 
that searches for commercially important fragments, such as sequence fragments involved in 
gene expression and protein processing, may be of shorter length. 
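
The relationship between target length and random occurrence can be made concrete with a rough estimate. The sketch below assumes, for illustration only, that database residues are independent and equally frequent (four nucleotides or twenty amino acids); real databases deviate from this, and the function name is hypothetical.

```python
def expected_random_matches(target_length, database_length, alphabet_size=4):
    """Expected number of exact chance matches of one specific target sequence.

    Assumes independent, equally frequent residues: each of the
    (database_length - target_length + 1) start positions matches with
    probability (1 / alphabet_size) ** target_length.
    """
    if target_length > database_length:
        return 0.0
    positions = database_length - target_length + 1
    return positions * (1.0 / alphabet_size) ** target_length


if __name__ == "__main__":
    database = 10**9  # hypothetical database size in nucleotides
    for length in (6, 12, 20, 30):
        print(length, expected_random_matches(length, database))
```

Under these assumptions a six-nucleotide target is expected to occur by chance many times in a large database, whereas each added residue lowers the expectation by a factor of four (or twenty for amino acids).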

As used herein, "a target structural motif," or "target motif," refers to any rationally
selected sequence or combination of sequences in which the sequence(s) are chosen based on
a three-dimensional configuration which is formed upon the folding of the target motif.
There are a variety of target motifs known in the art. Protein target motifs include, but are
not limited to, enzyme active sites and signal sequences. Nucleic acid target motifs include,
but are not limited to, promoter sequences, hairpin structures and inducible expression
elements (protein binding sequences). 
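
As one illustration of searching for a nucleic acid target motif, the sketch below looks for candidate hairpin structures, taken here to mean a short stem followed, after a loop, by its reverse complement. The stem and loop lengths are hypothetical parameters, and perfect base pairing is assumed; this is not a structure prediction method.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_hairpins(seq, stem=4, min_loop=3, max_loop=8):
    """Return (stem_start, loop_length) pairs for candidate hairpins.

    A candidate is a stem of `stem` bases whose reverse complement occurs
    downstream after a loop of min_loop to max_loop bases.
    """
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 2 * stem - min_loop + 1):
        left = seq[i:i + stem]
        for loop in range(min_loop, max_loop + 1):
            j = i + stem + loop
            if j + stem > len(seq):
                break  # the right arm would run past the end
            if seq[j:j + stem] == reverse_complement(left):
                hits.append((i, loop))
    return hits


if __name__ == "__main__":
    # Hypothetical sequence: GGAC ... GTCC can pair across a 4-base loop.
    print(find_hairpins("TTGGACAAAAGTCCTT"))
```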

3.15 TRIPLE HELIX FORMATION

In addition, the fragments of the present invention, as broadly described, can be used
to control gene expression through triple helix formation or antisense DNA or RNA, both of
which methods are based on the binding of a polynucleotide sequence to DNA or RNA.
Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and
are designed to be complementary to a region of the gene involved in transcription (triple
helix - see Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456
(1988); and Dervan et al., Science 251:1360 (1991)) or to the mRNA itself (antisense -
Okano, J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of
Gene Expression, CRC Press, Boca Raton, FL (1988)). Triple helix-formation optimally
results in a shut-off of RNA transcription from DNA, while antisense RNA hybridization 
blocks translation of an mRNA molecule into polypeptide. Both techniques have been 
demonstrated to be effective in model systems. Information contained in the sequences of the 
present invention is necessary for the design of an antisense or triple helix oligonucleotide. 
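
How sequence information feeds the design of an antisense oligonucleotide can be sketched as follows. The example simply takes the reverse complement of a chosen mRNA region of the preferred 20 to 40 bases; the mRNA shown is a hypothetical placeholder, the function name is an assumption, and practical design would also weigh factors not covered here, such as target accessibility.

```python
# Map each RNA base to its complementary DNA base.
DNA_COMPLEMENT = str.maketrans("ACGU", "TGCA")

# Hypothetical mRNA fragment used only for illustration.
EXAMPLE_MRNA = "AUGGCCAUUGUAAUGGGCCGCUGAAAGG"


def antisense_oligo(mrna, start, length=20):
    """Return the antisense DNA oligonucleotide (5'->3') for mrna[start:start+length].

    The 20 to 40 base window reflects the preferred range stated in the text;
    it is enforced here only to keep the example strict.
    """
    if not 20 <= length <= 40:
        raise ValueError("length should be 20 to 40 bases")
    target = mrna.upper().replace("T", "U")[start:start + length]
    if len(target) < length:
        raise ValueError("target region runs past the end of the sequence")
    return target.translate(DNA_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(antisense_oligo(EXAMPLE_MRNA, 0, 20))
```

The input may be given in RNA or DNA letters; thymine is converted to uracil before the complement is taken.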

3.16 DIAGNOSTIC ASSAYS AND KITS

The present invention further provides methods to identify the presence or expression 
of one of the ORFs of the present invention, or homolog thereof, in a test sample, using a 
nucleic acid probe or antibodies of the present invention, optionally conjugated or otherwise 
associated with a suitable label. 

In general, methods for detecting a polynucleotide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the 
polynucleotide for a period sufficient to form the complex, and detecting the complex, so that 
if a complex is detected, a polynucleotide of the invention is detected in the sample. Such 
methods can also comprise contacting a sample under stringent hybridization conditions with 
nucleic acid primers that anneal to a polynucleotide of the invention under such conditions, 
and amplifying annealed polynucleotides, so that if a polynucleotide is amplified, a 
polynucleotide of the invention is detected in the sample. 

In general, methods for detecting a polypeptide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the polypeptide 
for a period sufficient to form the complex, and detecting the complex, so that if a complex is 
detected, a polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of the 
antibodies or one or more of the nucleic acid probes of the present invention and assaying for 
binding of the nucleic acid probes or antibodies to components within the test sample.
Conditions for incubating a nucleic acid probe or antibody with a test sample vary.
Incubation conditions depend on the format employed in the assay, the detection methods
employed, and the type and nature of the nucleic acid probe or antibody used in the assay.
One skilled in the art will recognize that any one of the commonly available hybridization,
amplification or immunological assay formats can readily be adapted to employ the nucleic
acid probes or antibodies of the present invention. Examples of such assays can be found in
Chard, T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science
Publishers, Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in
Immunocytochemistry, Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3
(1985); Tijssen, P., Practice and Theory of Immunoassays: Laboratory Techniques in
Biochemistry and Molecular Biology, Elsevier Science Publishers, Amsterdam, The
Netherlands (1985). The test samples of the present invention include cells, protein or
membrane extracts of cells, or biological fluids such as sputum, blood, serum, plasma, or
urine. The test sample used in the above-described method will vary based on the assay
format, nature of the detection method and the tissues, cells or extracts used as the sample to
be assayed. Methods for preparing protein extracts or membrane extracts of cells are well
known in the art and can readily be adapted in order to obtain a sample which is
compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the
necessary reagents to carry out the assays of the present invention. Specifically, the
invention provides a compartment kit to receive, in close confinement, one or more
containers which comprise: (a) a first container comprising one of the probes or antibodies
of the present invention; and (b) one or more other containers comprising one or more of the
following: wash reagents, reagents capable of detecting presence of a bound probe or
antibody.

In detail, a compartment kit includes any kit in which reagents are contained in 
separate containers. Such containers include small glass containers, plastic containers or 
strips of plastic or paper. Such containers allow one to efficiently transfer reagents from one
compartment to another compartment such that the samples and reagents are not
cross-contaminated, and the agents or solutions of each container can be added in a
quantitative fashion from one compartment to another. Such containers will include a
container which will accept the test sample, a container which contains the antibodies used in
the assay, containers which contain wash reagents (such as phosphate buffered saline,
Tris-buffers, etc.), and containers which contain the reagents used to detect the bound
antibody or probe. Types of detection reagents include labeled nucleic acid probes, labeled
secondary antibodies, or in the alternative, if the primary antibody is labeled, the enzymatic, 
or antibody binding reagents which are capable of reacting with the labeled antibody. One 
skilled in the art will readily recognize that the disclosed probes and antibodies of the present 
invention can be readily incorporated into one of the established kit formats which are well 
known in the art. 



3.17 MEDICAL IMAGING

The novel polypeptides and binding partners of the invention are useful in medical
imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the
invention is involved in the immune response, for imaging sites of inflammation or
infection). See, e.g., Kunkel et al., U.S. Pat. No. 5,413,778. Such methods involve chemical
attachment of a labeling or imaging agent, administration of the labeled polypeptide to a
subject in a pharmaceutically acceptable carrier, and imaging the labeled polypeptide in vivo
at the target site.

3.18 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present invention 
further provides methods of obtaining and identifying agents which bind to a polypeptide 
encoded by an ORF corresponding to any of the nucleotide sequences set forth in SEQ ID 
NOs: 1 - 948, or bind to a specific domain of the polypeptide encoded by the nucleic acid. In 
detail, said method comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by an ORF of the present 
invention, or nucleic acid of the invention; and 

(b) determining whether the agent binds to said protein or said nucleic acid.

In general, therefore, such methods for identifying compounds that bind to a
polynucleotide of the invention can comprise contacting a compound with a polynucleotide
of the invention for a time sufficient to form a polynucleotide/compound complex, and 
detecting the complex, so that if a polynucleotide/compound complex is detected, a 
compound that binds to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind to 
a polypeptide of the invention can comprise contacting a compound with a polypeptide of the 
invention for a time sufficient to form a polypeptide/compound complex, and detecting the
complex, so that if a polypeptide/compound complex is detected, a compound that binds to a
polypeptide of the invention is identified.

Methods for identifying compounds that bind to a polypeptide of the invention can
also comprise contacting a compound with a polypeptide of the invention in a cell for a time
sufficient to form a polypeptide/compound complex, wherein the complex drives expression
of a reporter gene sequence in the cell, and detecting the complex by detecting reporter gene
sequence expression, so that if a polypeptide/compound complex is detected, a compound 
that binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate the
activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to
activity observed in the absence of the compound). Alternatively, compounds identified via
such methods can include compounds which modulate the expression of a polynucleotide of
the invention (that is, increase or decrease expression relative to expression levels observed in
the absence of the compound). Compounds, such as compounds identified via the methods of
the invention, can be tested using standard assays well known to those of skill in the art for
their ability to modulate activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be 
selected and screened at random or rationally selected or designed using protein modeling
techniques.

For random screening, agents such as peptides, carbohydrates, pharmaceutical agents 
and the like are selected at random and are assayed for their ability to bind to the protein 
encoded by the ORF of the present invention. Alternatively, agents may be rationally 
selected or designed. As used herein, an agent is said to be "rationally selected or designed"
when the agent is chosen based on the configuration of the particular protein. For example,
one skilled in the art can readily adapt currently available procedures to generate peptides,
pharmaceutical agents and the like, capable of binding to a specific peptide sequence, in order
to generate rationally designed antipeptide peptides, for example see Hurby et al., "Application
of Synthetic Peptides: Antisense Peptides," In Synthetic Peptides, A User's Guide, W.H.
Freeman, NY (1992), pp. 289-307, and Kaspczak et al., Biochemistry 28:9230-8 (1989), or
pharmaceutical agents, or the like. 

In addition to the foregoing, one class of agents of the present invention, as broadly 
described, can be used to control gene expression through binding to one of the ORFs or 
EMFs of the present invention. As described above, such agents can be randomly screened or
rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design
sequence specific or element specific agents, modulating the expression of either a single
ORF or multiple ORFs which rely on the same EMF for expression control. One class of 
DNA binding agents are agents which contain base residues which hybridize or form a triple 
helix formation by binding to DNA or RNA. Such agents can be based on the classic 
phosphodiester, ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric 
derivatives which have base attachment capacity. 

Agents suitable for use in these methods preferably contain 20 to 40 bases and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - 
see Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and 
Dervan et al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano, J.
Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene 
Expression, CRC Press, Boca Raton, FL (1988)). Triple helix-formation optimally results in 
a shut-off of RNA transcription from DNA, while antisense RNA hybridization blocks 
translation of an mRNA molecule into polypeptide. Both techniques have been demonstrated 
to be effective in model systems. Information contained in the sequences of the present 
invention is necessary for the design of an antisense or triple helix oligonucleotide and other 
DNA binding agents. 

Agents which bind to a protein encoded by one of the ORFs of the present invention 
can be used as a diagnostic agent. Agents which bind to a protein encoded by one of the 
ORFs of the present invention can be formulated using known techniques to generate a 
pharmaceutical composition. 

3.19 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic 
acid hybridization probes capable of hybridizing with naturally occurring nucleotide 
sequences. The hybridization probes of the subject invention may be derived from any of the 
nucleotide sequences SEQ ID NOs: 1 - 948. Because the corresponding gene is only 
expressed in a limited number of tissues, a hybridization probe derived from any of the
nucleotide sequences SEQ ID NOs: 1 - 948 can be used as an indicator of the presence of 
RNA of cell type of such a tissue in a sample. 

Any suitable hybridization technique can be employed, such as, for example, in situ
hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 provides
additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used
in PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both.
The probe will comprise a discrete nucleotide sequence for the detection of identical 
sequences or a degenerate pool of possible sequences for identification of closely related 
genomic sequences. 
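
The notion of a degenerate pool can be illustrated by expanding a probe written with ambiguity codes into every discrete sequence it represents. The sketch assumes the standard IUPAC nucleotide ambiguity codes as the notation; the text itself does not specify a notation, and the function name is hypothetical.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(probe):
    """List every discrete sequence represented by a degenerate probe."""
    choices = [IUPAC[base] for base in probe.upper()]
    return ["".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    # Hypothetical degenerate probe: R = A or G, N = any base.
    pool = expand_degenerate("GARN")
    print(len(pool), pool)
```

The pool size is the product of the number of choices at each position, so degeneracy grows quickly with the number of ambiguous positions.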

Other means for producing specific hybridization probes for nucleic acids include the
cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such
vectors are known in the art and are commercially available and may be used to synthesize
RNA probes in vitro by means of the addition of the appropriate RNA polymerase, such as T7
or SP6 RNA polymerase, and the appropriate radioactively labeled nucleotides. The nucleotide
sequences may be used to construct hybridization probes for mapping their respective
genomic sequences. The nucleotide sequence provided herein may be mapped to a
chromosome or specific regions of a chromosome using well known genetic and/or
chromosomal mapping techniques. These techniques include in situ hybridization, linkage
analysis against known chromosomal markers, hybridization screening with libraries or
flow-sorted chromosomal preparations specific to known chromosomes, and the like. The
technique of fluorescent in situ hybridization of chromosome spreads has been described,
among other places, in Verma et al. (1988) Human Chromosomes: A Manual of Basic
Techniques, Pergamon Press, New York NY. 

Fluorescent in situ hybridization of chromosomal preparations and other physical
chromosome mapping techniques may be correlated with additional genetic map data.
Examples of genetic map data can be found in the 1994 Genome Issue of Science
(265:1981f). Correlation between the location of a nucleic acid on a physical chromosomal
map and a specific disease (or predisposition to a specific disease) may help delimit the
region of DNA associated with that genetic disease. The nucleotide sequences of the subject
invention may be used to detect differences in gene sequences between normal, carrier or
affected individuals. 

3.20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared by, for 
example, directly synthesizing the oligonucleotide by chemical means, as is commonly practiced 
using an automated oligonucleotide synthesizer.

Support bound oligonucleotides may be prepared by any of the methods known to those 
of skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy is 
to precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can be
achieved using passive adsorption (Inouye & Hondo, (1990) J. Clin. Microbiol. 28(6) 1469-72);
using UV light (Nagata et al., 1985; Dahlen et al., 1987; Morrissey & Collins, (1989) Mol. Cell
Probes 3(2) 189-207) or by covalent binding of base modified DNA (Keller et al., 1988; 1989);
all references being specifically incorporated herein.

Another strategy that may be employed is the use of the strong biotin-streptavidin
interaction as a linker. For example, Broude et al. (1994) Proc. Natl. Acad. Sci. USA 91(8)
3072-6, describe the use of biotinylated probes, although these are duplex probes, that are
immobilized on streptavidin-coated magnetic beads. Streptavidin-coated beads may be
purchased from Dynal, Oslo. Of course, this same linking chemistry is applicable to coating any
surface with streptavidin. Biotinylated probes may be purchased from various sources, such as,
e.g., Operon Technologies (Alameda, CA).

Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used.
Nunc Laboratories have developed a method by which DNA can be covalently bound to the
microwell surface termed CovaLink NH. CovaLink NH is a polystyrene surface grafted with
secondary amino groups (>NH) that serve as bridge-heads for further covalent coupling.
CovaLink Modules may be purchased from Nunc Laboratories. DNA molecules may be bound
to CovaLink exclusively at the 5'-end by a phosphoramidate bond, allowing immobilization of
more than 1 pmol of DNA (Rasmussen et al., (1991) Anal. Biochem. 198(1) 138-42).

The use of CovaLink NH strips for covalent binding of DNA molecules at the 5'-end has
been described (Rasmussen et al., (1991)). In this technology, a phosphoramidate bond is
employed (Chu et al., (1983) Nucleic Acids Res. 11(8) 6513-29). This is beneficial as
immobilization using only a single covalent bond is preferred. The phosphoramidate bond joins
the DNA to the CovaLink NH secondary amino groups that are positioned at the end of spacer
arms covalently grafted onto the polystyrene surface through a 2 nm long spacer arm. To link an
oligonucleotide to CovaLink NH via a phosphoramidate bond, the oligonucleotide terminus
must have a 5'-end phosphate group. It is, perhaps, even possible for biotin to be covalently
bound to CovaLink and then streptavidin used to bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/µl) and
denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1 M
1-methylimidazole, pH 7.0 (1-MeIm7), is then added to a final concentration of 10 mM
1-MeIm7. A ss DNA solution is then dispensed into CovaLink NH strips (75 µl/well) standing
on ice.

Carbodiimide 0.2 M 1-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), dissolved
in 10 mM 1-MeIm7, is made fresh and 25 µl added per well. The strips are incubated for 5 hours
at 50°C. After incubation the strips are washed using, e.g., Nunc-Immuno Wash; first the wells
are washed 3 times, then they are soaked with washing solution for 5 min., and finally they are
washed 3 times (wherein the washing solution is 0.4 N NaOH, 0.25% SDS heated to 50°C).
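
The concentrations in the linkage method can be checked with simple dilution arithmetic. The sketch below assumes that volumes are additive and that the stated 7.5 ng per microliter applies to the solution dispensed into the wells; neither assumption is stated in the text, and the function names are hypothetical.

```python
def stock_volume(final_conc, final_volume, stock_conc):
    """Volume of stock needed so that C_stock * V_stock = C_final * V_final."""
    return final_conc * final_volume / stock_conc


def final_concentration(stock_conc, added_volume, starting_volume):
    """Concentration after adding a stock to a starting volume (volumes assumed additive)."""
    return stock_conc * added_volume / (starting_volume + added_volume)


# 0.1 M (100 mM) 1-methylimidazole brought to 10 mM:
# the stock makes up this fraction of the final volume.
meim_fraction = stock_volume(final_conc=10.0, final_volume=1.0, stock_conc=100.0)

# 25 microliters of 0.2 M (200 mM) EDC added to 75 microliters of DNA solution per well.
edc_mM = final_concentration(stock_conc=200.0, added_volume=25.0, starting_volume=75.0)

# DNA mass per well, assuming 7.5 ng per microliter in the dispensed 75 microliters.
dna_ng_per_well = 7.5 * 75.0

if __name__ == "__main__":
    print(meim_fraction, edc_mM, dna_ng_per_well)
```

Under these assumptions the 1-methylimidazole stock is one tenth of the final volume, the EDC is at 50 mM in each well during incubation, and each well receives 562.5 ng of DNA.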

It is contemplated that a further suitable method for use with the present invention is that
described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated herein
by reference. This method of preparing an oligonucleotide bound to a support involves
attaching a nucleoside 3'-reagent through the phosphate group by a covalent phosphodiester link
to aliphatic hydroxyl groups carried by the support. The oligonucleotide is then synthesized on
the supported nucleoside and protecting groups removed from the synthetic oligonucleotide
chain under standard conditions that do not cleave the oligonucleotide from the support.
Suitable reagents include nucleoside phosphoramidite and nucleoside hydrogen phosphorate.

An on-chip strategy for the preparation of DNA probe
arrays may be employed. For example, addressable laser-activated photodeprotection may be
employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described
by Fodor et al. (1991) Science 251(4995) 767-73, incorporated herein by reference. Probes may
also be immobilized on nylon supports as described by Van Ness et al. (1991) Nucleic Acids
Res. 19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1988) Anal.
Biochem. 169(1) 104-8; all references being specifically incorporated herein.

Linking an oligonucleotide to a nylon support, as described by Van Ness et al. (1991),
requires activation of the nylon surface via alkylation and selective activation of the 5'-amine of
oligonucleotides with cyanuric chloride.

One particular way to prepare support bound oligonucleotides is to utilize the
light-generated synthesis described by Pease et al. (1994) PNAS USA 91(11) 5022-6,
incorporated herein by reference. These authors used current photolithographic techniques to
generate arrays of immobilized oligonucleotide probes (DNA chips). These methods, in which
light is used to direct the synthesis of oligonucleotide probes in high-density, miniaturized
arrays, utilize photolabile 5'-protected N-acyl-deoxynucleoside phosphoramidites, surface linker
chemistry and versatile combinatorial synthesis strategies. A matrix of 256 spatially defined
oligonucleotide probes may be generated in this manner.
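
The figure of 256 probes equals 4^4, the number of distinct sequences over four variable base
positions. The Python sketch below enumerates such a set and assigns each probe an address on a
16 x 16 grid. Treating the 256-probe matrix as all combinations at four positions, and the
row-major grid addressing, are assumptions made for illustration; they are not a description of
the layout used by the cited authors.

```python
from itertools import product

BASES = "ACGT"


def all_kmers(k):
    """Every sequence of length k over A, C, G, T, in lexicographic order."""
    return ["".join(p) for p in product(BASES, repeat=k)]


def grid_layout(probes, n_cols):
    """Map each probe to a (row, column) address, filling rows first."""
    return {probe: divmod(i, n_cols) for i, probe in enumerate(probes)}


probes = all_kmers(4)
layout = grid_layout(probes, 16)

print(len(probes), "probes")
print("ACGT is at", layout["ACGT"])
```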

3.21 PREPARATION OF NUCLEIC ACID FRAGMENTS

The nucleic acids may be obtained from any appropriate source, such as cDNAs,
genomic DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC inserts,
and RNA, including mRNA without any amplification steps. For example, Sambrook et al.
(1989) describe three protocols for the isolation of high molecular weight DNA from
mammalian cells (p. 9.14-9.23).

DNA fragments may be prepared as clones in M13, plasmid or lambda vectors and/or
prepared directly from genomic DNA or cDNA by PCR or other amplification methods.
Samples may be prepared or dispensed in multiwell plates. About 100-1000 ng
of DNA samples may be prepared in 2-500 ml of final volume.

The nucleic acids would then be fragmented by any of the methods known to those of
skill in the art including, for example, using restriction enzymes as described at 9.24-9.28 of
Sambrook et al. (1989), shearing by ultrasound, and NaOH treatment.

Low pressure shearing is also appropriate, as described by Schriefer et al. (1990) Nucleic
Acids Res. 18(24) 7455-6, incorporated herein by reference. In this method, DNA samples are
passed through a small French pressure cell at a variety of low to intermediate pressures. A
lever device allows controlled application of low to intermediate pressures to the cell. The
results of these studies indicate that low-pressure shearing is a useful alternative to sonic and
enzymatic DNA fragmentation methods.

One particularly suitable way for fragmenting DNA is contemplated to be that using the
two base recognition endonuclease, CviJI, described by Fitzgerald et al. (1992) Nucleic Acids
Res. 20(14) 3753-62. These authors described an approach for the rapid fragmentation and
fractionation of DNA into particular sizes that they contemplated to be suitable for shotgun
cloning and sequencing.

The restriction endonuclease CviJI normally cleaves the recognition sequence PuGCPy
between the G and C to leave blunt ends. Atypical reaction conditions, which alter the
specificity of this enzyme (CviJI**), yield a quasi-random distribution of DNA fragments from
the small molecule pUC19 (2688 base pairs). Fitzgerald et al. (1992) quantitatively evaluated
the randomness of this fragmentation strategy, using a CviJI** digest of pUC19 that was size
fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z
minus M13 cloning vector. Sequence analysis of 76 clones showed that CviJI** restricts
PyGCPy and PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated
at a rate consistent with random fragmentation.
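
The two specificities described above can be illustrated with a short Python sketch that locates
cut sites in a sequence and returns the resulting blunt-ended fragments. It is a simplified model
only: it treats the sequence as linear, scans one strand, assumes every site is cut to
completion, and uses function names and a test sequence that are hypothetical.

```python
import re

# Pu = A or G, Py = C or T. Lookaheads are used so overlapping sites are found.
NORMAL = re.compile(r"(?=[AG]GC[CT])")                          # PuGCPy
RELAXED = re.compile(r"(?=[AG]GC[CT]|[CT]GC[CT]|[AG]GC[AG])")   # plus PyGCPy, PuGCPu


def cut_positions(seq, pattern):
    """Indices at which the sequence is cut, i.e. between the G and the C."""
    return [m.start() + 2 for m in pattern.finditer(seq.upper())]


def digest(seq, pattern):
    """Fragments produced by cutting a linear sequence at every matching site."""
    bounds = [0] + cut_positions(seq, pattern) + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


example = "TTAGCTAATGCCAA"  # contains AGCT (PuGCPy) and TGCC (PyGCPy)
print(digest(example, NORMAL))
print(digest(example, RELAXED))
```

In this model the relaxed pattern cuts at every NGCN site except PyGCPu, which is why it produces
more, and shorter, fragments than the normal specificity on the same sequence.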

As reported in the literature, advantages of this approach compared to sonication and
agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 ug instead of
2-5 ug); and fewer steps are involved (no preligation, end repair, chemical extraction, or agarose
gel electrophoresis and elution are needed).

Irrespective of the manner in which the nucleic acid fragments are obtained or prepared,
it is important to denature the DNA to give single-stranded pieces available for hybridization.
This is achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is
then cooled quickly to 2°C to prevent renaturation of the DNA fragments before they are
contacted with the chip. Phosphate groups must also be removed from genomic DNA by
methods known in the art.

3.22 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon
membrane. Spotting may be performed by using arrays of metal pins (the positions of which
correspond to an array of wells in a microtiter plate) to repeatedly transfer about 20 nl of a
DNA solution to a nylon membrane. By offset printing, a density of dots higher than the density
of the wells is achieved. One to 25 dots may be accommodated in 1 mm², depending on the type
of label used. By avoiding spotting in some preselected number of rows and columns, separate
subsets (subarrays) may be formed. Samples in one subarray may be the same genomic segment
of DNA (or the same gene) from different individuals, or may be different, overlapped genomic
clones. Each of the subarrays may represent replica spotting of the same samples. In one
example, a selected gene segment may be amplified from 64 patients. For each patient, the
amplified gene segment may be in one 96-well plate (all 96 wells containing the same sample).
A plate for each of the 64 patients is prepared. By using a 96-pin device, all samples may be
spotted on one 8 x 12 cm membrane. Subarrays may contain 64 samples, one from each patient.
Where the 96 subarrays are identical, the dot span may be 1 mm² and there may be a 1 mm
space between subarrays.
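
The geometry of the 64-patient example can be sketched in Python as follows. The sketch assumes
the standard 9 mm centre-to-centre spacing of a 96-well plate for the pin device and places the
64 patients of each subarray on an 8 x 8 grid of 1 mm offsets; both choices, and the function
name, are illustrative assumptions rather than details stated above.

```python
WELL_PITCH_MM = 9.0   # assumed pin spacing (standard 96-well plate pitch)
DOT_PITCH_MM = 1.0    # one dot per 1 mm step within a subarray
SIDE = 8              # 8 x 8 = 64 dots per subarray, one per patient
PIN_ROWS, PIN_COLS = 8, 12


def dot_position(patient, pin_row, pin_col):
    """(x, y) in mm of the dot printed for `patient` by the pin at (pin_row, pin_col)."""
    off_row, off_col = divmod(patient, SIDE)
    x = pin_col * WELL_PITCH_MM + off_col * DOT_PITCH_MM
    y = pin_row * WELL_PITCH_MM + off_row * DOT_PITCH_MM
    return x, y


positions = {
    (p, r, c): dot_position(p, r, c)
    for p in range(SIDE * SIDE)
    for r in range(PIN_ROWS)
    for c in range(PIN_COLS)
}

max_x = max(x for x, _ in positions.values())
max_y = max(y for _, y in positions.values())
print(len(positions), "dots; extent", max_x, "x", max_y, "mm")
```

With these assumptions each subarray occupies 8 mm and is followed by a 1 mm gap, which
together make up the 9 mm pin spacing, and the whole pattern fits within an 8 x 12 cm membrane.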

Another approach is to use membranes or plates (available from NUNC, Naperville,
Illinois) which may be partitioned by physical spacers, e.g. a plastic grid molded over the
membrane, the grid being similar to the sort of membrane applied to the bottom of multiwell
plates, or hydrophobic strips. A fixed physical spacer is not preferred for imaging by exposure to
flat phosphor-storage screens or x-ray films.

The present invention is illustrated in the following examples. Upon consideration of the
present disclosure, one of skill in the art will appreciate that many other embodiments and
variations may be made in the scope of the present invention. Accordingly, it is intended that
the broader aspects of the present invention not be limited to the disclosure of the following
examples. The present invention is not to be limited in scope by the exemplified embodiments
which are intended as illustrations of single aspects of the invention, and compositions and
methods which are functionally equivalent are within the scope of the invention. Indeed,
numerous modifications and variations in the practice of the invention are expected to occur to
those skilled in the art upon consideration of the present preferred embodiments. Consequently,
the only limitations which should be placed upon the scope of the invention are those which
appear in the appended claims.

All references cited within the body of the instant specification are hereby incorporated 
by reference in their entirety. 

4.0 EXAMPLES 

4.1 EXAMPLE 1 

Novel Nucleic Acid Sequences Obtained From Various Libraries
A plurality of novel nucleic acids were obtained from cDNA libraries prepared from 
various human tissues and in some cases isolated from a genomic library derived from human 
chromosome using standard PCR, SBH sequence signature analysis and Sanger sequencing 
techniques. The inserts of the library were amplified with PCR using primers specific for the 
vector sequences which flank the inserts. Clones from cDNA libraries were spotted on nylon 
membrane filters and screened with oligonucleotide probes (e.g., 7-mers) to obtain signature 
sequences. The clones were clustered into groups of similar or identical sequences. 
Representative clones were selected for sequencing. 
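
The signature-and-cluster step described above can be illustrated with a deliberately idealized
Python sketch. It models a clone's signature as the set of probes that occur exactly in its
insert sequence and groups clones greedily by Jaccard similarity of those sets. The exact-match
model, the similarity measure, the 0.8 threshold and all names are assumptions for illustration;
actual SBH signatures are derived from hybridization signals, and the clustering procedure
actually used is not specified here.

```python
def signature(seq, probes):
    """Set of probes found in the insert (idealized perfect-match hybridization)."""
    return frozenset(p for p in probes if p in seq)


def jaccard(a, b):
    """Similarity of two signatures: shared probes over all probes seen."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def cluster(clones, probes, threshold=0.8):
    """Greedy clustering: a clone joins the first cluster whose representative is similar."""
    clusters = []  # list of (representative signature, member names)
    for name, seq in clones.items():
        sig = signature(seq, probes)
        for rep_sig, members in clusters:
            if jaccard(sig, rep_sig) >= threshold:
                members.append(name)
                break
        else:
            clusters.append((sig, [name]))
    return [members for _, members in clusters]


# Toy data: four hypothetical 7-mer probes and three hypothetical clone inserts.
probes = ["ACGTACG", "CGTTTGA", "GGGGCCC", "CCTTTTA"]
clones = {
    "c1": "ACGTACGTTTGACCA",
    "c2": "ACGTACGTTTGACCA",
    "c3": "GGGGCCCCTTTTAAAA",
}
groups = cluster(clones, probes)
print(groups)
```

One clone from each resulting group would then be taken forward as the representative for
sequencing.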

In some cases, the 5' sequence of the amplified inserts was then deduced using a typical
Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye
terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied
Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. In some cases RACE
(Rapid Amplification of cDNA Ends) was performed to further extend the sequence in the 5'
direction.

4.2 EXAMPLE 2 
Novel Nucleic Acids 

The novel nucleic acids of the present invention were assembled from
sequences that were obtained from a cDNA library by methods described in Example 1 above,
and in some cases sequences obtained from one or more public databases. The nucleic acids
were assembled using an EST sequence as a seed. Then a recursive algorithm was used to
extend the seed EST into an extended assemblage, by pulling additional sequences from
different databases (i.e., Hyseq's database containing EST sequences, dbEST version 119, gb pri
119, and UniGene version 119) that belong to this assemblage. The algorithm terminated when
there were no additional sequences from the above databases that would extend the assemblage.
Inclusion of component sequences into the assemblage was based on a BLASTN hit to the
extending assemblage with BLAST score greater than 300 and percent identity greater than
95%.
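
The seed-extension loop described above can be sketched in Python as follows. The sketch is not
the algorithm actually used: the search itself is abstracted into a caller-supplied function,
here called `find_hits` (a hypothetical name), which is assumed to return (sequence id, BLAST
score, percent identity) tuples, and each newly added member is used as the next query as a
simplification of searching with the growing assemblage. Only the two thresholds come from the
text.

```python
MIN_SCORE = 300        # BLAST score must be greater than this
MIN_IDENTITY = 95.0    # percent identity must be greater than this


def extend_assemblage(seed_id, find_hits):
    """Grow an assemblage from a seed until no qualifying new sequence is found."""
    members = {seed_id}
    frontier = [seed_id]
    while frontier:
        query = frontier.pop()
        for hit_id, score, identity in find_hits(query):
            if hit_id in members:
                continue
            if score > MIN_SCORE and identity > MIN_IDENTITY:
                members.add(hit_id)
                frontier.append(hit_id)
    return members


# Toy stand-in for a database search; ids and scores are invented for the example.
toy_hits = {
    "seed": [("a", 350, 98.0), ("b", 310, 90.0)],
    "a": [("c", 400, 99.0), ("seed", 350, 98.0)],
    "c": [("d", 250, 99.0)],
}
assemblage = extend_assemblage("seed", lambda q: toy_hits.get(q, []))
print(sorted(assemblage))
```

The loop stops when the frontier is empty, which corresponds to the termination condition that
no additional sequence extends the assemblage.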

Using PHRAP (Univ. of Washington) or CAP4 (Paracel), a full length gene cDNA
sequence and its corresponding protein sequence were generated from the assemblage. Any
frame shifts and incorrect stop codons were corrected by hand editing. During editing, the
sequence was checked using FASTY and/or BLAST against Genbank (i.e., dbEST version 121,
gb pri 121, UniGene version 121, Genpept release 121). Other computer programs which may
have been used in the editing process were phredPhrap and Consed (University of Washington)
and ed-ready, ed-ext and cg-zip-2 (Hyseq, Inc.). The full-length nucleotide and amino acid
sequences, including splice variants resulting from these procedures, are shown in the Sequence
Listing as SEQ ID NOS: 1-948.

Table 1 shows the various tissue sources of SEQ ID NO: 1-1896.

The nearest neighbor results for polypeptides encoded by SEQ ID NO: 1-948 (i.e.
SEQ ID NO: 949-1896) were obtained by a BLASTP (version 2.0a19MP-WashU) search
against Genpept, Geneseq and SwissProt databases using the BLAST algorithm. The nearest
neighbor result showed the closest homologue with functional annotation for SEQ ID NO: 1-948.
The translated amino acid sequences which the nucleic acid sequences encode are
shown in the Sequence Listing. The homologues with identifiable functions for SEQ ID NO:
1-948 are shown in Table 2 below.

Using the eMatrix software package (Stanford University, Stanford, CA) (Wu et al., J.
Comp. Biol., Vol. 6 pp. 219-235 (1999), herein incorporated by reference), polypeptides
encoded by SEQ ID NO: 1-948 (i.e. SEQ ID NO: 949-1896) were examined to determine
whether they had identifiable signature regions. Table 3 shows the signature region found in
the indicated polypeptide sequences, the description of the signature, the eMatrix p-value(s)
and the position(s) of the signature within the polypeptide sequence.

Using the Pfam software program (Sonnhammer et al., Nucleic Acids Res., Vol. 26(1)
pp. 320-322 (1998), herein incorporated by reference), polypeptides encoded by SEQ ID NO:
1-948 (i.e. SEQ ID NO: 949-1896) were examined for domains with homology to certain
peptide domains. Table 4 shows the name of the domain found, the description, the product
of all the e-values of similar domains found, the Pfam score for the identified domain within
the sequence, the number of similar domains found, and the position of the domain in the SEQ
ID NO: being interrogated.

The GeneAtlas™ software package (Molecular Simulations Inc. (MSI), San Diego,
CA) was used to predict the three-dimensional structure models for the polypeptides encoded
by SEQ ID NO: 1-948 (i.e. SEQ ID NO: 949-1896). Models were generated by (1) PSI-BLAST,
which is a multiple alignment sequence profile-based searching method developed by
Altschul et al. (Nucl. Acids Res. 25, 3389-3408 (1997)), (2) High Throughput Modeling
(HTM) (Molecular Simulations Inc. (MSI), San Diego, CA), which is an automated sequence
and structure searching procedure (http://www.msi.com/), and (3) SeqFold™, which is a fold
recognition method described by Fischer and Eisenberg (J. Mol. Biol. 209, 779-791 (1998)).
This analysis was carried out, in part, by comparing the polypeptides of the invention with
the known NMR (nuclear magnetic resonance) and x-ray crystal three-dimensional structures
as templates. Table 5 shows: "PDB ID", the Protein DataBase (PDB) identifier given to the
template structure; "Chain ID", the identifier of the subcomponent of the PDB template structure;
"Compound Information", information on the PDB template structure and/or its
subcomponents; "PDB Function Annotation", the function of the PDB template as
annotated by the PDB files (http://www.rcsb.org/PDB/); the start and end amino acid positions of
the protein sequence aligned; the PSI-BLAST score; the verify score; the SeqFold score; and the
Potential(s) of Mean Force (PMF). The verify score produced by GeneAtlas™ software
(MSI) is based on the Profile-3D threading program developed in Dr.
David Eisenberg's laboratory (US patent no. 5,436,850 and Luthy, Bowie, and Eisenberg,
Nature, 356:83-85 (1992)) and a publication by R. Sanchez and A. Sali, Proc. Natl. Acad.
Sci. USA, 95:12502-13597. The verify score produced by GeneAtlas™ normalizes the verify
score for proteins with different lengths so that a unified cutoff can be used to select good
models as follows:

Verify score (normalized) = (raw score - high score)/(1/2 high score)

The PMF score, produced by GeneAtlas™ software (MSI), is a composite scoring
function that depends in part on the compactness of the model, sequence identity in the
alignment used to build the model, and pairwise and surface mean force potential (MFP). As
given in Table 5, a verify score between 0 and 1.0, with 1 being the best, represents a good
model. Similarly, a PMF score between 0 and 1.0, with 1 being the best, represents a good
model. A SeqFold™ score of more than 50 is considered significant. A good model may
also be determined by one of skill in the art based on all the information in Table 5 taken in
totality.

The nucleotide sequence within the sequences that codes for signal peptide sequences
and their cleavage sites can be determined using the Neural Network SignalP V1.1
program (from the Center for Biological Sequence Analysis, The Technical University of
Denmark). The process for identifying prokaryotic and eukaryotic signal peptides and their
cleavage sites is also disclosed by Henrik Nielsen, Jacob Engelbrecht, Soren Brunak, and
Gunnar von Heijne in the publication "Identification of prokaryotic and eukaryotic signal
peptides and prediction of their cleavage sites", Protein Engineering, Vol. 10, no. 1, pp. 1-6
(1997), incorporated herein by reference. A maximum S score and a mean S score, as
described in the Nielsen et al. reference, were obtained for the polypeptide sequences.
Table 6 shows the position of the last amino acid of the signal peptide in each of the
polypeptides and the maximum score and mean score associated with that signal peptide.

Table 7 correlates each of SEQ ID NO: 1-948 to a specific chromosomal location.

Table 8 is a correlation table of the novel polynucleotide sequences SEQ ID NO: 1-948,
novel polypeptide sequences SEQ ID NO: 949-1896, and their corresponding priority
nucleotide sequences in the priority application USSN 09/799,451, herein incorporated by
reference in its entirety.

TABLE 1



Tissue Origin 


Library/RNA 
Source 


HYSEQ Library 
Name 


SEQ ID NOS: 


Null 


mix tissues library 


CTL016 


52 137 189-192 316-325 529 591 


Null 


enriched library


CTL021 


65 84 169 189-192 311 316-325 406 676 727
782 850 


Null 


mix tissues library 


CTL028 


65 


Null 


PCR products 
cloning 


PCR2V1 


34 51 134 189 191-192 224 232 260 311 388
606 623 806 


Null 


mix tissues library 


SUP002 


51 96 103 163 216 272 294 311 316-325 328
378 383 388 446-448 450 453 474 481 500 516 
610 774 780 885 904 922 


adipocytes 


Stratagene 


ADP001 


2 43 51 73 76 88 97 142 166 181 186 188 208 
257 262-263 267-270 282 311 316-325 383 386
427-429 459 463 465 493 507 514 522 545 552 
572 643 651 667 700 721 740 754 758 778 795 
872 881 883 888 947 


adrenal gland 


Clontech 


ADR002 


3-6 10-11 13 16 20-21 24 27-28 33 38 48-49 51
53-54 58 66-67 75 88 97 99 124-125 130 140
157-158 179 188 197-198 200 212-214 216 218
224 229 231 237 257 267 279 281-282 288 302
311 326 362 376-377 381 383 396 398-403 429
443 453-454 456 459-460 474 489 515 526 531-
532 540 545 550 559 564 568 577 581 586 589
599 605 610 613 631-632 643 648 651 667 670
672 681 684 699 703 706 708 717-718 734 736
751 779 785-786 795 813 817 837 871 876 887-
888 897 904 907 916 921 924-926 948


adult brain


BioChain


ABR012


140 208 311 748 810


adult brain
adult brain


Clontech


ABR013
ABR001


51 245 311 316-325 436 717 810 936 
12 51 87 142 169 178 180 245 263 286 288 290 
295 304 308 311-313 375 379-380 403 425 428 
431 458 486 499 503 512 557-558 567-568 606 
610 641 651 695 704 730 741 754 766 810 822 
827 841 850 864 871 884 897 917 920 925-927 
934 946 


adult brain 


Clontech 


ABR006 


2 14-15 22-23 29 32-33 49 66-68 83 99 111-112 
115 129 131 142 147 153 157 163 169 189-192 
200 205 207 212-214 218 221 229 234 256-257 
263 272 276 279 282 292-299 301 311 315 340-
343 349 376-377 383-386 388 403 405 407 410
425 438 453-454 460 463 469 474 489 495 499-
500 511 522 531-532 539 541 545-546 551 556
563 565 571 579-583 591 594 606 626 628 631-
632 643 647 651 678 684 693-692 700 717 721
726 730 732 741 744 754 757 769 772 774 782
788 793 810 820 827-828 853 867 869 875 879
897 913 921-922 925-926 933-934 939-941 947 


adult brain 
adult brain 


Clontech 
Clontech 


ABR008 
ABR011 


1-2 9-10 13 16-18 23 27-28 30-32 37 39 42-43 
46 49-51 66 70 76 80 83 86-87 95-97 109 111-
112 116-117 124 130-131 133-134 136-137 141-
142 146-147 152-157 160 162 169 171 179 184 
189-192 195 200-201 206 211-212 216-218 239 
247-248 250 252 254-258 261-263 271-272 276 
278 282 288 293-295 297 300 302 307 309 311
314-326 328 333 337-341 343 347 349 351-354 
358 360-361 367 374 376-378 381 384 388-390 
393 395-396 400-403 405 407 409 413 414 418-
420 422 427-429 433 438 440-441 445-447 450
453-455 458 460-461 463 466-470 474 476 486 
491-493 496 498-500 507 511 514 520-521 525 
527-529 531-532 534-535 542 546 548-549 551- 
552 557-558 560 562 564-566 568 571-572 578- 
joj joo-otw Dyuoyi 5y4 599 602 606 61 8-619 
621 626 629 631-634 643-644 647 651 656-660 
664 670 672 677 680 684 687-688 691-695 697 
706 709-710 712-714 716-718 721-722 724-725 
727-728 730 733 740-741 745 751-752 754 761 
765 774 777-779 787 790 792-793 799 801-804 
808 810 812 820 822 824 827 831-832 834 836 
845 850 85^861 868-869 871-872 875-876 883 

887 891 897 000 004. 007 Q1 n Q1 Q 01 n oon <v»c 
oo / o7i 071 yvv y\j*t ysjf yi\j ya r/17-920 925- 

927 929 93 1 -934 938-943 946-947 


adult brain 
adult brain 


GIBCO 
GIBCO 


AB3001 

ABD003


51 133 810 892 

16-17 19 40 66 92-94 97 124 131 134 163 186
188 208 213 231 268-270 284 288 295 297 299
311 315-325 340 373 387 396 407 429 469 489
495 498-499 533 542 545 562 568 587 589 618-
619 643 664 687-688 694-695 730 748 836 876
882 884 902 925-926 948 
2 22-24 29 33 43 45 50-51 66 71 75 77 82 87-88 
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Tissue Origin 


Library/RNA 
Source 


HYSEQ Library 
Name 


SEQIDNOS: 








91-92 95 131 140 157 179 188-192 200 208 213 
220 225 247 252 257 261 263-265 277 284 288 
295 299 301 315-325 355-356 373 387-389 392
395-396 407 423 431 443-444 450-451 457 459
468 476 489 495 499-500 514 520-522 532-533
541-542 545-546 557-558 562 564 576-577 581-
583 588-589 591 595 597 599 601 610 619 631-
632 639 643-644 654-655 655-ooU oo4 00/ o/o
682 687-688 693 696 700 704 711 713-714 746
758 765-766 774-775 780 800 802 804 807 810
827 829 834 842 850 854-855 866 870-871 878 
892-893 897 899 910 916 920-921 929 931-932 
934 


adult brain 


Invitrogen 


ABR014 


2 51 65 84 86 134 311 316-325 384 422 445 460
503 525 564 634 651 721 794 804 810 922 


adult brain 


Invitrogen 


ABR015 


37 134 263 272 277 294 311 443 467 500 514 
582-583 619 651 694 850 871-872 883 888 936 


adult brain 


Invitrogen 


ABR016 


19 22 57 134 188 233 271 277 299 373 440 444 
459 469 514 640 717 882 890 920 


adult brain 


Invitrogen 


ABT004 


1-2 18 28 51 55-57 67 87-88 115 119 137-139 
142 163 200 204 213 218 257 263 271 282 288 
299 301 311 341 358 370 378 402 407 422-423
427 458 460 463 499 504 534-535 551 557-558
571 586 605-606 610 618 627-628 640 643 680
687 691-692 697 701-702 715 719-721 725 727 
753-754 758 771 782 810 827 859 871-872 881 
913 920 925-926 938-941 944 946 


adult heart 


GIBCO 


AHR001


1-2 5-6 14-18 20-21 23 28 32 37 41 45 51 53 55- 
56 62 66 69-70 80-81 85 87 91 97 107 120-121 
124 134 140-141 156 163 165-166 172 188-192 
195 197-198 200 208 213 216 221 229 231 235
261-265 267 271 276 284 288 302 305 308 311
316-325 328 333-334 337-338 347 368-369 373 
376-377 379-380 389 396 420 440 445 453-454 
459-460 465 468 478 483-484 489 491-493 495 
501 504 507 514 524 529 533 539 541-543 545 
549 552-553 564 566 568 574 577 581-583 587 
589-591 596 599 602 605 608-609 618-619 623 
625 629-632 643 645 647 651 664 672 676 678 
683-684 707 714 716-717 732 735 740 743-744 
751 754 757 765 775 778 784-786 788 807-808 
810 826 828-829 842 850 860 876 878-880 890 
894 897 899 902 916 923-927 933 939-941 


adult kidney 


GIBCO 


AKD001 


1-2 5-6 13 16-17 19-23 26 28 33 38-39 43 45 48- 
51 55-57 60 66-67 69-73 79 82-83 87 90 94 96- 
97 100 103 126 131 134 140 148-149 157 163 
166 179 184 186 188-192 200-203 213-216 220- 
221 224 226-229 232 235 245 252 257 261-263 

060. oift 779 974 OlfiJJll 97Q 989 9fffl 900 994 
299 308 311 316-325 332 335 339-340 358 360-
363 373 375 379-380 386 388-389 392 395-396
402 413 421 423-424 428-429 431 436 440 444
450 454 457 459-460 468-469 476 489 492-493
499 504 511 513-514 520-521 524-526 531 533
538-542 544-547 552 564 567-568 574 577-578
582-583 590-591 595-596 598 602 607 610 613
618-619 622 631-632 639-642 644 647 651 654-
655 658-659 664 667-669 673 678 680-682 684
687 689 693 696 706-707 712 714-715 717-718 
721 729-731 734-736 740 744 748 754 760 771 
774 782 784 789 795 807 809-810 819 825 834 
836-837 842 850 859 870 872 876 878-879 884 
887 890 895 897-899 902 905 910 919-921 925- 
926 933 936 944 


flHnlt HrWv 

* *— -v 


1 T«^,*f^«^ 0 « — 




1 14-15 28 3G 35-37 53-54 73 88 112 114 129
134 137 140 149 157 166 172 186-188 191-192
203-204 213 235 245 257 262-263 266 268-270 
273-274 288-289 297 299 302 310-313 315-325 
335 340 358 373 378-381 395 413 423 441 450 
453 456 459-460 470 477 491-494 500 513 540 
542 545 554 556 564 567 587 591 619 622 627 
633 643 668-669 677 684 689 693 701-702 704 
714 729-730 754 758 760 777 781 785-786 788- 
789 807 836-837 840 849-850 872 876 881 890 
895 905-906 913 923 925-926 931-933 944 


adult liver


Clontech 


ALV003 


159 179 189-192 201 219 257 349 392 568 664 
753 796 887 934


adult liver 


Invitrogen 


ALV002 


5-6 28 35-36 52 54 70 72 86-87 303 112 127 134
140 159 179 188 200-201 213 218-219 225 239-
240 257 263 271 275 311 315 367 373 388 392
444 459-460 464 468 497-499 512 527 532 542 
545 562 599 605 629 640 657 680 684 687-688 
706 713 715 717-718 721 742 754 758 771 791- 
793 818 829 843 854-855 871 878-879 887 921 
933-934 


adult lung 


GIBCO 


ALG001 


5-6 16 28 38 51 74 97 122 124 134 140 163 188- 
192 200 218 221 262-263 268-272 294 311 316-
325 379-380 429 463 468 493 511 520-522 537-
538 542 545 568-569 595 622 643-644 664 667
711 714 721 730 754 775 850 860 863 879 887
897 925-926 944 


adult lung 


Invitrogen 


LGT002 


2 5-8 13 16-17 29-31 35-39 43 46 57 67 72 76 
78 81 85 87 90 94 97 100 110 119 122 130-131
134 137 140 146 149 167 172 174 179 188 197- 
198 201 213 216 218 220-221 223 231 245-246 
251-252 256-257 262-263 267-270 277 284 288 
296 299 301-302 311 316-325 340 354 373 379- 
380 388 392 395 400-401 410 413 421 431 436 
441-443 445 451 455 457 460 463-464 467 469 
475 478 489 491 493 497 499 504 507 514 518- 
519 524 529 534-535 537 542 545-546 548 552 
555 559 568 578 581-583 592 597 602-603 605- 
607 613 615 619 621-622 636-637 642-643 646- 
647 654-655 679-681 684 687-689 693 701-702 
704 706 711 713 715-716 718 727 732-734 738 
748 753-754 757-758 760 762 766 769 774 782 
785-786 802 817 829 834 ftfO RSI RSQ-R*n kaa 
867 870-871 878-879 887 890 899 902 904 910 
917 923 925-926 936-937


adult spleen 


Clontech 


SPLc01


33 38 57 67 75 87 134 142 163 216 223 229 244 
257 304 307 311 316-325 340 355-356 378 441
468 525 538 545 560 564 599 721 754 766 780 
794 827 841 850 866 


adult spleen 


GIBCO 


ASP001 


2 14-15 20-22 29 38 43 48 51 53-56 65 67 72 74 
84 87 131-132 134 137 140 172 188-192 200 
212 221 256 263 271 282 308 311 316-325 343
383 389 423 436 441 443 459-460 467 469 495
499-500 505 514 520-522 524 529 537 539 545 
552 585 619 631-632 639 643 664 673 707 723 
735 742-744 758 771 799 810 817 836 850 878 
925-926 934 936


bladder 


Invitrogen 


BLD001 


5-6 8 20-21 28 72 91 122 126 130 166 188 197 
200 213-214 225 257 262 315-325 341 409 486 
491 572 593 622 650 673 691-692 810 813 861 
870 877 883 887 904 


bone marrow 


Clontech


BMD007 


65 76 84 245 516 


bone marrow 


Clontech 


BMD001 


8 13-16 28 38 43 45-48 50-51 57 62-63 65 67 
84-85 97 100 104 118 122-124 131 134 140 163 
188 214 216 221 224 231 245 252 261-263 268- 
270 273-274 279 288 290 311 373 378 389-391
395 414 428 431 436 440-441 443 451 455 459- 
460 465 469-470 475 495 497-498 502 507 514- 
519 529 537-538 542 546 550 552 556 560-561 
563-564 568 576-577 580 587 589 596 601-602 
610-613 619-620 626 642-643 647 651 664 666 
668 676 678 681-682 684 696 704 706-707 715 
727 730 732-735 740 748 753 758 761 764 771 
775 780 794 800-801 830 834 836 842 850 863 
871-872 878-879 882 884 888 897 900-901 904 
910 921 923 929 934 947 


bone marrow


Clontech 


BMD004 


65 


bone marrow 


GF 


BMD002 


1-2 5-6 10 13 16-21 27 31 38 42-43 46 57 65-66 
76 80 84 87 97 99 110 112 118 131 134 137 140 
145 161 163 165 172 195 206 208 221 229 231 
237 244 247 252 256 267-270 272 276 278-279 
282 284 288 294 301 304 307 311 316-327 333-
334 337-338 345-347 352 360-361 368 373 376- 
378 381 383 388 414 436 441 443 450 452 454- 
455 457 469-470 483-484 486 490 498 516 519-
521 524 530-531 539 542-543 545-546 551 553 
555 559 564 571 576-577 580 585 591 594 602 
604-605 607-608 610-612 619-621 625-626 629 
631-632 639-640 644 650-651 664-665 684 687- 
688 693 699 703 714 723-724 727 733 735 740 
742 745 748 750-752 754-755 777-780 784 787 
794-795 802 809 817 824 827 831-832 834 846- 
847 850-851 854-855 861 867 875 878 883 886 
891 894 897 900 902 910 914 919 921 925-926 
929 936 939-941 944


cervix 


BioChain 


CVX001 


3-4 14-16 20-23 25 33 42-43 45 48-50 54 57 67 
69 75 85 87 91 95-97 107 110 114 124 126-127 
131 134 137 140 150 157 163 165 172 185-188 
200 204 212-213 216 225 229 245 252 257 261- 
263 266-270 276 282 288 290 301-302 308 316- 

<iot ao7 ^An 'XMJXKd W7J\T\ ^7R ^81 ^88-^92 

jLj DjLJ J'rX} D/4> m J/0 «? / O JO J JOOV^i 

394 396 409 413-414 421 428-429 438-440 443- 
444 454 456-457 459 463 467 475 486 489 493 
495 507 514-515 522 534-537 556 568 572 574 
577 582-583 587 594 600 608 610 613 622 626 
633 639 643 647-648 651 653 667 680 683 685- 
686 693 696 703-704 706 711 721 723-725 727
730-731 734-735 742-743 748 754-757 762 771 
776 785-786 788 794 800 802 807 809-810 817 
827 829 834-835 842 850 857 860 862-863 868
870 873 876-877 879-880 884 887 891 897 904- 
905 910 916-917 921 925-926 933 937 947 


colon 


Invitrogen 


CLN001


19-21 53 55-57 72 88 133-134 168 213 245 252 
311-313 316-325 340 443 459 469 483-484 486 
497 515-516 597 606 622 643 667 676 706 718 
742-743 753 766 829 833 872 887 902 923 929 




BioChain 


DIA002 


305 311 


endothelial cells 


Stratagene


EDT001


1-2 7-8 14-16 19-22 24 28-29 32-33 41 43 45 51 
57 61 74 83 87-88 97 105 112 116-117 131 134
137 140 148 165 172 179 188-192 197-198 208 
212-213 220-221 225 229 231 237 246 252 256- 
258 261-265 268-272 276-277 279 281-282 284 
286 288 294 297 299 302 307-308 311-313 326 
334-335 340 355-356 358 360-361 364 375 383 
386 389 392 403 413 423424 429 440 443 445 
451 453 455-456 459-460 462-463 465-466 468- 
470 475 491 495 497-499 504 514 520-522 524- 
526 528 532-536 539-540 546 551-552 554 556 
564 566-567 571 574-577 581-583 587 591-593 
597-599 601 607 615 618 622 625 633 639 641- 
644 651 667 677 680 684 691-692 701-702 704 
716-717 720-721 726 732-733 735 743-744 754 
758 765 785-786 795 802 806 809 819 826 828-
830 832 834 836 846-847 850 867 871 877-878 
890-891 897 902 907 921 923 925-926 944 946 


esophagus 


BioChain 


ESO002 


188 


fetal brain 


Clontech 


FBR001 


33 49 51 126 134 197-198 264-265 360-361 413 
460 647 810 819 871


fetal brain 


Clontech 


FBR004 


137 156 205 282 284 405 424 480 489 701-702
820 921 


fetal brain 


Clontech 


FBR006 


2 9-10 18-19 22 28 30-32 37 39-40 42-43 46-47 
49 57 66-67 76 80 83 96 109 112 116-117 120
124 131 133-134 136 142-143 146 152 155 160 
162 165 169 173 184 189-198 200-201 205 215-
216 238 244 248 254-255 257-258 260-263 272- 
274 276-277 282 288 293-294 307 309 311 314- 
328 343 347 351-352 354 357-358 360-361 373- 
375 378-381 390 392 400-401 403 405 407 410- 
411 413 420 424 429 445 450 452-453 458 460 
463 467-469 472 474 477 479 483-484 491 499 
507 520-521 525 527 529 531 533 538 545 551 
562 564 566 571 574 579 581-583 587 591 599- 
600 604 606 611 626 629 631-632 638 643 651 
654-655 657-660 672-673 676-677 684 689 693- 
694 697 699 709 714-715 717 720-721 732-733 
735 744 748 751-752 754 761 763 767 772 775 
777-779 781 785-786 790 792 802 804 808 810 
820 824 826 838-840 850 858-860 864 866 872- 
873 881 891-892 901-902 904 910-911 913 917- 
918 920 925-926 933 939-941 946-947 


fetal brain 


Clontech 


FBRS03 


316-325 684 


fetal brain 


GIBCO 


HFB001 


2 12 16-17 19 23 27-28 32-33 39 41-45 49 87-89 
94 97 100 107 112 130-131 134 142 157 163 
172 188-192 200 216 224-225 231 237 242 246 
252 258 261 263-265 271 273-274 276-277 288 
295 299 301 307 311 314-326 328 341 355-356 
373 375 387 389 392 395 424-425 431 438 445 
450-452 457 459-460 468-469 475 489 491 495 
500 504 511 514 520-529 531 533 540-542 545 
552 554 557-558 566 576-577 579-584 587 591 
596 598-599 606 613 626 631-632 643 651 664 
668 673 676 680 693-694 696 703-704 716-717 
721 727 735 738 740 744 748 757-758 769 774 
778 780-781 810 827-828 830 850 [illegible] 871-872 
876 878-879 884 890 892 897 899 904 906-907 
913 916 918 920 924 928 934 938 946 


fetal brain 


Invitrogen 


FBT002 


2-4 20-21 45 51 53 57 88 93 125-126 134 166 
184 186 188 200 213 224 263 276-278 307 311 
341 373 375 418-419 423 427 432 450 452 459- 
460 470 492 498-499 507 514 522 534-535 545 
550-552 571 577 610 714 721 743 754 795 827 
861 866 872-873 887 896 925-926 934 939-941 
946 948 


fetal heart 


Invitrogen 


FHR001 


2-4 10 13 16-18 29 31-32 37-38 43 46 49 51 53 
55-56 67-68 75 80 85 87 97 115 120 137 152 
156 160-161 163 168-169 174 178 189-192 196 
200 216 220 225 252 262 276-277 282 288 301- 
302 305 311 315-325 333 343 351 357-358 360- 
361 368-372 378 424 436 440-441 445 453 460 
469 478 483-484 495 520-521 527 533 538 541- 
543 546 556 564-566 568 576 581 587 594-595 
601-602 606 609 612 615 633 638 640 643 653- 
655 664-665 672-673 677 684 691-693 697 704 
707 709 717 735 738 744 746 748-749 751-752 
754 761 777 779 781-782 785-786 797-798 820 
824 826 829 834 838 841-847 850 875 877-878 
893-894 897 901 910 913 925-927 936 946 


fetal kidney 


Clontech 


FKD001 


8 14-15 32 43 50 68 96 106 126 131 134 140 
186 188 226-228 233 279 282 311 339 428 440 
450 456 468 552 618 651 700 726 735 748 751 
781 794 797-798 826 878 887 [illegible] 


fetal kidney 


Clontech 


FKD002 


50 83 96 131 134 143 163 172 193-194 201 203 
215 263 273-274 311 316-325 339 360-363 374 
376-377 379-380 388 394 400-401 403 407 425 
440 451 454 493 525 536-538 540-542 572 580 
582-583 587 605-606 621 631-632 647 673 689 
706 709 714 726 735 761 774 777 799 809 845- 
848 858 872 875 878-879 882 895 918 927 


fetal kidney 


Invitrogen 


FKD007 


66 214 


fetal liver 


Clontech 


FLV002 


52 189-192 219 297 308 335 364 378 427 828 


fetal liver 


Clontech 


FLV004 


2 19 28-29 37 39 49 52-53 55-56 62 65-66 76 87 
124 134 137 139 142 179 188 195 208 216 219 
244 252 263 268-270 272 277 287-288 294 303 
305 311 315-325 339 355-356 358 360-361 368 
374 378 403 441 454-455 460 477 483-484 497 
514 520-521 542 553 582-583 587 591 594 611- 
[illegible] 654-655 658-659 681 684 
687-688 709 721 730 738 744 752 754 781 793 
802 813 818 826 832 836 854-855 876 878 893 
897 900 910 924 933 944 


fetal liver 


Invitrogen 


FLV001 


2 7 19 28 35-37 47 52 54-56 66 95 134 139 179 
188-192 200 213 218 263 272 288 294 305 311 
315 349 378-381 388 392 403 426 443 454 459- 
460 469 496-499 514 527 529 532 534-535 555 
586 605 640 644 658-659 673 680 687 698 713 
715 720-721 723 726 754 758 778 795 817-818 
829 853 861 868 870-871 897 903 933 


fetal liver-spleen 


Soares 


FLS001 


1-23 25-39 41-43 45-46 49-65 76 83-84 87-88 
91 94 97 100 112 118 122 126 129-130 134 137 
140 148 163 168 172 179 186 188-189 191-192 
197-198 200-201 213-214 216 221 225-229 231- 
232 235 242 252-253 256-258 262-271 277 284 
287-288 297 299 307-309 311 315-326 330 355- 
356 360-362 370 373 378-380 388 392 394 396 
400-402 413 426 428 436 440-441 443-447 450 
454-455 457 459-460 463-465 467-469 475 477- 
478 489-509 511-514 519-521 525-527 529-535 
537 539 542 545 551-552 555-556 559 561 567 
569 571 576-577 580-583 586-587 589 591-592 
595 598-602 605 607 610 612-613 618-619 623 
625-626 631-632 638-640 642-644 646-647 649 
651-652 654-655 667 673 676 680-681 683-684 
700 703-704 706 711 715-718 720-721 726 732- 
735 740 742-744 748 754 756-758 763-765 771 
774-775 777-782 785-786 790 793-795 797-798 
806-808 815 818 824-826 829-830 834-835 837 
841 846-847 849-850 856 860-861 866 870-872 
876 878 [illegible] 888 894 897-898 902 905 907 
910 919 924-926 929 933 942 947 


fetal liver- 
spleen 


Soares 


FLS002 


1 3-4 11-12 14-17 20-23 26-29 32-34 38 41 43 
45-47 49 51-52 55-62 65-67 76 83-85 87-88 90- 
91 95 97-99 104-105 112 114-117 126 130 133 
150 163 165 172 178 186-187 193-194 200-[illegible] 
208 213 221 225 229 232-235 244-246 248-253 
256-257 262-265 267-271 273-274 284 287 299 
311 315 326 335 337-338 343 355-356 358 375- 
378 381 392 394 400-402 414 416 426 428-429 
440-442 444-447 453-455 457 459 461 464-465 
467 476-477 483-484 489-490 492 495 497-500 
504 506-507 509 511-514 519 522 524 526 532- 
535 537 539-540 542 545 551-552 556 567 569 
574 576-577 581 589-590 592 599 601-602 605 
607 610 612-613 619-620 625 627 629 631-632 
638 640-641 646 648-649 654-655 667 670 683- 
684 687-688 693 696 700 703-706 713 716-721 
726 734-735 740 742 744 748 754 758 771 775 
777-778 780-782 785-787 790-792 794-799 801- 
802 806 808 818 824 829 835 849-850 852-855 
857 870-871 876 882-884 886 888 890 894 897- 
898 900 902-903 907 919 921 923-926 929-930 


fetal liver- 
spleen 


Soares 


FLS003 


30 34 67 85 88 99 130 172 188-189 191-192 213 
229 231 257 311 315-325 329 331 335 362 391 
394 400-402 423 441 455 457 461 476 498 500 
[illegible] 542 576 587 592 612-613 
625 649 665-666 703 719 731 733 740 744 771 
775 777-778 787 797-798 819 824 826 850 854- 
856 861 863 870 879 884 897 923 931-933 947 


fetal lung 


Clontech 


FLG001 


65 179 213 223 340 360-361 491 564 577 591 
627 646 650 712 715 744 758 939-941 


fetal lung 


Clontech 


FLG004 


388 921 


fetal lung 


Invitrogen 


FLG003 


49 54 75 97 137 148 152 188 197-199 213-214 
225 240 256 288 316-325 369 378 392 423 429 
464 496 526 580-581 586 591 693 706 726-[illegible] 
766 878 913 925-926 939-941 


fetal muscle 


Invitrogen 


FMS001 


28 65 115 121 126 134 137 156 168 172-173 
181 213-214 225 263 267 305 340-341 360-361 
440 459 516 534-535 543 564 586 606 609-610 
623 650 676 683 754 766 853 871 886 894 930 
934-935 948 


fetal muscle 


Invitrogen 


FMS002 


19-21 41 49 51 53 57 75 96 101 103 112 134 
136 156 171 184 188 191-192 212 216 250 262 
267 276 305 311 342 348-350 355-356 360-361 
374 392 403 411 415 423 425 457 469 491 495 
499 508 515 517 534-536 543 546 564 566 576 
580 582-583 587 594 599 609 611 615 618 623 
644 647 658-659 664 668-669 677 683 691-692 
696 703 735 743 754 766 788 802 817 826 828 
850 877-879 894 910 925-926 935 


fetal skin 


Invitrogen 


FSK001 


3-7 18 24 27 29 35-37 51 53 55-56 66-67 76 90 
95 97 122 126 134 136-137 166-167 181 188- 
192 208 213-214 224-225 245 250252 257 260 
262 268-271 273-274 282 284 297 302 312-313 
315-326 341 367 373 375 378 383 387-388 390 
394 423 429 440-441 450 454-455 457 459 463- 
464 470 472 475 486 489 492 495 498-500 511 
514 524 527 530-532 534-535 541-542 545 550 
552 555-556 565 571 586 589 591-592 602 604 
606 610 613 618 622-623 626 631-632 640 651 
654-655 672-673 685-686 693 701-702 704 706 
717-718 720 723 727-728 744 754 762 764-766 
768-771 795 809 814 821 824 827 843 853 868 
870-874 887-888 890 897 902 907 925-926 928 
930 933-934 939-941 944-945 947 


fetal skin 


Invitrogen 


FSK002 


2 5-6 19 29 34 51 57 59 88 97 101 124 131 134 
143 163 166 172 189 191-192 196 212 216 222- 
223 231 250 257 263 268-272 282 284 287-288 
294 297 299 302 304 310-311 316-325 328 333 
340 352 360-361 365-367 372 379-380 388 390 
400-401 403 410-411 440 449-450 454 457 463 
470 478 491 495 500 505 515 520-521 524 532 
534-535 541 555 560 562 564 572 576 581 592 
595 599 611 622 626 630 636 640 642 650 664 
677 683 691-693 696 699 701-702 708-709 715 
721 723 728 735 744 747-748 750 754 766 779 
782 799 803 807 813 820 824 826 834 846-847 
867-868 872-874 878-879 890-891 897 901 904 
907 910 912 916 918 925-926 933 944-945 


fetal spleen 


BioChain 


FSP001 


311 748 


infant brain 


Soares 


IB2002 


2-4 12 14-15 20-21 23-24 27 29 31-32 39 41 46 
48-49 51 53 55-56 66 75-76 86-88 93 95 101 
105 108-109 116-117 125 127 129 131 136 145 
163 166 170 180-181 186 188-189 191-192 200- 
201 207-208 212-214 216 220 224 229 231 245 
247 252 257 259 264-265 267 271 279 282 288 
293 295 299-300 311 314-326 337-338 340 349 
367 373 375 388 390-393 396 402 405 407 418- 
421 424 428-429 431 433 436 450 452-453 457 
459 463 468 489 495 498-500 507 511 522-524 
526 528-530 532 541-542 545-546 552 557-558 
562 564-566 571 577-583 587 589-591 599-601 
606 608 613-614 619 631-632 647 654-655 658- 
659 667 676 684 691-693 696 700 704 711 718 
721 723 725 740-741 743-744 748 754 775 777- 
778 780 788 792-793 795 802 805 808 819 826- 
829 834 836 838 861 863 869-870 875 879 881- 
882 884 887 [illegible] 893 897 902 920-921 925- 
926 934 938-941 946 


infant brain 


Soares 


IB2003 


[illegible] 
126-127 131 142 166 170 188 200-201 208 214- 
215 220 224 226-228 231 251 257 263 267 271- 
272 279 288 293 299 311 314-326 337-339 349 
360-361 367 386 392 397 400-402 407 410 418- 
419 424-425 427 429 452 454 460 475 489 495 
497-500 507 522-523 525 529 532 539 542 545- 
546 551-552 557-558 564-565 578 582-583 585 
591 601 606 625-626 631-633 643-644 673 690- 
693 701-702 706 711 721 723 734 740-741 743- 
744 748 751 754 761 778 788 795 802 808 819 
826-827 829 837 843 869-871 875 878-880 884 
896-897 902 920 933-934 946 


leukocytes 


Clontech 


LUC003 


12 14-15 18 32 111 134137 172 221 277 280 
311 316-325 436 454 467 549 552 568 585 603 
643 691-692 698-699 734 744 751 754 784 797- 
798 861 897 916 923 


leukocytes 


GIBCO 


LUC001 


2-4 7-8 13-17 20-23 31-33 38 43 48-49 51 53-57 
63 66-68 74-78 85-88 93 97 122 124 129 131- 
132 134 137 140 163 166 168 171-172 175 188- 
192 197 200 208-213 216 221 223 231 236 242 
252 257-258 261-263 268-270 272 277 279 287- 
288 294 307 311 314 316-326 329 337-339 341 
373-374 376-377 381 388-392 396 400 401 413- 
414 423 436 441 450 454-455 459 463 465 467 
489 491493 495 498-499 504-505 507 514 518 
520-522 524 526 529 531 533 536-537 539-540 
545 552-554 556 568 571 577 580 585 589-590 
596 599 602 605 607 610 612-613 615 618-619 
621-622 625 638 640 642-644 664 667 677-678 
684 690-693 696 700 703-704 707 713 715-718 
721 727 734-735 738 740-746 748 753-754 758 
775 778 780 789 794 797-798 801-802 815 817 
825 827 829 834 836 846-847 850 859-861 863- 
864 866-867 871-872 878 884 886-888 891 896- 
897 902 904 910 913 916 921 923-926 929-932 
[illegible] 
936 943 


lung 


Stratagene 


LFB001 


2-4 22 28 32-33 47 53 79 120 129 134 140 163 
172 188 208 220-221 231 252 257 263 276-277 
284 307 375 378-380 396 423 428 440 450 459 
463 486 491 493 495 499 539 571 591 601 607 
613 615 618 625 639 651 684 716-717 721 727 
735 748 782 828 850 870-871 


lymph node 


Clontech 


ALN001 


43 98 131 140 163 188 221 245 277 299 311 491 
515 546 564 593 603 610 615 630 682 694 707 
717 800 831 850 878 880 936 939-941 947 


lymphocyte 


ATCC 


LPC001 


2 16 19-21 25 31-32 49 53 55-56 63 67 85-87 90 
97 120 122 137 140 163 165 168 172 188 197- 
198 215-216 221 229 231 236-238 248 252 256 
272-274 283-284 288 294 299 316-326 343 368 
374 378 395 423 431 454-455 467 469 476 478 
491 495 498 505 512 515 517-518 520-522 524 
526 529 531-532 537 539 542 545-546 551 556 
571 577 [illegible] 
640 642 654-655 664 666 668-669 673 684 703 
708 716 718 721 723 727 733 735 743-744 754 
758 765 771 775 777-779 783-786 797-799 810 
816-817 828 834 845 859 861 863 870 878-879 
881 884 887 890 897 904 907 910 912-913 918 
923 929 939-942 945-947 


macrophage 


Invitrogen 


HMP001 


49 97 208 252 301 306 311 316-325 337-338 
345-346 416 512 522 572 670 716 743 785-786 
802 888 919 923 


mammary gland 


Invitrogen 


MMG001 


1-8 14-18 20-21 25 28-29 37 39 43 49 51-57 60 
66-67 72 75-76 87 95 97 103-104 106 112 115 
119 122 127 130 134 137 139 142 150 166-168 
172 175 184 186 188-189 191-192 200 213-214 
222-224 226-229 240 252 257-259 263 267 271 
276 278 282 287-288 299 301-302 305 307-308 
311-313 316-325 327-328 332 340-341 358 360- 
362 369 373 378 381 383 388 390-392 397 402- 
403 409 415-416 423 425 428-429 433 436 444 
454-456 459-460 464 467 469 481 483-484 486 
493 495 498-499 515 524-525 529-530 532-537 
541-542 545 551-552 562 582-583 586-587 593 
599-600 602 604-605 610 618-619 622 625-627 
634 644 646-647 652 654-655 662 673 676 680 
684 687-688 691-692 701-703 715 717 721 723 
726 735 743 751 754 758 765-766 771 777-778 
789 803 805 807 809-811 821 827 829 850 860 
887-888 892 896 898 901-902 905 911 913 917 
925-926 930 936 939-942 


melanoma 


Clontech 


MEL004 


3-4 16 20-21 [illegible] 
213 216 221 231 241 245-246 260 262-263 316- 
325 381 407 431 504 525 527 542 556 568 577 
589 596 607 613 676 693 714 735 737-739 744 
[illegible] 


neuron 


Stratagene 


NTO001 


2 16 32 51 66 88 97 124 130 134 137 172 188- 
189 191-192 231 252 257 260 277 291 373 424 
431 454 460 489 495 523 525 582-583 591 631- 
632 643 649 670 695 725-726 735 765 789 797- 
798 837 850 878 884 888 890 913 929 946 


neuron 


Stratagene 


NTR001 


[illegible] 
375 381 410 457 462 475 495 531 546 548 552 
599 618 678 743 752 819 828 890 895 897 930 
934 938 944 946 


neuronal cells 


Stratagene 


NTU001 


2 5-6 20-21 55-56 87 137 188-192 197-198 215- 
216 260 287 291 310-311 316-325 365 375 423 
457 459 470 499 532 542 564 576 598-599 623 
643 651-652 673 721 726 743 745 752 754 765 
780 787 789 822 829 870 875 888 896 917 919 
929 


ovary 


Invitrogen 


AOV001 


2-7 10 13 18-22 25 27-28 30-31 33 38-39 41-43 
45 48 50-51 53-56 62 66-67 69 72 74-75 80 83 
85-87 93 95 99-101 107 112-115 120 124-126 
129 131 134-137 140 142-143 147-148 162-163 
172 178 188 191-192 200 204 208 212-213 220- 
221 225 229 231 235 237 246-247 252-253 258 
261-262 264-265 267-272 276-279 282 287-288 
290 293-294 299 307-308 311 316-327 332 337- 
338 340-342 349 360-362 373-374 379-381 386 
388-389 393 396 399 403 413-414 423 425 427 
429 431 441 444-445 450 452 454-455 457 459- 
460 462 467470 475 477 483-484 489 491-493 
495-496 500 504-505 507 515 518-519 522-523 
527-529 531 533 537-540 542 545-546 548 551- 
552 555 564 568 570-571 577 579 581-583 589- 
596 599-600 605 610 613 615-616 619 623 625- 
627 630-636 639 641-644 647 649 651 654-655 
664 668-669 672 676 678 680-682 684 687 694 
701-704 706-707 715-717 721-722 727 729 731- 
732 734-735 738 740 743 748-749 753-754 758 
764 771 775 777-782 784 800 802 807 821-822 
824 828 834 836-837 842 846-847 850 860-863 
866 870-871 876-880 882 884 887 890-891 897 
899 901 906 910 913 920-921 923-924 933 939- 
941 944947 


pituitary gland 


Clontech 


PIT004 


41-42 83 85 97 134 193-194 204 208 213 224 
257-258 263-265 285-286 308 311 360-361 413 
443 445 491 514 529 532 639 644 647 682 701- 
702 716 781 822 829 836 850 933 939-941 947 


placenta 


Clontech 


PLA003 


16 31 34 49 66 80 87 97 101-102 134 158 165 
172 179 184 188 197-198 209-210 218 220 229 
235 249 256 267-270 277 287-288 302 307 332 
360-361 365 388 394 414 441 444 454 457 460 
493 498-500 505-506 509 529 531-532 550 559- 
560 564 572 587 601 625 630-632 638 672 682- 
684 689 706 708 726 733 735 744 754 761 784- 
786 793 863 875 897 924 929 937 


placenta 


Invitrogen 


APL001 


34 68 102 263 444 493 520-521 534-535 689 
706 754 797-798 


placenta 


Invitrogen 


APL002 


2 14-15 43 55-56 66-67 134 184 213 221 229 
252 257 263 277 287 394 443 529 532 618 622 
684 742 754 810 829 883 902 


prostate 


Clontech 


PRT001 


7-8 51 85 87 97 100 122 134 139 214 216 221 
231 257 271 [illegible] 
440 459 477 530 534-535 546 556 582-583 599 
622 631-632 639 651 663-664 673 683 707 715 
735 740 765 773-774 777 810 823 897 909 919 
934 939-941 947 


rectum 


Invitrogen 


REC001 


18 54 66 134 137 169 188 200 213 225 251 263 
288 311-313 316-325 340 388 423 429 441 454 
459 514 532 542 610 626 646 651 657 715 719 
723 728 735 740 758 766 785-786 823 829 833 
836 886 942 


saliva gland 


Clontech 


SALS03 


460 


salivary gland 


Clontech 


SAL001 


31 49 78 95 134 136-137 143 176 188 208 223 
244 268-270 284 308 311 316-325 388-389 391 
436 441 459 476 514-515 520-521 [illegible] 
589 596 610 619 684 691-692 713 718 727 736 
754 777 824 836 864 867 878 883 897 901-902 
916-917 933 938-941 


skeletal muscle 


Clontech 


SKM001 


42 98 156 163 191-192 200 261 305 311 395 
415 462 468 504 531 543 566 582-583 585 594 
680 740 853 875 927 933 935 


skeletal muscle 


Clontech 


SKM002 


850 


skin fibroblast 


ATCC 


SFB001 


379-380 850 


skin fibroblast 


ATCC 


SFB002 


742 850 


skin fibroblast 


ATCC 


SFB003 


87 


small intestine 


Clontech 


SIN001 


27-29 31 38 40 46 48 51 54 57-58 62 65 67 75 
77 85 97 110 112 116-117 119 131-132 134 137 
140 161 163 166 168 177 188 197-198 208 213 
220 224 229 246 257 261-262 264-265 276-277 
288 295 297 299 311 316-326 328-330 337-338 
340 360-361 373 375 382 390-391 410 413 428- 
429 436 438 440 453-454 459 468 [illegible] 497 
507 511 522 531 536 538 542 545-546 548 552 
556 564 570-571 576 580-581 586-587 591 596 
599 605 610 613 619 625-626 643-644 651-653 
664-666 668-670 677 680 684 693 700-702 706- 
707 713-715 723-724 729-730 735 740 746 748 
753-754 757-758 764 777-778 784-786 818 822 
824 826-829 833-837 842 862-863 865-867 877- 
878 886 897 900-902 906 913 916 921 925-926 
936 939-941 


spinal cord 


Clontech 


SPC001 


18 23 33 37 42 51 67 87 92 94 97 100 140 162 
184 188 191-192 208 213 220 231 248 262 268- 
271 273-274 282 287-288 290 307 311 316-325 
358 364 376-377 383 387 389-390 402 412 422 
444 455 476 483-484 489 504 522 534-535 556 
562 587-588 591 597 603-604 618-619 643 651 
667-670 677 693 703-704 717-718 727 746 757 
773 808 810 827 834-835 837 850 871-872 875 
904 910 931-932 939-941 


stomach 


Clontech 


STO001 


18 65 88 163 188 208 213 261 272 277 286 294 
336 373 396 412 459 514 553 602 610 647 651- 
652 671 673 734 774 790 831 833 842 850 876 


testis 


GIBCO 


ATS001 


1 3-4 14-16 28 31 45-46 66 85 90 95 97 103 112 
128-130 134 140 163 166 188 191-192 199-200 
213 226-228 261-265 267-271 284 302 311 316- 
325 327 379-380 391 413 421 428 444 454 457 
459-460 467 491 493 495 500 505 519 525 529 
532 534-535 545 552 556 566 568 575 596 599 
613 616-617 647 649 [illegible] 
719 721 727 734 738 740 744 748 758 765-766 
774 777 782 802 810 817 827-828 834 842 846- 
847 850 862-863 871-872 878 880 892 901 916- 
917 921 


thalamus 


Clontech 


THA002 


2 87 96 103 106 189-192 208 252 258 295 308 
311 367 376-377 383-384 445 455 459-460 498 
529 587 598 602 629 654-655 705-706 715 717 
723 754 775 810 817 822 864 867 881 892 927 
930 


thymus 


Clontech 


THM001 


3-4 8 18 28 54 57 63 65 68 84 97 100 116-117 
122 134 142 151 169 171-172 188 195 197-198 
201 213 221 237 245 261 287 311 316-325 360- 
361 376-377 423 441 444 459 489 491-493 495 
498 504 507 514 527 532 534-536 539 553 556 
568 571-572 590 595-596 599 610 618 622 631- 
632 643 647 651 654-655 664 687-688 691-693 
703 715 721 733-735 748 760 762 765 781 794 
799 802 831 834 836 842 850 860-861 863 871 
878 885 896-897 903 910 923 925-926 928 939- 
941 


thymus 


Clontech 


THMc02 


2-4 17 20-22 37-38 42-43 46 63 65-68 76 88 95 
103 118 120 124 134 137 140-141 143 163 165 
171 179 182 189-194 198 200 212-215 221 226- 
228 231 244 257 262 266 276-277 287-288 297 
299 307 316-325 341 352 358 360-361 373 376- 
377 379-381 389391 394-396 403 410-411 436 
440 445 450 459 463-464 469 478 491 495 500 
507 511 519-521 530 532 539 542 550 555 560 
563 576 581 587 595 601 610-611 613-614 618 
622 625-626 631-632 638 642-644 657 664 667 
670 673 680 683 687 691-693 699 715-716 721 
740 743-744 747-754 761 763-765 771 777 780- 
781 784-787 790 794 805 811 820 826 831 834 
841 845 861 867-868 878 881 883 891 893-894 
896-897 902 904 910 912-914 918 923 936-941 
946-947 


thyroid gland 


Clontech 


THR001 


1-2 18-21 27 32 38 42 46 49 51 53-56 66 72 77- 
78 87-88 97 115 119 124 130-131 134 136 152 
163-165 172 183 188-192 202 212-213 216 221 
224 229 235 241-243 252 257-258 261 263-265 
267 277 279 297 301 305 308 311 316-325 327 
357 363 373 376-377 381 383 389 397 400-401 
410 413-414 427-428 443-444 446-447 457 459 
463 467-469 475 482 489 495 499-500 504 509 
513 519-522 526 529 533 537-538 542 545-546 
548 556 564 567-568 582-583 589 592 599 605 
608 611 621 623 630 642-644 648 651-652 654- 
655 664 672-676 684-686 691-694 700 706-708 
713 717-718 721-722 725 729 731 734-735 740 
748 753-754 760 764 766 771 774 777 781 792 
797-800 802 805 826 828-829 834 842 850 861 
863 868 876 879 897 899 901 910 913 929 937 
939-941 


trachea 


Clontech 


TRC001 


20-21 38 112 161 163 188 263 267 327 413 420 
457 459-460 471 514 540-541 552 572 574 622 
639 654-655 676-677 691-692 707 725 743 748 
765 777-778 862 868 897 905 908 944 


umbilical cord 


BioChain 


FUC001 


1-2 29 32 46 67 83 87 94 134 136 140 148 160 
163 166 172 181 186-192 197-198 208 213 216 
225-231 237 252 261-265 267-270 279 282 288 
295 302 308 311 316-326 339-340 365 376-377 
379-380 384 392-397 421 423 428 433 440 445 
452 459 461 463-464 470 472 489 491 495 497 
500 507 517-518 522 525-526 528 534-535 540 
545-546 556-558 564 566 568 571-572 577 592 
599 601 605 610 618 623 644 651 661 668-669 
673 678 680 685-686 696 706 709 718 735-736 
748 754 769 772-777 782 792 797-799 802 807 
809 815 817 824 850 854-855 870 876 883 888 
891 897 899 901 913 921 928 930-932 


uterus 


Clontech 


UTR001 


51 67 126 130 133 140 [illegible] 
373 440 491 514 599 685-686 693 713 716-717 
735 897 905 911 939-941 


young liver 


GIBCO 


ALV001 


3-4 17 20-21 32 43 55-56 70 100 134 137 163 
172 174 179 186 188-192 200 213 216 219 221 
229 232 252 275 301 311 315-325 378 381 392 
441 459-460 497 499-500 514 524 526 533 539 
550 568 571 588-589 595 619 622 631-632 642 
658-659 664 677 680 693 700 707 713 719 743 
754 757-758 766 807 834 863 867 876 884 887 
904 907 



TABLE 2 



SEQ 
ID 


Hit ID 


Species 


Description 


S score 


Percent 
identity 


949 


AAM253 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:899. 


644 


99 


949 


AAY275 

01 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 15. 


644 


99 


949 


gi137856 

10 


Mus musculus 


sideroflexin 4 


396 


60 


950 


gi127691 

J. 


Homo sapiens 


Human putative ubiquitin C- 
terminal hydrolase (UHX1) 
mRNA, complete cds. 


371? 


100 


950 


65 


Homo sapiens 


ubiquitin specific protease 11, 
clone MGC:8620 
IMAGE:2961383, mRNA, 
complete cds. 


3709 


99 


950 


gi135294 
94 


Mus musculus 


Similar to ubiquitin specific 
protease 11 


3167 


83 


951 


AAY116 
96 


Homo sapiens 


MITU LckSH3 domain- 
combining protein. 


4131 


99 


951 


AAG786 
48 


Homo sapiens 


SHAN- Human RalBP1 related 
protein 82. 


3875 


99 


951 


gi136251 
66 


Homo sapiens 


RALBP1 mRNA, complete cds. 


3875 


99 


952 


AAY116 
96 


Homo sapiens 


MITU LckSH3 domain- 
combining protein. 


3953 


96 


952 


AAG786 
48 


Homo sapiens 


SHAN- Human RalBP1 related 
protein 82. 


3697 


96 


952 


gi136251 
66 


Homo sapiens 


RALBP1 mRNA, complete cds. 


3697 


96 


953 


gi104371 
91 


Homo sapiens 


cDNA: FLJ21146 fis, clone 
CAS09305. 


2190 


94 


953 


gi128432 
22 


Mus musculus 


putative 


1672 


11 


953 


gi157788 
93 


Homo sapiens 


Similar to hypothetical protein 
FLJ20967, clone MGC:11140 
IMAGE:3837082, mRNA, 
complete cds. 


1328 


99 


954 


gi104371 
91 


Homo sapiens 


cDNA: FLJ21146 fis, clone 
CAS09305. 


2359 


100 


954 


gi128432 
22 


Mus musculus 


putative 


1643 


72 


954 


gi157788 
93 


Homo sapiens 


Similar to hypothetical protein 
FLJ20967, clone MGC:11140 
IMAGE:3837082, mRNA, 
complete cds. 


1409 


99 


955 


gi158253 
77 


Mus musculus 


NIMA-related kinase 8 


2009 


89 


955 


gi158253 
79 


Danio rerio 


NIMA-related kinase 8 


1439 


70 


955 


AAO019 
74 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 15866. 


548 


85 


956 


AAW886 
60 


Homo sapiens 


HUMA- Secreted protein 
encoded by gene 127 clone 
HSUBW09. 


175 


97 


956 


AAO001 
87 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 14079. 


70 


55 


956 


gi138154 
29 


Sulfolobus 
solfataricus 


Sugar transport related protein 


70 


40 


957 


AAB939 
66 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14027. 


1133 


100 


957 


gi104338 
35 


Homo sapiens 


cDNA FLJ12377 fis, clone 
MAMMA1002524, weakly 
similar to HYPOTHETICAL 
117.8 KD PROTEIN IN STE2- 
FRS2 INTERGENIC REGION. 


1133 


100 


957 


AAO043 
81 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 18273. 


594 


100 


958 


AAB952 
97 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17525. 


885 


100 


958 


gi104349 
41 


Homo sapiens 


cDNA FLJ13087 fis, clone 
NT2RP3002099. 


885 


100 


958 


gi167405 
66 


Homo sapiens 


Similar to hypothetical protein 
FLJ13087, clone MGC:15009 
IMAGE:3536735, mRNA, 
complete cds. 


807 


95 


959 


AAY276 
76 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 110. 


474 


100 


959 


gi529595 
1 


Oryza sativa 


Similar to Herpesvirus papio 
BRRF2 homolog gene, partial 
cds.(U23857) 


69 


41 


960 


AAG892 
62 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 382. 


352 


98 


960 


AAY307 
21 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


352 


98 


960 


AAB236 
15 


Homo sapiens 


ALPH- Human secreted protein 
SEQ ID NO: 30. 


343 


97 


961 


AAY726 
05 


Homo sapiens 


INCY- Human Electron Transfer 
Protein, ETRN-3. 


579 


100 


961 


AAO116 
27 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 25519. 


579 


100 


961 


AAG039 
41 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 8022. 


570 


98 


962 


gi146034 
55 


Homo sapiens 


ubiquitin-conjugating enzyme 
E2A (RAD6 homolog), clone 
MGC:20175 IMAGE:3051041, 
mRNA, complete cds. 


599 


79 


962 


gi488377 
3 


Gallus gallus 


ubiquitin-conjugating enzyme 


599 


79 


962 


gi144852 
44 


Mus musculus 


ubiquitin-conjugating enzyme 
HR6A 


599 


79 


963 


gi146034 
55 


Homo sapiens 


ubiquitin-conjugating enzyme 
E2A (RAD6 homolog), clone 
MGC:20175 IMAGE:3051041, 
mRNA, complete cds. 


699 


90 


963 


gi488377 
3 


Gallus gallus 


ubiquitin-conjugating enzyme 


699 


90 


963 


gi144852 
44 


Mus musculus 


ubiquitin-conjugating enzyme 
HR6A 


699 


90 


964 


gi168770 
66 


Homo sapiens 


clone MGC:24447 
IMAGE:4077762, mRNA, 
complete cds. 


362 


100 


964 


gi168770 
59 


Homo sapiens 


clone MGC:24437 
IMAGE:4075637, mRNA, 
complete cds. 


362 


100 


964 


AAY949 
59 


Homo sapiens 


GEMY Human secreted protein 
clone mc300 1 protein sequence 
SEQ ID NO:124. 


204 


97 


965 


AAB929 
93 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:11723. 


2879 


97 


965 


AAG813 
64 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:246. 


2879 


97 


965 


gi140423 
80 


Homo sapiens 


cDNA FLJ14690 fis, clone 
NT2RP2005270. 


2879 


97 


966 


AAB957 
69 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:18703. 


2841 


99 


966 


gi104366 
07 


Homo sapiens 


cDNA FLJ14207 fis, clone 
NT2RP3003185, weakly similar 
to TROPOMYOSIN 1, FUSION 
PROTEIN 33. 


2841 


99 


966 


gi128331 
93 


Mus musculus 


putative 


2375 


85 


967 


AAM254 
13 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:928. 


799 


100 


967 


AAW678 
63 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 57 clone 
HFEBF41. 


551 


98 


967 


gi135438 
11 


Mus musculus 


Unknown (protein for 
IMAGE:3591061) 


95 


33 


968 


gi104379 
60 


Homo sapiens 


cDNA: FLJ21792 fis, clone 
HEP00441. 


5865 


99 


968 


AAE0618 
6 


Homo sapiens 


HUMA- Human gene 58 
encoded secreted protein 
fragment, SEQ ID NO:248. 


3088 


98 


968 


AAE0609 
8 


Homo sapiens 


HUMA- Human gene 58 
encoded secreted protein 
HSLCX03, SEQ ID NO: 160. 


3088 


98 


969 


gi126980 
79 


Homo sapiens 


mRNA for KIAA1767 protein, 
partial cds. 


4441 


98 


969 


AAM255 
78 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO: 1093. 


3898 


98 


969 


AAE0618 
6 


Homo sapiens 


HUMA- Human gene 58 
encoded secreted protein 
fragment, SEQ ID NO:248. 


3464 


98 


970 


AAY483 
59 


Homo sapiens 


META- Human prostate cancer- 
associated protein 56. 


403 


98 


970 


gi152159 
66 


Homo sapiens 


DL8Q12 gene for hypothetical 
protein, exons 1-2. 


92 


53 


970 


AAR992 
56 


Homo sapiens 


UYAR- Natural killer lytic 
associated protein. 


75 


37 


971 


gi656182 
7 


Mus musculus 


Kif21a 


5684 


76 


971 


gi656182 
9 


Mus musculus 


Kif21b 


4944 


60 


971 


gi126979 
61 


Homo sapiens 


mRNA for KIAA1708 protein, 
partial cds. 


4656 


99 


972 


AAW750 
79 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 23 clone 
HBMCT32. 


148 


100 


973 


AAY359 
21 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO.
170.


548 


99 


973 


AAM253 
86 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:901.


494 


96 


973


AAY359 
23 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO. 
172. 


494 


96 


974 


AAY275 
87 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 21.


448 


100 


974 


gi128025
61 


Bovine 
herpesvirus 4 


unknown 


74 


42 


975 


AAU162 
97 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 1250. 


2420 


98 


975 


AAB944 
86 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15170.


1761 


74 


975 


AAM940 
18 


Homo sapiens 


HELI- Human stomach cancer 
expressed polypeptide SEQ ID 
NO 106. 


1761 


74 


976 


AAM412 
64 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6195. 


903 


99 


976 


AAM394 
78 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2623. 


903 


99 


976 


AAB437 
71 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO:1216.


903 


99 


977 


gi117616
11 


Homo sapiens


kinesin-like protein RBKIN1
(RBKIN) mRNA, complete cds, 
alternatively spliced. 


9290 


99 


977 


gi117616
13 


Homo sapiens 


kinesin-like protein RBKIN2 
(RBKIN) mRNA, complete cds, 
alternatively spliced. 


9055 


98 


977 


gi120540
30 


Homo sapiens 


mRNA for KINESIN-13A1
(KIN13A gene).


8955 


97 


978


gi759580 
2 


Mus musculus 


ELKL motif kinase 2 short form 


188 


48 


978 


gi759580 
0 


Mus musculus


ELKL motif kinase 2 long form 


188 


48


978


AAM939 
56 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 4158. 


187 


48 


979 


gi161984
56 


Homo sapiens 


Similar to RIKEN cDNA
0610040E02 gene, clone
MGC:17973 IMAGE:3919892,
mRNA, complete cds.


1050 


100 


979 


gi167406
89 


Mus musculus 


RIKEN cDNA 0610040E02
gene 


891 


76 


979 


gi128413
15 


Mus musculus 


putative 


891 


76 


980 


gi147149
27 


Homo sapiens 


amino acid transporter system 
A1, clone MGC:17722
IMAGE:3871101, mRNA,
complete cds. 


2466 


100 


980 


gi116407
43


Homo sapiens


amino acid transporter system
A1 mRNA, complete cds.


2466


100


980 


AAB935 
56 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:12942. 


2459 


99 


981 


AAW750 
90 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 34 clone 
HTEGA81. 


507 


100 


981 


AAW751 
52 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 34 clone 
HKMLK44. 


507


100 


981


AAW751 
51 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 34 clone 
HTEGA81. 


507 


100 


982 


AAB947 
54 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15811.


4658 


99 


982 


gi140428
59 


Homo sapiens 


cDNA FLJ14964 fis, clone 
PLACE4000581, moderately 
similar to FIBROPELLIN I 
PRECURSOR. 


4658 


99 


982 


gi111771
64 


Mus musculus 


polydom protein 


3880 


81 


983 


AAB652 
78 


Homo sapiens 


GETH Human PRO1185
(UNQ599) protein sequence 
SEQ ID NO:401.


993 


100 


983 


AAM253 
16 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:831.


993 


100 


983 


AAM238 
05 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1330. 


993 


100 


984 


AAY359 
96 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO. 
381. 


589 


66 


984 


AAB652 
78 


Homo sapiens 


GETH Human PRO1185
(UNQ599) protein sequence 
SEQ ID NO:401.


567 


65 


984 


AAM253 
16 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:831.


567 


65 


985 


gi168770
39 


Homo sapiens 


hypothetical protein FLJ22688,
clone MGC:2438 
IMAGE:2819805, mRNA, 
complete cds. 


1952 


91 


985 


gi133252
53 


Homo sapiens 


Similar to hypothetical protein 
FLJ22688, clone MGC:4098 
IMAGE:2819805, mRNA, 
complete cds. 


1952 


91 


985 


gi104391
77 


Homo sapiens 


cDNA: FLJ22688 fis, clone 
HSI11003. 


1695 


89 


986 


gi512469


Homo sapiens 


H.sapiens HLA-DMA gene. 


1285 


92 


986 


gi218187 
6 


Homo sapiens 


Human DNA sequence from 
clone aa-^jz / on chromosome
6. Contains the BRD2 gene 
encoding bromodomain- 
containing 2 protein, the HLA- 
DMA gene encoding major 
histocompatibility complex class 
II DM alpha, two CpG islands, 
ESTs, STSs and GSSs, complete 
sequence. 


1285 


92 


986 


gi150303
36


Homo sapiens


clone MGC:13532
IMAGE:4245221, mRNA,
complete cds.


1285


92


987 


AAB425 
29 


Homo sapiens 


CURA- Human ORFX 
ORF2293 polypeptide sequence 
SEQ ID NO:4586.


2683 


99 


987


gi126978
93


Homo sapiens 


mRNA for KIAA1674 protein,
partial cds. 


2683 


99 


987 


AAM905 
36 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO:18129.


1512 


98 


988 


gi222453 
9 


Homo sapiens 


Human mRNA for KIAA0299 
gene, partial cds. 


9903 


99 


988


AAY165 
88 


Homo sapiens 


RHON A protein that interacts 
with presenilins. 


4733 


97 


988 


gi730171 
0 


Drosophila 
melanogaster 


CG11754 gene product


3074 


43 


989 


AAW748 
87 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 160 clone 
HCELB21. 


203 


100 


989 


AAM244 
01 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1926.


183 


85 


989 


gi929400 
3 


Arabidopsis 
thaliana 


cytochrome P450-like protein 


70 


39 


990 


gi119904
20 


Homo sapiens 


mRNA for MOP-3, complete 
cds. 


4359 


93 


990 


AAB932 
29 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:12223.


3950 


99 


990


gi702311
4 


Homo sapiens 


cDNA FLJ10833 fis, clone
NT2RP4001206, moderately 
similar to Drosophila 
melanogaster strawberry notch 
mRNA. 


3950 


99 


991 


gi433710 
5 


Homo sapiens 


MSH55 gene, partial cds; and
CLIC1, DDAH, G6b, G6c, G5b,
G6d, G6e, G6f, BAT5, G5b,
CSK2B, BAT4, G4, ApoM,
BAT3, BAT2, AIF-1, 1C7, LST-
1, LTB, TNF, and LTA genes,
complete cds.


668 


100 


991 


gi29969 


Homo sapiens 


Human gene for casein kinase II 
subunit beta (EC 2.7.1.37).


668 


100


991


gi29967 


Homo sapiens 


Human mRNA for 
phosvitin/casein kinase type II 
beta subunit (EC 2.7.1.37).


668 


100 


992 


AAY108 
40 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


349 


100


993 


AAM259 
27 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:1442.


824 


100 


993 


AAY733 
25 


Homo sapiens 


INCY-HTRM clone 00H06 
protein sequence. 


820 


99


993


AAG038 
70 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO:7951.


819 


99 


994 


AAB940 
43 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14201.


5698 


99 


994 


gi104339
76 


Homo sapiens 


cDNA FLJ12471 fis, clone
NT2RM1000894, highly similar
to DNA-DIRECTED RNA
POLYMERASE I 135 KD
POLYPEPTIDE (EC 2.7.7.6).


5698


99


994 


gi162159
4 


Mus musculus 


second largest subunit of RNA 
polymerase I 


5095 


84 


995 


AAU158 
80 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 833. 


1091 


100 


995 


ABB0334 
5 


Homo sapiens 


HUMA- Human musculoskeletal 
system related polypeptide SEQ 
ID NO 1292. 


1091 


100 


995 


gi138794
42 


Mus musculus 


Similar to RIKEN cDNA
2310035M22 gene 


1056 


93 


996 


gi669260 
7 


Mus musculus 


MGA protein 


3446 


77 


996 


gi304356 
0 


Homo sapiens 


mRNA for KIAA0518 protein,
partial cds. 


3272 


100 


996 


AAB945 
60 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15332.


1933 


99 


997 


gi577139 
2 


Homo sapiens 


RAB-like protein 2B (RABL2B) 
mRNA, complete cds. 


718 


99 


997 


gi159288
38 


Homo sapiens 


RAB, member of RAS 
oncogene family-like 2B, clone 
MGC:10160 IMAGE:3906749,
mRNA, complete cds. 


718 


99 


997 


gi726396 
1 


Homo sapiens 


Human DNA sequence from 
clone RP11-395L14. Contains
(part of) up to six novel genes or 
pseudogenes, the gene for a 
novel forkhead protein similar to 
FOXD4 (forkhead box D4,
FREAC5), the gene for a novel 
phosphoglucomutase like 
protein, a pseudogene similar to
part of DEAD/H (Asp-Glu-Ala-
Asp/His) box (S. cerevisiae
CHL1-like helicase), an
RPL23A (60S ribosomal protein 
L23A) pseudogene, the 
RABL2A gene for RAB-like 2A, 
the gene for a novel protein 
similar to small nuclear 
ribonucleoprotein polypeptide A'
(SNRPA1) and the 3' part of the 
gene for a novel protein similar 
to acrosin (ACR). Contains 
ESTs, STSs, GSSs and nine 
putative CpG islands, complete 
sequence. 


714 


97 


998 


gi104402
02 


Homo sapiens 


cDNA: FLJ23495 fis, clone 
LNG02228. 


2398 


99 


998 


AAU172 
89 


Homo sapiens 


HUMA- Novel signal 
transduction pathway protein, 
Seq ID 854. 


487 


97 


998 


AAM926 
81 


Homo sapiens 


HUMA- Human digestive 
system antigen SEQ ID NO: 
2030. 


487 


97 


999 


gi126532
49 


Homo sapiens 


Similar to CAAX box 1, clone 
MGC:8471 IMAGE:2821721, 
mRNA, complete cds. 


450 


100


999


AAY322 
07 


Homo sapiens 


INCY- Human receptor 
molecule (REC) encoded by 
Incyte clone 2936050. 


429 


95 


999 


gi757623 
2 


Homo sapiens 


Human DNA sequence from 
clone RP4-809E13 on
chromosome Xq26.1-27.1.
Contains the gene for a putative 
prenylated protein, two putative 
prenylated protein pseudogenes, 
ESTs, STSs, GSSs and three 
putative CpG islands, complete 
sequence. 


397 


87 


1000


gi157785
56 


Homo sapiens 


alpha-1-B glycoprotein
precursor (A1BG) mRNA, 
complete cds. 


1487 


98 


1000 


gi118773
48 


Rattus 
norvegicus 


putative alpha 1B-glycoprotein


518 


40 


1000 


AAY646 
70 


Homo sapiens 


GEST Human 5' EST related 
polypeptide SEQ ID NO:831. 


430 


76 


1001


AAY873 
15 


Homo sapiens 


INCY- Human signal peptide 
containing protein HSPP-92 
SEQ ID NO:92.


2817 


100 


1001


AAM937 
93 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3821. 


2527 


99 


1001


gi128045
27 


Homo sapiens 


hypothetical protein FLJ22405, 
clone MGC:2543 
IMAGE:2961594, mRNA,
complete cds. 


2194 


100 


1002 


gi416030 
4 


Mus musculus 


HS1 binding protein 3 


1449 


75 


1002 


AAB958 
01 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:18781.


1082 


100 


1002 


gi104366
60 


Homo sapiens 


cDNA FLJ14249 fis, clone
OVARC1001200, weakly
similar to Mus musculus mRNA
for HS1 binding protein 3.


1082 


100 


1003 


AAY873 
15 


Homo sapiens 


INCY- Human signal peptide 
containing protein HSPP-92 
SEQ ID NO:92.


1837 


100 


1003 


gi128045
27 


Homo sapiens 


hypothetical protein FLJ22405,
clone MGC:2543 
IMAGE:2961594, mRNA,
complete cds. 


1837 


100 


1003 


gi104387
80 


Homo sapiens 


cDNA: FLJ22405 fis, clone 
HRC08294. 


1837 


100 


1004


AAM937 
93 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3821. 


3401 


99 


1004 


gi173906
94 


UllMvUlUO 


Similar to hypothetical protein
FLJ22405 


2543 


90 


1004 


AAY873 
15 


Homo sapiens 


INCY- Human signal peptide 
containing protein HSPP-92 
SEQ ID NO:92.


2535 


100 


1005 


AAY873 
27 


Homo sapiens 


INCY- Human signal peptide 
containing protein HSPP-104 
SEQ ID NO: 104. 


584 


100 


1005 


AAY597 
05 


Homo sapiens 


GEST Secreted protein 51-41-1-
F10-FL1.


554 


95


1005


AAY128 
65 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID NO:455. 


208 


100 


1006 


AAY362 
37 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 14. 


177 


100 


1007 


AAY873 
10 


Homo sapiens 


INCY- Human signal peptide 
containing protein HSPP-87 
SEQ ID NO:87. 


370 


100 


1007 


AAG773 
44 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:8110.


201 


79 


1007 


gi141981
25 


Homo sapiens 


clone MGC:18053 
IMAGE:4148889, mRNA,
complete cds. 


68 


61 


1008 


AAB941 
08 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 14340. 


1844 


96 


1008 


AAU045 
57 


Homo sapiens 


GETH Human Stra6 homologue,
PRO10282. 


1844 


96 


1008 


gi135609
66 


Homo sapiens 


STRA6 isoform 1 mRNA, 
complete cds, alternatively 
spliced. 


1844 


96 


1009 


AAY383 
94 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 9. 


213 


100 


1010 


gi104323
82 


Homo sapiens 


Human DNA sequence from 
clone RP4-717I23 on
chromosome 1p21.2-22.3
Contains ESTs, STSs and GSSs. 
Contains part of a novel gene for 
a protein similar to Xenopus 
laevis Sojo protein, a novel gene 
and a 60S ribosomal protein L39 
(RPL39) pseudogene, complete 
sequence. 


3267 


100 


1010 


gi569043 
5 


Xenopus laevis 


nuclear protein Sojo 


1386 


44 


1010 


AAG750 
36 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:5800. 


557 


98 


1011 


AAG005 
17 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4598. 


160 


48 


1011 


AAO024 
74 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 16366. 


153 


45 


1011 


gi854065 


Human 
herpesvirus 6 


U88 


145 


50 


1012 


AAY195 
61 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


514 


100 


1012 


AAB381 
57 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 39
SEQ ID NO:96.


70 


30 


1012 


AAU049 
58 


Homo sapiens 


GETH Human Interleukin 17 
receptor, IL-17RH4. 


69 


60 


1013 


AAR152 
22 


Homo sapiens 


TEXA Chronic myelogenous 
leukaemia-derived myeloid- 
related protein. 


635 


100 


1013 


gi32402 


Homo sapiens 


Human mRNA for HP-1, a 
member of the 
corticostatin/defensin family. 


493 


100 


1013 


gi292363 


Homo sapiens 


Human neutrophil peptide-1
gene, complete cds.


493


100


1014 


gi633017 
6 


Homo sapiens 


mRNA for KIAA1167 protein,
partial cds. 


4079 


99 


1014 


gi898084 
1 


Rattus 
norvegicus 


GRIP-associated protein 1 long 
form 


3814 


92 


1014 


gi173892
63 


Mus musculus 


Similar to GRIP-associated 
protein 1


3646 


89 


1015 


gi104430
47 


Homo sapiens 


Human DNA sequence from 
clone RP11-465L10 on 
chromosome 20. Contains 10 
CpG islands, ESTs, STSs and 
GSSs. Contains the gene for a 
novel protein similar to 
Drosophila CG11399, the gene
for a novel C2H2 type zinc 
finger protein similar to chicken 
FZF-1, a Ferritin light 
polypeptide (FTL) pseudogene, 
the MMP9 gene for matrix 
metalloproteinase 9 (gelatinase 
B, 92kD gelatinase, 92kD type 
IV collagenase) (CLG4B), a 
novel gene, the SLC12A5 gene 
for solute carrier family 12, 
(potassium-chloride transporter) 
member 5 (KIAA1176) and the
3' end of gene KIAA1637,
complete sequence. 


6471 


99 


1015 


gi104389
18 


Homo sapiens 


cDNA: FLJ22504 fis, clone
HRC11430. 


4392 


98 


1015 


gi984814 


Gallus gallus


zinc finger protein 


2127 


58 


1016 


AAE0607 
7 


Homo sapiens 


HUMA- Human gene 37 
encoded secreted protein 
HDPCJ91, SEQ ID NO:139.


267 


100 


1016 


AAY871 
00 


Homo sapiens 


HUMA- Human secreted protein 
sequence SEQ ID NO:139. 


267 


100 


1016 


gi127188
12 


Yarrowia 
lipolytica 


ND3 protein 


69 


48 


1017 


AAY864 
63 


Homo sapiens 


HUMA- Human gene 47- 
encoded protein fragment, SEQ 
ID NO:378.


361 


100 


1017


AAY863 
20 


Homo sapiens 


HUMA- Human secreted protein 
HPRBC80, SEQ ID NO:235.


361 


100 


1017


gi754963 
3 


Arabidopsis 
thaliana 


hypothetical protein 


70 


31 


1018


AAM253 
84 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:899.


1126 


100 


1018 


AAY275 
81 


Homo sapiens


HUMA- Human secreted protein
encoded by gene No. 15. 


774 


100 


1018 


gi137856
18 


Mus musculus 


sideroflexin 4 


660 


60 


1019 


gi452890 


Cricetulus 
migratorius 


serum amyloid P; SAP; female 
protein; FP 


158 


71 


1019 


gi387051 


Cricetulus 
longicaudatus 


FP 


157 


71 


1019 


gi347257 


Mesocricetus 
auratus 


serum amyloid P component 


157 


71


1020


gi174287
83 


Ralstonia
solanacearum 


PROBABLE NADP-
DEPENDENT
OXIDOREDUCTASE
OXIDOREDUCTASE
PROTEIN


68 


29 


1020 


gi151592
26 


Agrobacterium 
tumefaciens str. 
C58 (Cereon) 


AGR_L_1604p


67 


28 


1020 


gi177425
00 


Agrobacterium 
tumefaciens str. 
C58 (Dupont) 


succinoglycan biosynthesis 
protein 


67 


28 


1021 


gi165539
33 


Homo sapiens 


cDNA FLJ25217 fis, clone 
REC08938, highly similar to 
Oryctolagus cuniculus 
Na+/glucose cotransporter-
related protein mRNA.


1477 


100 


1021 


AAE0661 
4 


Homo sapiens 


SAGA Human protein having 
hydrophobic domain, HP03974. 


1394 


100 


1021 


gi152098
08 


Homo sapiens 


unnamed protein product 


1394 


100 


1022 


AAY167 
81 


Homo sapiens 


GEMY Human secreted protein
(clone bhl 57 7). 


1258 


100 


1022 


gi126540
11 


Homo sapiens 


similar to rat nuclear ubiquitous 
casein kinase 2, clone 
MGC:5494 IMAGE:3452665, 
mRNA, complete cds. 


1258 


100 


1022 


gi120536
24 


Homo sapiens 


mRNA for NUCKS protein. 


1258 


100 


1023 


AAB832 
46 


Homo sapiens 


MILL- Human FATP1 SEQ ID 
NO: 47. 


3372 


100 


1023 


AAB832 
39 


Homo sapiens 


MILL- Human FATP1 SEQ ID
NO: 38. 


3372 


100 


1023 


AAB832 
34 


Homo sapiens 


MILL- Human FATP1 SEQ ID
NO: 32. 


3372 


100 


1024 


gi159299
04 


Homo sapiens 


Similar to dolichyl-phosphate 
mannosyltransferase polypeptide 
2, regulatory subunit, clone 
MGC:21559 IMAGE:4181887, 
mRNA, complete cds. 


366 


100 


1024 


AAY579 
05 


Homo sapiens 


INCY- Human transmembrane 
protein HTMPN-29. 


153 


100 


1024 


gi379036 
3 


Homo sapiens 


mRNA for DPM2, complete cds. 


153 


100 


1025 


AAY257 
32 


Homo sapiens 


HUMA- Human secreted protein 
encoded from gene 22. 


212 


100 


1026 


AAG770 
16 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID
NO:7780.


291 


100 


1026 


AAS0319 
3_aa1


Homo sapiens 


GEHO Human lymphocyte cell 
surface antigen CD53 cDNA 
sequence. 


116 


95 


1026 


AAV812 
20_aa1


Homo sapiens 


GEHO Human CD53 antigen 
cDNA. 


116 


95 


1027 


AAY118 
48 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID No: 448.


193 


100 


1027 


AAY359 
56 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO.
205.


193


100


1027 


AAY360 
98 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO. 
483. 


193 


100 


1028 


AAM939 
42 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 4126. 


1354 


99 


1028 


ABB1142
2 


Homo sapiens 


HYSE- Human Zn finger protein 
homologue, SEQ ID NO:1792. 


953 


92 


1028 


gi646720 
6 


Homo sapiens 


GIOT-4 mRNA for 
gonadotropin inducible 
transcription repressor-4, 
complete cds. 


883 


56 


1029 


gi128352
22 


Mus musculus


putative 


1562 


82 


1029 


gi263665 
4 


Homo sapiens 


Human myosin binding protein 
H (MyBP-H) gene, complete 
cds. 


1206 


66 


1029 


gi154681
3 


Mus musculus 


myosin binding protein H 


1203 


66 


1030 


AAY363 
32 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 109. 


268 


100


1030 


gi100389
17 


Buchnera sp. 
APS 


hypothetical protein 


76 


49 


1030 


gi750051 
8


Unknown 


hypothetical protein F35E2.7 -
Caenorhabditis elegans


63 


38 


1031 


gi971940 
9 


Homo sapiens 


candidate tumor suppressor 
protein mRNA, complete cds. 


2030 


99 


1031 


gi128565
14 


Mus musculus


putative 


922 


oo 


1031 


AAU220 
41 


Homo sapiens 


HUMA- Human cardiovascular 
system antigen polypeptide SEQ 
ID No 815. 


703 




1032 


AAR152 
22 


Homo sapiens 


TEXA Chronic myelogenous 
leukaemia-derived myeloid- 
related protein. 


635 


100 


1032 


gi32402 


Homo sapiens 


Human mRNA for HP-1, a 
member of the
corticostatin/defensin family.


493 


100 


1032 


gi292363 


Homo sapiens 


Human neutrophil peptide-1 
gene, complete cds. 


493 


100 


1033 


gi165525
02 


Homo sapiens 


cDNA FLJ32395 fis, clone
SKMUS2000117, moderately
similar to Homo sapiens 
MAGEF1 mRNA. 


1599 


100 


1033 


gi126591
42 


Mus musculus


mage-g1


1178 


76 


1033 


gi128571
IP. 


Mus musculus 


putative 


1178 


76 


1034 


AAB496 
50 


Homo sapiens 


CURA- Human SEC2 protein 
sequence SEQ ID 4. 


2615 


100 


1034 


gi122265
32 


Homo sapiens 


unnamed protein product 


2615 


100 


1034 


gi147348
86 


Mus musculus


Unknown (protein for 
IMAGE:3498778) 


2343 


89 


1035 


AAM237 
21 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1246. 


2889 


100 


1035 


AAM792
29


Homo sapiens


HYSE- Human protein SEQ ID
NO 1891.


2676


100


1035 


AAB883 
70 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone 
PSEC0106. 


1611 


100 


1036 


AAY596 
57 


Homo sapiens 


GEST Secreted protein 108-003- 
5-0-A8-FL. 


689 


100 


1036 


gi144956
99 


Homo sapiens 


clone MGC:15961 
IMAGE:3538818, mRNA, 
complete cds. 


689 


100 


1036 


gi144245
22 


Homo sapiens 


clone MGC:14327 
IMAGE:4298098, mRNA, 
complete cds. 


689 


100 


1037 


AAY276 
26 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 60. 


352 


100 


1038 


AAD183 
56_aa1


Homo sapiens 


INCY- Human lipid metabolism
enzyme-5 (LME-5) cDNA. 


1748 


100 


1038 


AAB735 
60 


Homo sapiens 


MILL- Human lipase 18892.


1748 


100 


1038 


AAE1099 
6 


Homo sapiens 


INCY- Human lipid metabolism 
enzyme-5 (LME-5) protein. 


1748 


100 


1039 


AAG034 
75 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7556. 


448 


100 


1039 


AAY128 
61 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID NO:451. 


448 


100 


1039 


gi56760 


Rattus 
norvegicus 


neuronal nohacetlycholine 
binding subunit 


75 


23 


1040 


AAY530 
49 


Homo sapiens 


GEMY Human secreted protein 
clone cj378 3 protein sequence 
SEQ ID NO:104.


463 


100 


1040 


gil36036 
74 


Stellilabium
pogonostalix 


maturase 


78 


38 


1040 


gi136036
76 


Telipogon 
parvulus 


maturase 


74 


36 


1041 


AAY413 
54 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 47 clone 
HUFCJ30. 


288 


100 


1041 


gi152304
14 


Arabidopsis 
thaliana


putative protein 


63 


43 


1042 


AAW747 
77 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 48 clone 
HFCA174. 


245 


100 


1042 


gi154889
20 


Homo sapiens 


Similar to RIKEN cDNA 
2010107G23 gene, clone 
MGC:9596 IMAGE:3896656,
mRNA, complete cds. 


245 


100 


1042 


gi128424
65 


Mus musculus


putative 


241 


97 


1043 


gi152780
28 


Homo sapiens 


beta-galactose-3-O-
sulfotransferase, 4, clone
MGC:15045 IMAGE:3636329, 
mRNA, complete cds. 


ZDOo 


100


1043 


gi147945
24 


Homo sapiens 


Galbeta1-3GalNAc 3'-
sulfotransferase mRNA,
complete cds. 


2564 


99 


1043 


AAB938 
92 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:13832.


2556 


99 


1044 


gi153215


Homo sapiens


empty spiracles-like protein
(EMX2) mRNA, complete cds.


1341


100


1044


gi132767
73


Homo sapiens


mRNA; cDNA
DKFZp761M1614 (from clone
DKFZp761M1614).


1341 


100 


1044


gi165496
86 


Homo sapiens 


cDNA FLJ30479 fis, clone
BRAWH1000168, highly similar 
to Homeotic protein emx2. 


1336 


99 


1045


gi168770
66 


Homo sapiens 


clone MGC:24447 
IMAGE:4077762, mRNA,
complete cds. 


362 


100


1045 


gi168770
59 


Homo sapiens 


clone MGC:24437 
IMAGE:4075637, mRNA,
complete cds. 


362 


100 


1045


AAY949
59 


Homo sapiens 


GEMY Human secreted protein 
clone rac300 1 protein sequence 
SEQ ID NO:124.


204 


97 


1046 


gi239445 
1 


Caenorhabditis 
elegans 


Hypothetical protein ZC178.2 


406 


30 


1046 


AAB875 
75 


Homo sapiens 


GETH Human PRO1342.


384 


38


1046 


AAY994 
08 


Homo sapiens 


GETH Human PRO1342
(UNQ697) amino acid sequence 
SEQ ID NO:243.


384 


38 


1047 


gi120531
47 


Homo sapiens 


mRNA; cDNA
DKFZp434F1726 (from clone
DKFZp434F1726). 


3484 


98 


1047 


ABB1173 
9 


Homo sapiens 


HYSE- Human IF-gamma 
receptor homologue, SEQ ID 
NO:2109. 


1044 


100 


1047 


AAR049 
32 


Homo sapiens 


YEDA Interferon-gamma 
receptor segment from clone 39 
responsible for binding the target


829 


98 


1048 


AAG739 
89 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:4753. 


957 


100


1048 


AAB589 
98 


Homo sapiens 


HUMA- Breast and ovarian 
cancer associated antigen protein 
sequence SEQ ID 706. 


957 


100 


1048


AAM891 
00 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO:16693.


362 


67 


1049 


gi100473
33 


Homo sapiens 


mRNA for KIAA1628 protein,
partial cds. 


5204 


100 


1049 


gi118629
39 


Mus musculus


DDM36 


4378 


89 


1049 


gi118629
41 


Mus musculus


DDM36E 


4366 


88 


1050 


gi190647


Homo sapiens 


Human pregnancy-specific beta- 
1 glycoprotein (PSG) mRNA, 
complete cds. 


611 


72 


1050 


gi984306 


Homo sapiens 


Human pregnancy-specific 
glycoprotein 13 (PSG13') 
mRNA, complete cds. 


606 


71 


1050


gi190568


Homo sapiens


Human pregnancy-specific beta-
1-glycoprotein 11 (PSG11)
mRNA, complete cds.


585


64


1051


AAM436
55


Homo sapiens


HUMA- Human polypeptide
SEQ ID NO 333.


588


100


1051 


AAM435 
88 


Homo sapiens 


HUMA- Human polypeptide 
SEQ ID NO 266. 


588 


100 


1051 


AAW600 
43 


Homo sapiens 


HUTC- Human MHC class I 
chain-related gene A (MICA) 
polypeptide.


588 


100 


1052 


gi487783 
6 


Rattus 
norvegicus 


TRP2 


524 


74 


1052 


gi110956
41 


Mus musculus 


transient receptor potential 
channel 2-beta 


521 


73 


1052 


gi110956
39 


Mus musculus 


transient receptor potential 
channel 2-alpha 


521 


73 


1053 


AAB941 
88 


Homo sapiens 


HELI- Human protein sequence
SEQ ID NO:14511.


1209 


97 


1053 


gi104342
24 


Homo sapiens 


cDNA FLJ12623 fis, clone
NT2RM4001746. 


1209 


97 


1053 


gi126527
97 


Homo sapiens 


clone MGC:5179 
IMAGE:2900118, mRNA,
complete cds. 


1066 


88 


1054 


AAY383 
89 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 4. 


152 


90 


1054 


gi136246
35 


Euglena viridis 


maturase-like protein


63 


42 


1055 


AAY275 
82 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 16. 


320 


100 


1055 


gi134216
31 


Caulobacter 
crescentus 


conserved hypothetical protein 


65 


27 


1055 


gi161247
13 


Caulobacter
crescentus


conserved hypothetical protein 


65 


27 


1056 


AAO087 
59 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 22651. 


613 


100 


1056 


gi104375
69 


Homo sapiens 


cDNA: FLJ21463 fis, clone 
COL04765. 


269 


57 


1056 


AAY453 
82 


Homo sapiens 


HUMA- Human secreted protein 
fragment encoded from gene 28. 


266 


58 


1057 


AAE0517 
5 


Homo sapiens 


INCY- Human drug 
metabolising enzyme (DME-6) 
protein. 


1830 


99 


1057 


AAU122 
25 


Homo sapiens 


GETH Human PRO4404 
polypeptide sequence. 


1830 


99 


1057 


AAU183 
63 


Homo sapiens 


HUMA- Human endocrine 
polypeptide SEQ ID No 318. 


1092 


95 


1058 


AAG812 
74 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:66. 


815 


96 


1058 


gi140358
56 


Homo sapiens 


unnamed protein product 


815 


96 


1058 


AAG812 
73 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:64. 


652 


99 


1059 


AAY359 
80 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO. 
229.


581 


97 


1059 


ABB1196 
0 


Homo sapiens 


HYSE- Human neuroendocrine- 
specific protein- like homologue, 
SEQ ID NO:2330.


246 


100 


1059 


AAZ3831
9_aa1


Homo sapiens


PROT- Human transmembrane
protein cDNA clone HP02061.


240


97


1060


gi795932
5


Homo sapiens


mRNA for KIAA1529 protein,
partial cds.


8481 


100 


1060 


gi128363
54 


Mus musculus 


putative 


511 


63 


1060 


AAW036 
26 


Homo sapiens 


UYNY Human thyrotropin GPR 
N-terminal sequence. 


736 


31 


1061 


AAY762 
00 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 77. 


262 


100 


1061 


gi159243
50 


Staphylococcus 
aureus subsp. 
aureus Mu50 


oxacillin resistance-related FmtC 
protein 


64 


31 


1061 


gi124836
31 


Staphylococcus 
aureus 


FmtC 


64 


31 


1062 


AAY362 
70 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 47. 


359 


100 


1062 


gi499607 
9 


Human 
herpesvirus 6 


64% identical to U95 gene of 
strain U1102 of HHV-6-MCMV
IE2 homolog, US22 gene family


68 


37 


1062 


gi573357 
6 


Human
herpesvirus 6B


U95 


66 


37 


1063 


gi526274 
8 


Rattus 
norvegicus 


Proline rich synapse associated 
protein 2 


3930 


93 


1063 


gi738105
6 


Rattus 
norvegicus 


Shank postsynaptic density 
protein 3a 


3895 


92 


1063 


gi133591
73 


Homo sapiens 


mRNA for KIAA1650 protein, 
partial cds. 


3085 


100 


1064 


gil43367 
49 


Homo sapiens 


16p13.3 sequence section 6 of 8.


974 


99 


1064 


gi104400
21 


Homo sapiens 


cDNA: FLJ23360 fis, clone
HEP15172.


974 


99 


1064 


AAB943 
88 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14511.


914 


78 


1065 


AAB941
88


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14511.


1227 


100 


1065 


gi104342
24 


Homo sapiens 


cDNA FLJ12623 fis, clone
NT2RM4001746. 


1227 


100 


1065 


gi126527
97 


Homo sapiens 


clone MGC:5179
IMAGE:2900118, mRNA,
complete cds. 


1084 


90 


1066 


AAY824 
88 


Homo sapiens 


NISC- Human L-type amino 
acid transporter 1 protein 
sequence SEQ ID NO:2. 


2438 


94 


1066


gi592673 
2 


Homo sapiens 


mRNA for L-type amino acid 
transporter 1, complete cds. 


2438 


94 


1066 


gi442664 
0 


Homo sapiens 


L-type amino acid transporter 
subunit LAT1 mRNA, complete 
cds. 


2438 


94 


1067 


AAG813 
26 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:170. 


1135 


100 


1067 


gil40359 
60 


Homo sapiens 


unnamed protein product 


1135 


100 


1067 


AAY788 
05 


Homo sapiens 


PROT- Hydrophobic domain 
containing protein clone 
HP10508 protein sequence. 


1053 


99 


1068 


gil20529 
83 


Homo sapiens 


mRNA; cDNA DKFZp434I1610 
(from clone DKFZp434I1610); 
complete cds. 


2502 


63 


SEQ ID 


Hit ID 


Species 


Description 


S score 


Percent 
identity 






1068 


AAM797 
60 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3406. 


2270 


61 


1068 


AAB943 
88 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14947. 


2050 


57 


1069 


g!126550 
91 


Homo sapiens 


AD-003 protein, clone 
MGC:783 IMAGE:3050940, 
mRNA, complete cds. 


546 


53 


1069 


gi652379 
9 


Homo sapiens 


adrenal gland protein AD-003 
mRNA, complete cds. 


536 


53 


1069 


AAM518 
24 


Homo sapiens 


BIOW- Human transcription 
regulator 13. 


370 


61 


1070 


gi 143280 
09 


Homo sapiens 


clone IMAGE:3942111, 
mRNA, partial cds. 


2392 


100 


1070 


gil45858 
69 


Homo sapiens 


hypothetical protein SB146 


2389 


99 


1070 


gil60417 
67 


Homo sapiens 


Similar to NADPH oxidase- 
related, C2 domain-containing 
protein, clone MGC:23187 
IMAGE:4851468, mRNA, 
complete cds. 


2384 


99 


1071 


ABB1224 
5 


Homo sapiens 


HYSE- Human CKSR-2 
homologue, SEQ ID NO:2615. 


256 


50 


1071 


AAZ8823 
9_aal 


Homo sapiens 


INCY- Human cytokine signal 
regulator CKSR-2 encoding 
cDNA SEQ ID NO:4. 


245 


51 


1071 


AAB676 
67 


Homo sapiens 


INCY- Amino acid sequence of 
human cytokine signal regulator 
2. 


245 


51 


1072 


gil54189 
97 


Homo sapiens 


capillary morphogenesis protein- 
1 mRNA, complete cds. 


3015 


100 


1072 


AAB955 
05 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:18062. 


2163 


99 


1072 


gi 104357 
18 


Homo sapiens 


cDNA FLJ13645 fis, clone 
PLACE1011310, weakly similar 
to MYOSIN HEAVY CHAIN, 
GIZZARD SMOOTH 
MUSCLE. 


2163 


99 


1073 


gi 159850 
82 


Homo sapiens 


unnamed protein product 


3158 


99 


1073 


AAU049 
58 


Homo sapiens 


GETH Human Interleukin 17 
receptor, IL-17RH4. 


3148 


97 


1073 


gi645355 
2 


Homo sapiens 


mRNA; cDNA 

DKFZp434N1928 (from clone 
DKFZp434N1928). 


3007 


100 


1074 


AAB938 
27 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:13641. 


884 


99 


1074 


AAB236 
06 


Homo sapiens 


Human secreted protein 
SEQ ID NO:12. 


884 


99 


1074 


gi 104331 
26 


Homo sapiens 


cDNA FLJ11790 fis, clone 
HEMBA1006091. 


884 


99 


1075 


gi 157779 
31 


Homo sapiens 


DGCRK3 gene for G-protein 
beta subunit like protein, 
complete cds. 


1731 


99 


1075 


gil50823 
09 


Homo sapiens 


clone MGC:19898 
IMAGE:4548339, mRNA, 
complete cds. 


1731 


99 


1075 


gil33591 
65 


Homo sapiens 


mRNA for KIAA1645 protein, 
partial cds. 


1731 


99 


1076 


gil66059 
05 


Homo sapiens 


unnamed protein product 


100 


32 


1076 


gil 16111 
88 


Homo sapiens 


Human DNA sequence from 

Hnne> PP/1 < 5 8/^5? nn 

cione x\x*t-ooovjo on 
chromosome 20q11.2-12. 
Contains the gene for a novel 
protein similar to ribosomal 
protein S2 (RPS2), a gene 
encoding a protein similar to 
basic protease inhibitor 
chelonianin, a novel gene, the 3' 
end of a novel gene, ESTs, 
STSs, GSSs and a CpG island, 
complete sequence. 


94 


32 



1076 


AAY359 
35 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO. 
184. 


oi 


29 


1077 


gi 165542 
29 


Homo sapiens 


cTCNJA FT F9^4?A fic rlnnp 
oi^iN/i rj-jij'fjo us, cione 

TST08261. 


1 104 


100 


1077 


gil50824 
26 


Homo sapiens 


Similar to RIKEN cDNA 
t-oiuujjrn gene, clone 
MGC:20203 IMAGE:4684687, 
mRNA, complete cds. 


1156 


99 


1077 


gil28581 
55 


Mus musculus 


putative 


1054 


89 


1078 


gil55592 
90 


Homo sapiens 


clone MGC:20275 
IMAGE:3842589, mRNA, 
complete cds. 


1917 


100 


1078 


gil56255 
64 


Homo sapiens 


WD40- and FYVE-domain 
containing protein 2 (WDF2) 
mRNA, complete cds. 


1893 


99 


1078 


gi 165542 
04 


Homo sapiens 


eDNA FT fie r\nnt> 

TST03665. 


1 "200 


1 AA 


1079 


AAB927 
75 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:11256. 


nil 


no 


1079 


gil40421 
69 


Homo sapiens 


cDNA FLJ14564 fis, clone 
NT2RM4000229, weakly similar 
to Gallus gallus actin filament- 
associated protein (AFAP-110) 
mRNA. 


3212 


99 


1079 


gil56208 
87 


Homo sapiens 


mRNA for KIAA1914 protein, 
partial cds. 


1702 


100 


1080 


AAG891 
72 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 292. 


591 


100 


1080 


AAY125 
32 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID NO: 197 from 
WO 9906553. 


512 


91 


1080 


AAB871 
73 


Homo sapiens 


MILL- Human secreted protein 
TANGO 402 S22T variant, SEQ 
ID NO:215. 


119 


44 


1081 


gi996380 
4 


Homo sapiens 


zinc finger protein ZNF286 
(ZNF286) mRNA, complete cds. 


574 


94 


1081 


gil40179 
65 


Homo sapiens 


mRNA for KIAA1874 protein, 
partial cds. 


517 


91 


1081 


AAU162 
38 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 1191. 


362 


100 


1082 


AAG038 
10 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7891. 


841 


99 


1082 


gi 186800 


Homo sapiens 


Human ribosomal protein L12 
mRNA, complete cds. 


541 


OQ 

yy 


1082 


gi!41983 
33 


Homo sapiens 


ribosomal protein L12, clone 
MGC:9760 IMAGE:3855674, 
mRNA, complete cds. 


841 




1083 


AAY129 
02 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID NO:492. 


134 


100 


1084 


gi 186774 


Homo sapiens 


Human Kruppel related zinc 
finger protein (HTF10) mRNA, 
complete cds. 


30oz 


5/ 


1084 


gi273935 
3 


Homo sapiens 


DNA from chromosome 19, 
BAC 33152, complete sequence. 


2985 


55 


1084 


AAM797 
39 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3385. 


2612 


53 


1085 


AAV481 
25_aal 


Homo sapiens 


HUMA- Nucleotide sequence 
encoding clone HMWGS46 of 
Prohibitin receptor family. 


1354 


93 


1085 


ABB1191 

3 


Homo sapiens 


HYSE- Human B-cell receptor 
associated protein homologue, 
SEQ ID NO:2283. 


1354 


93 


1085 


AAY944 
43 


Homo sapiens 


UNII Human repressor of 
estrogen repressor activity 
(REA) protein. 


1354 


93 


1086 


AAG723 
70 


Homo sapiens 


YEDA Human OR-like 
polypeptide query sequence, 
SEQ ID NO: 2051. 


333 


100 


1086 


AAG714 
53 


Homo sapiens 


YEDA Human olfactory 
receptor polypeptide, SEQ ID 
NO: 1134. 


333 


100 


1086 


AAE0455 
6 


Homo sapiens 


INCY- Human G-protein 
coupled receptor- 12 (GCREC- 
12) protein. 


315 


100 


1087 


AAG813 
23 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:164. 


792 


100 


1087 


gi 140359 
54 


Homo sapiens 


unnamed protein product 






1087 


gil28387 
99 


Mus musculus 


putative 


504 


/o 


1088 


AAE0969 
1 


Homo sapiens 


HUMA- Human gene 2 
encoding novel protein 
HCOKA10, SEQ ID NO:38. 


96 


55 


1088 


AAG761 
25 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:6889. 


96 


55 


1088 


AAU169 

A A 


Homo sapiens 


HUMA- Human novel secreted 

nratp in QPO IF> 1 RS 
proLcm, oJDv^ iu ioj. 


yo 


55 


1089 


AAY130 
37 


Homo sapiens 


GEST Human secreted protein 
encoded by 5' EST SEQ ID NO: 
51. 


187 


100 


1089 


AAY363 
95 


Homo sapiens 


HUMA- Fragment of human 
secreted protein encoded by 
gene 5. 


79 


39 


1089 


gil30968 
04 


Mus musculus 


Unknown (protein for 
IMAGE:3586067) 


77 


40 


1090 


gi 15 1478 
77 


Mus musculus 


Spred-2 


2098 


92 










1090 


gil51478 
75 


Mus musculus 


Spred-1 


1101 


52 


1090 


AAU174 
15 


Homo sapiens 


HUMA- Novel signal 
transduction pathway protein, 
Seq ID 980. 


1029 


98 


1091 


gil65517 
62 


Homo sapiens 


cDNA FLJ31812 fis, clone 
NT2RI2009406, moderately 
similar to Homo sapiens rec 
mRNA. 


804 


97 


1091 


gi668287 
3 


Homo sapiens 


rec mRNA, complete cds. 


453 


55 


1091 


gi723061 
2 


Rattus 
norvegicus 


small rec 


451 


56 


1092 


gi!22248 
85 


Homo sapiens 


mRNA; cDNA DKFZp761I1011 
(from clone DKFZp761I1011). 


241 


94 


1092 


gi388056 
0 


Caenorhabditis 
elegans 


Similarity to Yeast E1-E2 
ATPase (SW:YED1_YEAST), 
contains similarity to Pfam 
domain: PF00122 (E1-E2 
ATPase), Score=102.4, E- 
value=2.7e-28, N=4~cDNA EST 
yk5f9.5 comes from this 
gene~cDNA EST yk10d12.5 
comes from this gene~cDNA 
EST yk5f9.3 comes from this 
gene~cDNA EST yk10d12.3 
comes from this gene~cDNA 
EST yk40h11.5 comes from this 
gene~cDNA EST yk131g11.3 
comes from this gene~cDNA 
EST yk131g11.5 comes from 
this gene~cDNA EST yk133d6.5 
comes from this gene~cDNA 
EST yk318f2.3 comes from this 
gene~cDNA EST yk318f2.5 
comes from this gene~cDNA 
EST yk122a12.5 comes from 
this gene~cDNA EST yk248h1.5 
comes from this gene~cDNA 
EST yk260b8.5 comes from this 
gene~cDNA EST yk286h5.5 
comes from this gene~cDNA 
EST yk356g1.5 comes from this 
gene 


135 


50 


1092 


gil65166 
58 


Homo sapiens 


ORF for hypothetical protein. 


129 


52 


1093 


AAB409 
96 


Homo sapiens 


CURA- Human ORFX ORF760 
polypeptide sequence SEQ ID 
NO:1520. 


349 


40 


1093 


gil65166 
58 


Homo sapiens 


ORF for hypothetical protein. 


349 


40 


1093 


gi 104369 
63 


Homo sapiens 


cDNA:FLJ20986fis, clone 
CAE01156. 


349 


40 


1094 


gi!69042 
08 


Homo sapiens 


very large G protein-coupled 
receptor 1b (VLGR1) mRNA, 
complete cds. 


1014 


96 




1094 


gi 169042 
10 


Mus musculus 


very large G protein-coupled 
receptor 1 


808 


78 






1094 


gil20444 
71 


Homo sapiens 


mRNA; cDNA 

DKFZp761P0710 (from clone 
DKFZp761P0710); complete 
cds. 


173 


27 


1095 


gi 124839 
02 


Rattus 
norvegicus 


zinc finger protein HIT-10 


1545 


4/ 


1095 


AAB958 
62 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:18929. 


1234 


<A 


1095 


gil04367 
89 


Homo sapiens 


cDNA FLJ14345 fis, clone 
THYRO1001189, weakly 
similar to ZINC FINGER 
PROTEIN 91. 


1234 


<A 


1096 


AAB509 
63 


Homo sapiens 


GETH Human PRO1286 
protein. 


466 


i AA 


1096 


AAU124 
21 


Homo sapiens 


GETH Human PRO1286 
polypeptide sequence. 


4oo 


1 AA 

IUU 


1096 


AAU091 
79 


Homo sapiens 


GETH Human PRO1268 
polypeptide. 


ACC 

4oo 


1 AA 


1097 


AAE1202 
3 


Homo sapiens 


INCY- Human G-protein 
coupled receptor, GCREC-2. 


2849 


no 

yo 


1097 


AAG681 
26 


Homo sapiens 


FARB Human 7TM-GPCR 
protein sequence SEQ ID NO:6. 


2824 


y& 


1097 


gil75125 
39 


Mus musculus 


Unknown (protein for 
MGC:29266) 


2183 


/3 


1098 


AAB941 
08 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14340. 


273 


1 AA 
IUU 


1098 


AAU045 
58 


Homo sapiens 


GETH Human Stra6 homologue, 
PRO19578. 


273 


1 AA 
IUU 


1098 


AAU045 
57 


Homo sapiens 


GETH Human Stra6 homologue, 
PRO10282. 


273 


1 AA 
IUU 


1099 


AAU123 
82 


Homo sapiens 


GETH Human PRO792 
polypeptide sequence. 


137 


32 


1099 


AAB244 
16 


Homo sapiens 


GETH Human PRO792 protein 
sequence SEQ ID NO:155. 


137 


32 


1099 


AAB240 
55 


Homo sapiens 


GETH Human PRO792 protein 
sequence SEQ ID NO:31. 


137 


32 


1100 


gi633042 
2 


Homo sapiens 


mRNA for KIAA1202 protein, 
partial cds. 


AClt 1 

4yi3 


yy 


1100 


gi 123 140 
62 


Homo sapiens 


Human DNA sequence from 
clone RP11-119E20 on 
chromosome Xp11.21-11.23 
Contains part of the gene for 
KIAA1202 protein, ESTs, STSs 
and GSSs, complete sequence. 


4oyo 


yy 


1100 


gi 154212 
01 


Homo sapiens 


SHAP-A (SHAP) mRNA, partial 
cds, alternatively spliced. 


3845 


99 




07 


Homo sapiens 


cadherin 20 (CDH20) mRNA, 
complete cds. 


4170 


99 


1101 


gi410175 
1 


Mus musculus 


cadherin 7 precursor 


4032 


96 


1101 


gi854635 


Xenopus laevis 


F-cadherin 


3251 


78 


1102 


AAY363 
10 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 87. 


250 


100 


1103 


gil65514 
23 


Homo sapiens 


cDNA FLJ31547 fis, clone 
NT2RI2001010, weakly similar 
to FATTY ACYL-COA 
HYDROLASE PRECURSOR, 
MEDIUM CHAIN (EC 
3.1.2.14). 


853 


98 






1103 


gi264198 
6 


Mesocricetus 
auratus 


carboxylesterase precursor 


438 


50 


1103 


AAD006 
60_aai 


Homo sapiens 


INCY- Human Hydrolase 
protein-5 (HYDRL-5) encoding 
cDNA. 


428 


52 


1104 


AAE0483 
6 


Homo sapiens 


SUGE- Human SGP018 
phosphatase polypeptide. 


4915 


95 


1104 


gil27188 
35 


Homo sapiens 


unknown mRNA. 


3350 


99 


1104 


AAB409 
19 


Homo sapiens 


CURA- Human ORFX ORF683 
polypeptide sequence SEQ ID 
NO:1366. 


718 


79 


1105 


gil34926 
50 


Rattus 
norvegicus 


potassium channel beta subunit 
KChIP4 


1284 


99 


1105 


gil 15273 
18 


Mus musculus 


calsenilin-like protein 


1281 


99 


1105 


gil40913 
36 


Homo sapiens 


KCHIP4.1 (KCNIP4) mRNA, 
complete cds, alternatively 
spliced. 


1278 


99 


1106 


AAY131 
26 


Homo sapiens 


GEST Human secreted protein 
encoded by 5' EST SEQ ID NO: 
140. 


160 


96 


1107 


gi412659 
3 


Cyprinus carpio 


complement C3-S 


156 


29 


1107 


gi412658 
9 


Cyprinus carpio 


complement C3-H1 


148 


26 


1107 


gi305335 


Cavia porcellus 


complement C3 protein (GPC3) 
precursor 


146 


50 


1108 


gil 12448 
73 


Homo sapiens 


PR-domain-containing protein 
16 (PRDM16) mRNA, complete 
cds. 


6646 


99 


1108 


gil26978 
95 


Homo sapiens 


mRNA for KIAA1675 protein, 
partial cds. 


3570 


99 


1108 


gi545408 


human, leukemic 
cell line SKH1, 
mRNA Mutant, 
5938 nt]. [Homo 
sapiens 


AML1-EVI-1-AML1-EVI-1 
fusion protein {rearranged 
translocation} 


3181 


53 


1109 


gil07328 
15 


Homo sapiens 


concentrative Na+-nucleoside 
cotransporter hCNT3 (CNT3) 
mRNA, complete cds. 


3609 


100 


1109 


gil07328 
17 


Mus musculus 


concentrative Na+-nucleoside 
cotransporter mCNT3 


2872 


78 


1109 


gil54893 
79 


Mus musculus 


solute carrier family 28 (sodium- 
coupled nucleoside transporter), 
member 3 


2859 


77 


1110 


gil78651 
50 


Plasmodium 
berghei 


cysteine repeat modular protein 
3PbCRM3 


97 


30 


1110 


gi665071 
1 


Giardia 
intestinalis 


variant-specific surface protein 
VSP1267-2 


96 


29 


1110 


gi861294 


Caenorhabditis 
elegans 


F35D2.4 gene product 


94 


31 


1111 


gi795917 
7 1 


Homo sapiens 



mRNA for KIAA1458 protein, 
partial cds. 


2993 


100 


1111 


AAB947 
91 


Homo sapiens 


HELI- Human protein sequence 
SEQIDNO:15907. 


1037 


100 


1111 


AAU011 
01 


Homo sapiens 


HUMA- Gene 38 Human 
secreted protein homologous 
amino acid sequence. 


842 


99 


1112 


gi 173907 
60 


Mus musculus 


RIKEN cDNA 2610205H19 
gene 


664 


99 


1112 


gil28482 
92 


Mus musculus 


putative 


664 


99 


1112 


gi203072 


Rattus sp. 


0-44 protein 


661 


98 


1113 


gil42503 
19 


Homo sapiens 


clone IMAGE:3448367, 
mRNA, partial cds. 


2143 


93 


1113 


gi 145827 
73 


Homo sapiens 


sumo/sentrin-specific protease 


2138 


93 


1113 


gil70260 
32 


Macaca 
fascicularis 


hypothetical protein 


2068 


89 


1114 


AAB937 
77 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:13502. 


1064 


99 


1114 


AAM413 
87 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6318. 


1064 


99 


1114 


AAM396 
01 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2746. 


1064 


99 


1115 


gi 152772 
40 


Homo sapiens 


genomic DNA, chromosome 
6p21.3, HLA Class I region, 
section 17/20. 


2256 


100 


1115 


gill 8754 
05 


Homo sapiens 


HZFwl protein mRNA, 
complete cds. 


2251 


99 


1115 


gill 8754 
07 


Homo sapiens 


HZFw2 protein mRNA, 
complete cds. 


1733 


99 


1116 


AAB957 
26 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:18602. 


1644 


99 


1116 


AAB951 
09 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17089. 


1644 


99 


1116 


gi 140420 
78 


Homo sapiens 


cDNA FLJ14510 fis, clone 
NT2RM1000623, weakly similar 
to RIBONUCLEASE 
INHIBITOR. 


1644 


99 


1117 


gil40093 
46 


Homo sapiens 


nGAP-like protein (AF9q34) 
mRNA, complete cds. 


5475 


98 


1117 


gil52775 
25 


Rattus 
norvegicus 


DOC2/DAB2 interactive protein 


5006 


96 


1117 


gil26980 
31 


Homo sapiens 


mRNA for KIAA1743 protein, 
partial cds. 


3024 


98 


1118 


AAB652 
11 


Homo sapiens 


GETH Human PRO1152 
(UNQ582) protein sequence 
SEQ ID NO:216. 


1937 


99 


1118 


AAB688 
83 


Homo sapiens 


INCY- Human RECAP 
polypeptide, SEQ ID NO: 13. 


1937 


99 


1118 


AAU281 
83 


Homo sapiens 


HYSE- Novel human secretory 
protein, Seq ID No 352. 


1937 


99 


1119 


AAG021 
97 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6278. 


522 


99 


1119 


AAU172 
24 


Homo sapiens 


HUMA- Novel signal 
transduction pathway protein, 
Seq ID 789. 


485 


100 


1119 


AAU175 
97 


Homo sapiens 


HUMA- Novel signal 
transduction pathway protein, 
Seq ID 1162. 


449 


100 






1120 


gil79636 


Homo sapiens 


cytoplasmic phosphotyrosyl 
protein phosphatase (clone type 
1) complete cds. 


717 


88 


1120 


mil 4TC1 

glI14/ol 


Homo sapiens 


Human red cell-type low 
molecular weight acid 
phosphatase (ACP1) gene, exon 
6 and 7, complete cds. 


717 


88 


1120 


gi575913 
1 


Rattus 
norvegicus 


low molecular weight protein 
tyrosine phosphatase isoform A 


647 


76 


1121 


gl /07U46 

6 


Mus musculus 


unnamed protein product 


255 


61 


1121 


gi341381 

, u 


Mus musculus 


Bassoon 


105 


25 


1121 


gi50715 


Mus musculus 


myosin heavy chain 


103 


26 


1122 


gi 104403 
35 


Homo sapiens 


cDNA: FLJ23594 fis, clone 
LNG14867. 


1513 


100 


1122 


gi 146028 

on 

89 


Homo sapiens 


clone MGC:13119 
IMAGE:4100726, mRNA, 
complete cds. 


702 


53 


1122 


gi 142498 
27 


Homo sapiens 


clone MGC:10992 
IMAGE:3637387, mRNA, 
complete cds. 


702 


53 


1123 


ABB1120 


Homo sapiens 


HYSE- Human Kupffer cell 
receptor homologue, SEQ ID 
NO:1570. 


1838 


99 


1123 


gi 154890 
oo 


Mus musculus 


Kupffer cell c-type lectin 
receptor 


1000 


45 


1123 


gil66936 
0 


Mus musculus 


Kupffer cell receptor 


1000 


45 




1124 


AAY308 
47 


Homo sapiens 


HUMA- Human secreted protein 
encoded from gene 37. 


239 


100 


1124 


gil45960 
27 


Arabidopsis 
thaliana 


Unknown protein 


70 


45 


1124 


gi975895 
7 


Arabidopsis 
thaliana 


contains similarity to unknown 
protein~gb|AAF64546.1~gene_id:MRB17.15 


70 


45 


1125 


AAY013 
90 


Homo sapiens 


HUMA- Secreted protein 
encoded by gene 8 clone 
HTXDJ88. 


301 


100 


1125 


gil56399 
42 


Treponema 
pallidum] > 
[Treponema 
pallidum 


dicarboxylate transporter (dctM) 


63 


38 


1126 


AAUOol 

89 


Homo sapiens 


GENO- Cytosolic thyroid 
hormone-binding protein SEQ 
ID NO: 105. 


152 


78 


1126 


gi35505 


Homo sapiens 


H.sapiens M gene for M1-type 
and M2-type pyruvate kinase. 




no 

78 


1126 


gi338827 


Homo sapiens 


Human TCB gene encoding 
cytosolic thyroid hormone- 
binding protein, complete cds. 


152 


78 


1127 


gi426168 
9 


Homo sapiens 


complement factor B mRNA, 
complete cds. 


3976 


97 


1127 


gi297569 


Homo sapiens 


H.sapiens mRNA for 
complement factor B. 


3976 


97 


1127 


AAX041 
25_aal 


Homo sapiens 


UNIW Human Factor B 
encoding cDNA. 


3972 


97 






1128 


AAB500 
12 


Homo sapiens 


PHAA Wild-type human alpha7 
ligand gated ion channel. 


1794 


100 


1128 


AAB826 
90 


Homo sapiens 


TEWE- Nicotinic acetylcholine 
receptor alpha7. 


1794 


1 (\f\ 


1128 


AAB240 
88 


Homo sapiens 


GETH Human PRO2145 protein 
sequence SEQ ID NO:77. 


1794 


100 


1129 


gi537329 


Homo sapiens 


Human (clone pHAIV2-12) 
alpha-2 collagen type IV 
(COL4A2) mRNA, 3' end. 


3756 


99 


1129 


gil02232 
3 


Mus musculus 


collagen alpha-2(IV) chain 


3480 


87 


1129 


gi556299 


Mus musculus 


alpha-2 type IV collagen 


3477 


87 


1130 


gil50114 
89 


Tetrahymena 
thermophila 


heme maturase 


68 


29 


1131 


AAB509 
64 


Homo sapiens 


GETH Human PRO1313 
protein. 


926 


100 


1131 


AAB472 
90 


Homo sapiens 


GETH PRO1313 polypeptide. 


926 


100 


1131 


AAB244 
31 


Homo sapiens 


GETH Human PRO1313 protein 
sequence SEQ ID NO:216. 


926 


100 


1132 


gil28553 
07 


Mus musculus 


putative 


2919 


89 


1132 


gi 155595 
25 


Homo sapiens 


Similar to RIKEN cDNA 
4932416D09 gene, clone 
IMAGE:4578228, mRNA, 
partial cds. 


2523 


99 


1132 


AAY540 
52 


Homo sapiens 


PHAA An angiogenesis- 
associated protein which binds 
plasminogen. 


1435 


62 


1133 


AAY130 
84 


Homo sapiens 


GEST Human secreted protein 
encoded by 5' EST SEQ ID NO: 
98. 


127 


62 


1133 


gi 104404 
68 


Homo sapiens 


mRNA for FLJ00070 protein, 
partial cds. 


75 


41 


1133 


gi455864 
0 


Homo sapiens 


chromosome 19, cosmid 
R27516, complete sequence. 


74 


44 


1134 


AAM802 
75 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3921. 


1510 


99 


1134 


AAM792 
91 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1953. 


1500 


99 


1134 


gi 168774 
49 


Homo sapiens 


hypothetical protein 
MGC20781, clone MGC:21670 
IMAGE:3885455, mRNA, 
complete cds. 


1367 


100 


1135 


gi771086 
9 


Homo sapiens 


Human DNA sequence from 
clone RP11-31M2 on 
chromosome 9p23-24.3. 
Contains (part of) the gene for a 
novel protein similar to the GLI 
family of zinc finger proteins, 
STSs, GSSs and two putative 
CpG islands, complete sequence. 


1629 


100 


1135 


AAM254 
73 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:988. 


861 


98 


1135 


gil44860 
69 


Drosophila 
melanogaster 


Zn finger transcription factor 
lame duck 


699 


63 


1136 


AAR055 
62 


Homo sapiens 


DANA- Laminin-binding 
protein encoded by insert from 
J9 lambda gt10 phage. 


1327 


94 


1136 


gi307105 


Homo sapiens 


Human colon carcinoma laminin- 
binding protein mRNA, 
complete cds. 


1327 


94 


1136 


gi 163076 
02 


Homo sapiens 


laminin receptor 1 (67kD, 
ribosomal protein SA), clone 
MGC:17122 IMAGE:3446816, 
mRNA, complete cds. 


1327 


94 


1137 


AAB438 
84 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO:1329. 


734 


Oft 


1137 


gil79281 


Homo sapiens 


ATP synthase beta subunit 
precursor (ATPSB) gene, 
complete cds. 


734 


98 


1137 


gil67413 
73 


Homo sapiens 


Similar to ATP synthase, H+ 
transporting, mitochondrial F1 
complex, beta polypeptide, clone 
MGC:5231 IMAGE:2900336, 
mRNA, complete cds. 


734 


98 


1138 


AAG014 
68 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 5549. 


7XO 
4.0 ~ 


Vo 


1138 


AAG014 
67 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 5548. 


ivU 


1UU 


1138 


AAB438 
84 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO:1329. 


206 


100 


1139 


gi 179279 


Homo sapiens 


Human ATP synthase beta 
subunit gene, exons 8-10. 


757 


69 


1139 


gil28456 
67 


Mus musculus 


putative 


744 


68 


1139 


gi28940 


Homo sapiens 


Human mRNA for F1-ATPase 
beta subunit (F-1 beta). 


742 


69 


1140 


AAB438 
84 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO: 1329. 


1124 


RO 
oy 


1140 


gil79281 


Homo sapiens 


ATP synthase beta subunit 
precursor (ATPSB) gene, 
complete cds. 


1124 


89 


1140 


gi 167413 
73 


Homo sapiens 


Similar to ATP synthase, H+ 
transporting, mitochondrial F1 
complex, beta polypeptide, clone 
MGC:5231 IMAGE:2900336, 
mRNA, complete cds. 


1124 


89 


1141 


AAW540 
79 


Homo sapiens 


TEXA Homo sapiens BARD1 
sequence. 


4101 


100 


1141 


ai!7ini7 
51 w JUi / 

5 


Homo sapiens 


Human BRCA1-associated 
RING domain protein (BARD1) 
mRNA, complete cds. 


4101 


100 


1141 


AAW540 
81 


Homo sapiens 


TEXA Homo sapiens BARD3 
P553 sequence. 


4097 


99 


1142 


AAW540 
89 


Homo sapiens 


TEXA Homo sapiens BARD1 
MR658C sequence. 


394 


100 


1142 


AAW540 
88 


Homo sapiens 


TEXA Homo sapiens BARD1 
MS761N sequence. 


394 


100 


1142 


AAW540 
87 


Homo sapiens 


TEXA Homo sapiens BARD1 
MQ564H sequence. 


394 


100 






1143 


gil74322 
41 


Homo sapiens 


MSTP027 (MST027) mRNA, 
complete cds. 


730 


100 


1143 


gil60416 
88 


Homo sapiens 


hypothetical protein FLJ21661, 
clone MGC16816 
IMAGE:3922036, mRNA, 
complete cds. 


730 


100 


1143 


giI40398 
31 


Homo sapiens 


elongation factor G2 (EFG2) 
mRNA, complete cds; nuclear 
gene for mitochondrial product. 


730 


100 


1144 


AAM407 
29 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 5660. 


271 


98 


1144 


AAM389 
43 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2088. 


271 


98 


1144 


AAY123 
25 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID NO:356. 


271 


98 


1145 


gil 73903 
20 


Homo sapiens 


clone MGC:9678 
IMAGE:3846678, mRNA, 
complete cds. 


872 


100 


1145 


gil 20020 
02 


Homo sapiens 


clone 022f05 My030 protein 
mRNA, complete cds. 


872 


100 


1145 


AAY360 
68 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO. 
453. 


844 


97 


1146 


gil 73903 
20 


Homo sapiens 


clone MGC:9678 
IMAGE:3846678, mRNA, 
complete cds. 


504 


95 


1146 


gil20020 
02 


Homo sapiens 


clone 022f05 My030 protein 
mRNA, complete cds. 


504 


95 


1146 


AAY360 
68 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO. 
453. 


495 


94 


1147 


AAB616 
17 


Homo sapiens 


PROT- Human protein 
HP10688. 


1488 


100 


1147 


gil25784 
71 


Homo sapiens 


unnamed protein product 


1488 


100 


1147 


AAY027 
81 


Homo sapiens 


HUMA- Human secreted 
protein. 


1146 


77 


1148 


gil 65400 
2 


Homo sapiens 


H.sapiens mRNA for Sop2p-like 
protein. 


572 


99 


1148 


gil28050 
63 


Mus muscuius 


actin related protein 2/3 
complex, subunit 1A (41 kDa) 


567 


97 


1148 


gil 26672 
58 


Rattus 
norvegicus 


suppressor of profilin/p41 of 
actin-related complex 2/3 


567 


97 


1149 


AAB952 
58 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17435. 


3560 


100 


1149 


gil 04347 
40 


Homo sapiens 


cDNA FLJ12957 fis, clone 
NlzlvrzUUDOji, weakly similar 
to PROTEIN 4.1. 


3560 


100 


1149 


gil 00471 
61 


Homo sapiens 


mRNA for KIAA1548 protein, 
partial cds. 


2487 


100 


1150 


AAR998 
44 


Homo sapiens 


SUME Human natural killer cell, 
cell surface mol. NKG7. 


552 


71 


1150 


AAQ863 
84 aal 


Homo sapiens 


ASAN/ G-CSF stimulated 
human myelocytic cell cDNA. 


307 


65 


1150 


AAQ985 
51 aal 


Homo sapiens 


ASAN/ Granulocyte colony 
stimulating factor (G-CSF)- 
inducible myelocyte gene. 


307 


65 






1151 


AAM237 
81 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1306. 


620 




1151 


AAM238 
37 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1362. 


481 




1151 


AAY647 


Homo sapiens 


GEST Human 5' EST related 
polypeptide SEQ ID NO:908. 


471 


98 


1152 


AAB652 
77 


Homo sapiens 


GETH Human PRO1187 
(UNQ601) protein sequence 
SEQ ID NO:399. 


656 


100 


1152 


AAY667 
54 


Homo sapiens 


GETH Membrane-bound protein 
PRO1187. 


656 


100 


1152 


AAB240 
77 


Homo sapiens 


GETH Human PRO1187 protein 
sequence SEQ ID NO:55. 


656 


100 


1153 


gi833175 
7 


Homo sapiens 


X28 region near ALD locus 
containing dual specificity 
phosphatase 9 (DUSP9), 
ribosomal protein L18a 
(RPL18a), Ca2+/Calmodulin- 
dependent protein kinase I 
(CAMKI), creatine transporter 
(CRTR), CDM protein (CDM), 
adrenoleukodystrophy protein 
(ALD), plexin-related protein 
(PLXB3), muscle-specific serine 
kinase (MSSK), NAD-isocitrate 
dehydrogenase (IDH), 
translocon-associated protein 
delta (TRAP), and LU1 protein 
(LU1) genes, complete cds; and 
CCd pseudogene, complete 
sequence. 


1747 


92 


1153 


AAW884 
36 


Homo sapiens 


INCY- Disease associated 
protein kinase DAPK-5. 


1667 


88 


1153 


gi313539 
7 


Rattus 
norvegicus 


Ca2+/calmodulin-dependent 
protein kinase I beta 2 


1661 


87 


1154 


AAG675 
48 


Homo sapiens 


LEXI- Amino acid sequence of a 
human transporter protein. 




00 


1154 


gi!65886 
84 


Homo sapiens 


anion transporter/exchanger-8 
(SLC26A8) mRNA, complete 
cds. 


4H 


02 


1154 


gil53415 
54 


Homo sapiens 


putative anion transporter 
(SLC26A8) mRNA, complete 
cds. 


4n 
tij 


oa i 


1155 


AAB649 
53 


Homo sapiens 


ROSE/ Human secreted protein 
sequence encoded by gene 12 
SEQIDNO:131. 


995 


98 


1155 


gi!28543 
24 


Mus musculus 


putative 


781 




1155 


gil52170 

75 ■ 


Homo sapiens 


glioma pathogenesis-related 
protein (RTVP1) mRNA, 
complete cds. 


443 


40 


1156 


AAB262 
56 


Homo sapiens 


UNLO Wild-type human Rac1 
protein. 


508 


100 


1156 


gi857403 
8 


Homo sapiens 


rac1 gene. 


508 


100 


1156 


gi232620 
6 


Homo sapiens 


Rac3 (RAC3) mRNA, complete 
cds. 


508 


100 


1157 


gil04398 
53 


Homo sapiens 


cDNA: FLJ23235 fis, clone 
CAS04980. 


1536 


1 f\n 
1UU 


1157 


AAW855 
96 


Homo sapiens 


MERI Human GABA-A 
receptor theta subunit. 


73 


35 


1157 


gil59782 
47 


Yersinia pestis 


putative membrane protein 


73 


27 


1158 


gil04398 
53 


Homo sapiens 


cDNA: FLJ23235 fis, clone 
CAS04980. 


1347 


90 


1158 


AAW855 
96 


Homo sapiens 


MERI Human GABA-A 
receptor theta subunit. 


73 


35 


1158 


gil59782 
47 


Yersinia pestis 


putative membrane protein 


73 


27 


1159 


gill9331 
49 


Homo sapiens 


mRNA for 6-phosphofructo-2- 
kinase heart isoform, complete 
cds. 


2452 


100 


1159 


gi309041 
9 


Homo sapiens 


pfkfb2 gene, exons 1 to 15. 


2329 


99 


1159 


gi309042 
1 


Homo sapiens 


mRNA for 6-phosphofructo-2- 
kinase. 


2319 


98 


1160 


gi617778 
5 


Homo sapiens 


mRNA for HKR1, partial cds. 


3083 


99 


1160 


gi133254 
27 


Homo sapiens 


clone IMAGE:3928207, 
mRNA, partial cds. 


2388 


99 


1160 


gi487783 


Homo sapiens 


Human zinc finger protein 
ZNF133. 


1592 


54 


1161 


gi145858 
59 


Homo sapiens 


hypothetical protein SB 138 


1558 


98 


1161 


AAB946 
41 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 15526. 


1543 


100 


1161 


AAG644 
03 


Homo sapiens 


SHAN- Human paneth cell 
enhanced expression-like 
protein. 


1543 


100 


1162 


AAY536 
41 


Homo sapiens 


CHIR A bone marrow secreted 
protein designated BMS42. 


2182 


99 


1162 


gi966315 
3 


Homo sapiens 


partial mRNA for transport- 
secretion protein 2.2, (TTS-2.2 
gene). 


2179 


98 


1162 


gi966315 
1 


Homo sapiens 


partial mRNA for transport- 
secretion protein 2.1 (TTS-2.1 
gene). 


2179 


98 


1163 


AAM933 
60 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 2919. 


3300 


100 


1163 


gi168782 
06 


Homo sapiens 


hypothetical protein 
DKFZp434J037, clone 
MGC:29812 IMAGE:5088037, 
mRNA, complete cds. 


3300 


100 


1163 


gi120532 
81 


Homo sapiens 


mRNA; cDNA DKFZp434J037 
(from clone DKFZp434J037); 
complete cds. 


3300 


100 


1164 


AAG812 
82 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:82. 


3032 


100 


1164 


AAU171 
02 


Homo sapiens 


HUMA- Novel signal 
transduction pathway protein, 
Seq ID 667. 


3032 


100 


1164 


gi175299 
89 


Homo sapiens 


oxysterol-binding protein-like 
protein OSBPL9 (OSBPL9) 
mRNA, complete cds. 


3032 


100 


1165 


gi146271 
21 


Homo sapiens 


Human DNA sequence from 
clone RP5-824F16 on 
chromosome 20 Contains the 5' 
end of the ANGPT4 gene for 
angiopoietin 4, part of the gene 
for a novel protein similar to 
mouse thrombospondin type 1 
domain protein R-spondin, 
ESTs, STSs, GSSs and a CpG 
island, complete sequence. 


230 


64 


1165 


gi166053 
78 


Mus musculus 


unnamed protein product 


226 


42 


1165 


gi128506 
80 


Mus musculus 


putative 


226 


42 


1166 


AAF8417 
1 aal 


Homo sapiens 


CHUG- Human OATP-B coding 
sequence. 


3573 


97 


1166 


AAZ9240 
3 aal 


Homo sapiens 


SCHE cDNA encoding human 
DC-PGT. 


3573 


97 


1166 


AAC618 
83 aal 


Homo sapiens 


CHIR cDNA encoding a human 
secreted protein. 


3573 


97 


1167 


gi30224 


Homo sapiens 


H.sapiens CRP mRNA for C- 
reactive protein. 


327 


100 


1167 


gi30213 


Homo sapiens 


H.sapiens mRNA for C-reactive 
protein. 


327 


100 


1167 


gi181068 


Homo sapiens 


Human C-reactive protein gene, 
complete cds. 


327 


100 


1168 


AAH761 
94_aal 


Homo sapiens 


MILL- Human seven- 
transmembrane protein 31945 
coding sequence. 


3429 


99 


1168 


AAB857 
67 


Homo sapiens 


MILL- Human seven- 
transmembrane protein 31945 
sequence. 


3429 


99 


1168 


gi165519 
33 


Homo sapiens 


cDNA FLJ31951 fis, clone 
NT2RP7007177, weakly similar 
to Homo sapiens multiple 
membrane spanning receptor 
TRC8 mRNA. 


3429 


99 


1169 


AAB602 
99 


Homo sapiens 


MILL- Human aminopeptidase 
17867. 


5048 


99 


1169 


AAE0487 
9 


Homo sapiens 


INCY- Human protease protein- 
6 (PRTS-6). 


5048 


99 


1169 


gill 0659 

Art 


Homo sapiens 


aminopeptidase mRNA, 
complete cds. 


5048 


99 


1170 


gi128441 
36 


Mus musculus 


putative 


700 


52 


1170 


AAA999 
05 aal 


Homo sapiens 


GETH cDNA encoding human 
protein PRO846. 


662 


94 


1170 


AAB653 
00 




GETH Human PRO846 protein 
sequence SEQ ID NO:517. 


662 


94 


1171 


gi126539 
43 


Homo sapiens 


clone MGC:2742 
IMAGE^822914,mRNA, 
complete cds. 


3104 


100 


1171 


AAG012 
36 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 5317. 


522 


94 


1171 


gi130438 
7 


Saccharomyces 
cerevisiae var. 
diastaticus 


glucoamylase 


196 


22 


1172 


gi134304 
08 


Homo sapiens 


BTBD2 protein mRNA, 
complete cds. 


2705 


99 


1172 


gi170260 
62 


Mus musculus 


glucose signal repressing protein 


1946 


77 


1172 


gi134304 
06 


Homo sapiens 


BTBD1 protein mRNA, 
complete cds. 


1937 


76 


1173 


gi172261 
21 


Homo sapiens 


F-box protein (FBG4) mRNA, 
complete cds. 


1503 


100 


1173 


gi165539 
18 


Homo sapiens 


cDNA FLJ25205 fis, clone 
REC05844, highly similar to 
Mus musculus F-box protein 
FBX17 mRNA. 


1503 


100 


1173 


gi152145 
27 


Homo sapiens 


Similar to f-box only protein 17, 
clone MGC:9379 
IMAGE:3864760, mRNA, 
complete cds. 


1503 


100 


1174 


AAB883 
73 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone 
PSEC0109. 


2158 


100 


1174 


AAB932 
15 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:12194. 


2158 


100 


1174 


AAB931 
42 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 12045. 


2158 


100 


1175 


AAB883 
73 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone 
PSEC0109. 


3646 


95 


1175 


AAB932 
15 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:12194. 


3646 


95 


1175 


gi140425 
71 


Homo sapiens 


cDNA FLJ14791 fis, clone 
NT2RP4001064, weakly similar 
to SYNAPTONEMAL 
COMPLEX PROTEIN SC65. 


3646 


95 


1176 


AAB363 
92 


Homo sapiens 


CHUG- Human tumour 
suppressor Gros1-S protein SEQ 
ID NO:4. 


3861 


99 


1176 


gi111276 
38 


Homo sapiens 


GROS1-L protein mRNA, 
complete cds. 


3861 


99 


1176 


AAB883 
73 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone 
PSEC0109. 


3847 


99 


1177 


gi104385 
39 


Homo sapiens 


cDNA: FLJ22233 fis, clone 
HRC02016. 


2015 


100 


1177 


AAE0489 
2 


Homo sapiens 


INCY- Human transporter and 
ion channel-5 (TRICH-5) 
protein. 


2009 


99 


1177 


gi139256 
61 


Mus musculus 


sodium/calcium exchanger 
protein 


1708 


84 


1178 


AAB651 
92 


Homo sapiens 


GETH Human PRO839 
(UNQ472) protein sequence 
SEQ ID NO:167. 


366 


100 


1178 


AAG814 
32 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:382. 


366 


100 


1178 


AAY666 
69 


Homo sapiens 


GETH Membrane-bound protein 
PRO839. 


366 


100 


1179 


gi599683 


Bos taurus 


Cleavage and Polyadenylation 
specificity factor (CPSF) 100kD 
subunit 


4034 


98 


1179 


gi154890 
17 


Mus musculus 


cleavage and polyadenylation 
specific factor 2, 100kD subunit 


3993 


97 


1179 


gi233103 
6 


Mus musculus 


cleavage and polyadenylation 
specificity factor 


3993 


97 


1180 


AAZ3464 
8 aal 


Homo sapiens 


ZYMO Human growth factor 
zalpha5 cDNA. 


2182 


91 


1180 


AAZ4585 
2_aal 


Homo sapiens 


COMP- Human liver 
angiopoietin-like growth factor 
DNA sequence. 


2182 


91 


1180 


AAA497 
16 aal 


Homo sapiens 


GETH Human PRO179 cDNA 
clone DNA16451-1078. 


2182 


91 


1181 


AAH231 
83_aal 


Homo sapiens 


ISIS- Human macrophage 
migration inhibitory factor 
encoding DNA. 


564 


94 


1181 


AAB603 
25 


Homo sapiens 


KIRI Human wild-type 
glycosylation-inhibiting factor 
(GIF). 


564 


94 


1181 


AAB853 
43 


Homo sapiens 


ISIS- Human macrophage 
migration inhibitory factor. 


564 


94 


1182 


gi726393 
8 


Homo sapiens 


mRNA for sodium-glucose 
cotransporter (SGLT2 gene). 


3408 


100 


1182 


gi567946 
4 


Homo sapiens 


Human DNA sequence from 
clone RP1-90G24 on 
chromosome 22 Contains the 
RFPL2 gene for RET finger 
protein-like 2, a 
Immunoglobulin Lambda Light 
Chain C region (IGLC) 
pseudogene, the gene for 
SAAT1 (low affinity sodium 
glucose cotransporter 
(sodium:solute symporter 
family)) and a Cleavage and 
Polyadenylation Specific Factor 
CPSF 160 kD subunit 
pseudogene. Contains ESTs, 
GSSs and three putative CpG 
islands, complete sequence. 


3408 


100 


1182 


AAY312 
23 


Homo sapiens 


KOEP/ Human SAAT1 protein. 


3390 


99 


1183 


gi136233 
01 


Homo sapiens 


Similar to Zink transporter 2, 
clone MGC:11303 
IMAGE:3948165, mRNA, 
complete cds. 


1659 


100 


1183 


gi125637 
8 


Rattus 
norvegicus 


zinc transporter ZnT-2 


1186 


67 


1183 


gi176337 
4 


Mus musculus 


ZnT-3 


721 


44 


1184 


AAB920 
36 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:10951. 


1480 


81 


1184 


gi702218 
5 


Homo sapiens 


cDNA FLJ10260 fis, clone 
HEMBB1000973, moderately 
similar to Mus musculus 
schlafen3 mRNA. 


1480 


81 


1184 


AAM403 
57 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 3502. 


1479 


81 


1185 


AAB953 
51 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17641. 


3148 


99 


1185 


gi104350 
75 


Homo sapiens 


cDNA FLJ13170 fis, clone 
NT2RP3003809, weakly similar 
to SAV PROTEIN. 


3148 


99 


1185 


gi126543 
21 


Homo sapiens 


clone MGC:5347 
IMAGE:2985725, mRNA, 
complete cds. 


3106 


99 


1186 


gi105675 
90 


Homo sapiens 


sodium bicarbonate 
cotransporter-like protein 
mRNA, partial cds. 


5645 


100 


1186 


gi134477 
47 


Homo sapiens 


sodium bicarbonate 
cotransporter NBC4a (NBC4) 
mRNA, complete cds. 


5486 


99 


1186 


gi154195 
75 


Homo sapiens 


sodium bicarbonate 
cotransporter NBC4c (NBC4) 
mRNA, complete cds, 
alternatively spliced. 


5382 


98 


1187 


gi101858 
24 


Homo sapiens 


SEBOX (SEBOX) gene, 
complete cds. 


1209 


100 


1187 


gi100921 
60 


Mus musculus 


SEBOX 


581 


63 


1187 


gi101858 
26 


Rattus 
norvegicus 


SEBOX 


565 


63 


1188 


AAM435 
40 


Homo sapiens 


HUMA- Human polypeptide 
SEQ ID NO 218. 


246 


94 


1188 


AAM404 
68 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 5399. 


246 


94 


1188 


AAM386 
82 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 1827. 


246 


94 


1189 


gi104381 
35 


Homo sapiens 


cDNA: FLJ21924 fis, clone 
HEP04086. 


3703 


99 


1189 


AAM680 
38 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 28344. 


700 


100 


1189 


AAM556 
56 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded 
protein SEQ ID NO: 27761. 


700 


100 


1190 


gi143885 
55 


Macaca 
fascicularis 


hypothetical protein 


1240 


96 


1190 


AAG932 
58 


Homo sapiens 


NISC- Human protein HP10582. 


979 


100 


1190 


gi128548 
23 


Mus musculus 


putative 


882 


72 


1191 


AAB945 
45 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 15296. 


2344 


99 


1191 


gi104352 
28 


Homo sapiens 


cDNA FLJ13273 fis, clone 
OVARC1001010. 


2344 


99 


1191 


gi128604 
75 


Mus musculus 


putative 


1811 


83 


1192 


gi104404 
42 


Homo sapiens 


mRNA for FLJ00057 protein, 
partial cds. 


3033 


99 


1192 


gi120823 
03 


Mus musculus 


DNA helicase B 


1697 


61 


1192 


gi263521 
1 


Bacillus subtilis 


similar to conjugation transfer 
protein 


114 


22 


1193 


AAE1044 
5 


Homo sapiens 


BIOJ Human transmembrane 
protein (TMP). 


2286 


100 


1193 


AAY862 
30 


Homo sapiens 


HUMA- Human secreted protein 
nJv.rl5L53, SEQ ID NO: 145. 


1889 


85 


1193 


AAW680 
02 


Homo sapiens 


HUMA- Fragment of human 
secreted protein encoded by 
gene 69. 


798 


96 


1194 


gi175117 
29 


Homo sapiens 


hypothetical protein FLJ12598, 
clone MGC:31807 

complete cds. 


1180 


100 


1194 


AAB941 
62 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 14456. 


1373 


99 


1194 


gi104341 
83 


Homo sapiens 


cDNA FLJ12598 fis, clone 
NT2RM4001384. 


1173 


99 


1195 


gi724306 
9 

7 


Homo sapiens 


mRNA for KIAA1344 protein, 
partial cds. 


4137 


100 


1195 


AAB940 
73 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 14262. 


1839 


99 




25 


Homo sapiens 


cDNA FLJ12501 fis, clone 
NT2RM2001681. 


1839 


99 


1196 


AAB088 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 4 
SEQ ID NO:51. 


235 


67 


1196 




Vesicular 
stomatitis virus 


glycoprotein 


71 


30 


1196 


gi296009 
3 


Mycobacterium 

tuberculosis 

H37Rv 


hypothetical protein Rv3669 


71 


30 


1197 

I 17 1 


32 


Homo sapiens 


HUMA- Human reproductive 
system related antigen SEQ ID 
NO: 2970. 


1211 


98 


1197 

117/ 


6 


HERV-H/env62 


envelope protein 


763 


36 


1197 


gi*fVjyjo 
2 


Homo sapiens 


human endogenous retrovirus 
HERV-H19 pol protein (pol) 
gene, partial cds; env protein 
(env) gene, complete cds; and 3' 
LTR, complete sequence. 


757 


36 


1198 


gi140111 
81 


Homo sapiens 


tumor endothelial marker 8 
precursor (TEM8) mRNA, 
complete cds. 


1512 


100 


1198 


gi104379 
39 


Homo sapiens 


cDNA: FLJ21776 fis, clone 
rlbrUUl/l. 


1512 


100 


1198 


gi159875 
05 


Mus musculus 


tumor endothelial marker 8 
precursor 


1484 


97 


1199 


AAB652 
70 


Homo sapiens 


GETH Human PRO1158 
(UNQ588) protein sequence 
SEQ ID NO:375. 


609 


100 


1199 


AAB875 
59 


Homo sapiens 


GETH Human PRO1158. 


609 


100 


1199 


AAY667 
47 


Homo sapiens 


GETH Membrane-bound protein 
PRO1158. 


609 


100 


1200 


AAM413 
80 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6311. 


844 


87 


1200 


AAY017 
85 


Homo sapiens 


INCY- Human ubiquitin- 
conjugating enzyme HUBI-1. 


818 


87 


1200 


AAY253 
41 


Homo sapiens 


PROS- Human NCE-2 protein. 


818 


57 


1201 


AAW748 
99 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 172 clone 
HODCW06. 


197 


100 






1201 


gi137944 
93 


Guillardia theta 


hypothetical protein 


67 


36 


1202 


AAS0794 
0_aal 


Homo sapiens 


AREN- Human cDNA encoding 
G-protein coupled receptor, 
hRUP13. 


2087 


92 


1202 


AAS1258 
3_aal 


Homo sapiens 


FARB DNA encoding human 
serotonin-like G protein-coupled 
receptor (5-HT-GPCR). 


2087 


92 


1202 


AAD195 
79_aal 


Homo sapiens 


INCY- Human G-protein 
coupled receptor, GCREC-3 
cDNA. 


2087 


92 


1203 


AAS0794 
0_aal 


Homo sapiens 


AREN- Human cDNA encoding 
G-protein coupled receptor, 
hRUP13. 


2318 


100 


1203 


AAS1258 
3_aal 


Homo sapiens 


FARB DNA encoding human 
serotonin-like G protein-coupled 
receptor (5-HT-GPCR). 


2318 


100 


1203 


AAD195 
79_aal 


Homo sapiens 


INCY- Human G-protein 
coupled receptor, GCREC-3 
cDNA. 


2318 


100 


1204 


AAM936 
12 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3436. 


3475 


100 


1204 


gi140431 
11 


Homo sapiens 


Similar to ubiquitin associated 
and SH3 domain containing, A, 
clone MGC:15437 
IMAGE:2958242, mRNA, 
complete cds. 


3412 


100 


1204 


gi163041 
76 


Homo sapiens 


nm23-phosphorylated unknown 
substrate mRNA, complete cds. 


2759 


100 


1205 


AAB014 
24 


Homo sapiens 


MILL- Human TANGO 213. 


1264 


100 


1205 


AAM257 
35 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:1250. 


1066 


100 


1205 


AAY762 
67 


Homo sapiens 


HUMA- Fragment of human 
secreted protein encoded by 
gene 11. 


1066 


100 


1206 


AAW749 
39 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 49 clone 
HAGBI17. 


211 


100 


1207 


gi151265 
59 


Mus musculus 


Similar to Cd63 antigen 


504 


99 


1207 


gi141980 
88 


Mus musculus 


Cd63 antigen 


504 


99 


1207 


gi541060 
5 


Mus musculus 


tetraspanin membrane protein 
CD63 


504 


99 


1208 


AAB951 
48 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17173. 


jDyyj 


99 


1208 


gi119905 
93 


Homo sapiens 


organic anion transporter 
polypeptide-related protein 4 
(OATPRP4) mRNA, complete 
cds. 


3515 


99 


1208 


AAB491 
47 


Homo sapiens 


BRIM Human organic anion 
transport protein RP4 protein. 


3503 


99 


1209 


AAB733 
81 


Homo sapiens 


NANF- Human gas vesicle 
protein homologue hGvpT-b. 


1866 


100 


1209 


gi120055 
09 


Homo sapiens 


HT025 mRNA, complete cds. 


1866 


100 


1209 


gi104402 
36 


Homo sapiens 


cDNA: FLJ23518 fis, clone 
LNG04878. 


1600 


100 


1210 


gi120530 
21 


Homo sapiens 


mRNA; cDNA 

DKFZp434L0714 (from clone 
DKFZp434L0714); complete 
cds. 


4230 


99 


1210 


AAG643 
76 


Homo sapiens 


BIOD- Human II aminoacyl- 
tRNA synthetase 75. 


3517 


99 


1210 


gi104399 
91 


Homo sapiens 


cDNA: FLJ23339 fis, clone 
HEP13401. 


3010 


99 


1211 


AAB589 
41 


Homo sapiens 


HUMA- Breast and ovarian 
cancer associated antigen protein 
sequence SEQ ID 649. 


1188 


99 


1211 


gi120059 
18 


Homo sapiens 


CDA016 mRNA, complete cds. 


1176 


98 


1211 


AAB366 
15 


Homo sapiens 


INCY- Human FLEXHT-37 
protein sequence SEQ ID 
NO:37. 


862 


Oft 


1212 


gi104371 
94 


Homo sapiens 


cDNA: FLJ21148 fis, clone 
CAS09413. 


2238 


96 


1212 


gi128456 
48 


Mus musculus 


putative 


1808 


80 


1212 


gi122603 
3 


Saccharomyces 
cerevisiae 


unknown 


259 


25 


1213 


gi104371 
94 


Homo sapiens 


cDNA: FLJ21148 fis, clone 
CAS09413. 


2203 


91 


1213 


gi128456 
48 


Mus musculus 


putative 


1774 


76 


1213 


gi122603 
3 


Saccharomyces 
cerevisiae 


unknown 


249 


25 


1214 


gi663099 
2 


Danio rerio 


NCC receptor protein 1 


418 


42 


1214 


gi127116 
29 


Oreochromis 
niloticus 


nonspecific cytotoxic cell 
receptor protein 


389 


41 


1214 


gi663099 
8 


Ictalurus 
punctatus 


NCC receptor protein 1 


361 


40 


1215 


AAC843 
82 aal 


Homo sapiens 


MILL- Human TANGO 209 
polypeptide coding sequence. 


2430 


100 


1215 


AAS1457 
6_aal 


Homo sapiens 


ELIL Human cDNA encoding 
cysteine-rich secreted protein 
hCRSPL 


2430 


100 


1215 


AAB481 
07 


Homo sapiens 


MILL- Human TANGO 209 
polypeptide. 


2430 


100 


1216 


ABB1188 
0 


Homo sapiens 


HYSE- Human lipocortin 
homologue, SEQ ID NO:2250. 


1091 


99 


1216 


AAB434 
43 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO:888. 


1071 


99 


1216 


AAR224 
02 


Homo sapiens 


BIOS Human lipocortin. 


1050 


99 


1217 


ABB1188 
0 


Homo sapiens 


HYSE- Human lipocortin 
homologue, SEQ ID NO:2250. 


1511 


100 


1217 


AAB434 
43 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO:888. 


1511 


100 


1217 


AAY084 
12 


Homo sapiens 


UYMC- Human p-40/annexin I 
protein. 


1511 


100 


1218 


ABB1188 
0 


Homo sapiens 


HYSE- Human lipocortin 
homologue, SEQ ID NO:2250. 


967 


100 


1218 


AAB434 
43 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO:888. 


967 


100 


1218 


AAY084 
12 


Homo sapiens 


UYMC- Human p-40/annexin I 
protein. 


967 


100 


1219 


gi410698 
4 


Homo sapiens 


Human DNA from chromosome 
19-specific cosmid R30923, 
genomic sequence, complete 
sequence. 


2992 


100 


1219 


AAB427 
92 


Homo sapiens 


CURA- Human ORFX 
ORF2556 polypeptide sequence 
SEQ ID NO:5112. 


2967 


99 


1219 


gi146031 
76 


Homo sapiens 


Similar to RIKEN cDNA 
2410153K17 gene, clone 
MGC:19595 IMAGE:3840843, 
mRNA, complete cds. 


2432 


100 


1220 


AAG814 
43 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:404. 


492 


100 


1220 


gi150802 
20 


Homo sapiens 


Similar to hypothetical protein, 
MGC:7764, clone MGC:20548 
IMAGE:3607345, mRNA, 
complete cds. 


492 


100 


1220 


gi140361 
94 


Homo sapiens 


unnamed protein product 


492 


100 


1221 


AAE0518 
3 


Homo sapiens 


INCY- Human drug 
metabolising enzyme (DME-14) 
protein. 


2258 


100 


1221 


AAY913 
48 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 3 
SEQ ID NO:69. 


2258 


100 


1221 


gi118545 
2 


Homo sapiens 


Human cytochrome P450 
monooxygenase CYP2J2 
mRNA, complete cds. 


932 


44 


1222 


AAE0518 
3 


Homo sapiens 


INCY- Human drug 
metabolising enzyme (DME-14) 
protein. 


1344 


100 


1222 


AAY913 
48 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 3 
SEQ ID NO:69. 


1344 


100 


1222 


gi118545 
2 


Homo sapiens 


Human cytochrome P450 
monooxygenase CYP2J2 
mRNA, complete cds. 


548 


44 


1223 


AAW346 
18 


Homo sapiens 


IMUT- Human C3 protein 
mutant DV-7N. 


597 


34 


1223 


AAW346 
17 


Homo sapiens 


IMUT- Human C3 protein 
mutant DV-6. 


597 


34 


1223 


AAW346 
16 


Homo sapiens 


IMUT- Human C3 protein 
mutant CV-5. 


597 


34 


1224 


gi173900 
00 


Homo sapiens 


Similar to RIKEN cDNA 
5730455013 gene, clone 
MGC:24718 IMAGE:4278022, 
mRNA, complete cds. 


1693 


100 


1224 


AAB417 
53 


Homo sapiens 


CURA- Human ORFX 
ORF1517 polypeptide sequence 
SEQ ID NO:3034. 


1166 


100 


1224 


gi128570 
19 


Mus musculus 


putative 


1036 


87 


1225 


AAG009 
03 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4984. 


294 


100 


1225 


gi339697 


Homo sapiens 


thymosin beta-10 gene, 3'end. 


169 


97 


1225 


" gi339687 


Homo sapiens 


Human thymosin beta-10 
mRNA, complete cds. 


169 


97 


1226 


AAY913 
86 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 41 
SEQ ID NO: 107. 


558 


100 


1226 


"11999289" 
3 


Homo sapiens 


phosphoinositol 3-phosphate 
binding protein-1 (PEPP1) 
mRNA, complete cds. 


75 


40 


1226 


gi101907 
44 


Homo sapiens 


pleckstrin homology domain- 
containing, family A 
(phosphoinositide binding 
specific) member 4; 
phosphoinositol 3-phosphate 
binding protein-1 


75 


40 


1227 


AAY600 
08 


Homo sapiens 


META- Human endometrium 
tumour EST encoded protein 68. 


2286 


100 


1227 


AAW747 
97 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 68 clone 
HKIXR69. 


2286 


100 


1227 


gi576230 
5 


Mus musculus 


COP1 protein 


2268 


99 


1228 


AAG892 
92 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 412. 


119 


71 


1228 


AAM937 
24 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3677. 


119 


71 


1228 


gi128036 
69 


Homo sapiens 


CDK4-binding protein 
p34SEI1, clone MGC:3465 
IMAGE:3613213, mRNA, 
complete cds. 


119 


71 


1229 


AAY913 
70 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 25 
SEQ ID NO:91. 


293 


100 


1230 


gi120045 
83 


Mus musculus 


unknown 


2566 


81 


1230 


gi128365 
62 


Mus musculus 


putative 


2541 


80 


1230 


AAB418 
60 


Homo sapiens 


CURA- Human ORFX 
ORF1624 polypeptide sequence 
SEQ ID NO:3248. 


1401 


100 


1231 


AAG760 
80 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:6844. 


300 


84 


1231 


AAG013 
47 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 5428. 


300 


84 


1231 


AAG013 
46 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 5427. 


300 


84 


1232 


ABB1165 
5 


Homo sapiens 


HYSE- Human secreted protein 
homologue, SEQ ID NO:2025. 


2233 


99 


1232 


gi159297 
48 


Mus musculus 


Unknown (protein for 
IMAGE:4222865) 


1826 


81 


1232 


gi128527 
13 


Mus musculus 


putative 


1815 


81 


1233 


AAB279 
77 


Homo sapiens 


HUMA- Human secreted protein 
BLAST search protein SEQ ID 
NO: 131. 


290 


96 


1233 


AAY134 
58 


Homo sapiens 


UYRQ Amino acid sequence of 
human Fe65. 


290 


96 


1233 


gi392493 
6 


Homo sapiens 


Fe65 protein gene, exons 3 
through 14 and partial cds. 


290 


96 


1234 


AAO077 
68 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 21660. 


294 


100 


1234 


AAY027 
75 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 12 clone 
HFTCU19. 


288 


98 


1234 


gi729722 
6 


Drosophila 
melanogaster 


CG4497 gene product 


67 


42 


1235 


AAB942 
20 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14581. 


3391 


99 


1235 


gi104342 
88 


Homo sapiens 


cDNA FLJ12661 fis, clone 
NT2RM4002189, weakly similar 
to GLUCOAMYLASE S1/S2 
PRECURSOR (EC 3.2.1.3). 


3191 


99 


1235 


gi140183 
79 


Schizosaccharomyces 
pombe 


hypothetical protein; sequence 
orphan; low similarity to 
glycoamylases and other cell 
surface proteins; contains ~250- 
270 copies of a 13 AA repeat, 
NSSTPITSSSIL 


355 


26 


1236 


AAU035 
93 


Homo sapiens 


INCY- Human DNA 
modification protein, DNAMP- 
8. 


4977 


98 


1236 


gi606313 
7 


Mus musculus 


F-box protein FBX18 


4406 


92 


1236 


AAB942 
00 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14538. 


4039 


99 


1237 


AAB013 
95 


Homo sapiens 


INCY- Neuron-associated 
protein. 


1507 


95 


1237 


AAU205 
29 


Homo sapiens 


HUMA- Human secreted 
protein, Seq ID No 521. 


1156 


86 


1237 


gi128430 
76 


Mus musculus 


putative 


644 


95 


1238 


gi165508 
22 


Homo sapiens 


cDNA FLJ31400 fis, clone 
NT2NE1000185, weakly similar 
to UDP-N- 
ACETYLGLUCOSAMINE-- 
PEPTIDE N- 
ACETYLGLUCOSAMINYLTR 
ANSFERASE 110 KDA 
SUBUNIT (EC 2.4.1.-). 


1999 


96 


1238 


gi136041 
67 


Homo sapiens 


ARG99 mRNA, complete cds. 


781 


100 


1238 


gi672116 
1 


Arabidopsis 
thaliana 


putative O-linked GlcNAc 
transferase 


372 


27 


1239 


AAY734 
14 


Homo sapiens 


GEMY Human secreted protein 
clone ybl01_J protein sequence 
SEQ ID NO:50. 


472 


100 


1239 


gi719065 


Chlamydia * 


conserved hypothetical protein 


70 


28 


1239 


gi332875 
9 


Chlamydia 
trachomatis 


hypothetical protein 


68 


28 


1240 


AAW886 
15 


Homo sapiens 


HUMA- Secreted protein 
encoded by gene 82 clone 
HNGBT31. 


525 


97 


1240 


AAY840 
40 


Homo sapiens 


USGO Amino acid sequence of 
cancer associated polypeptide 
CH1-9a11-2. 


71 


30 


1240 


gi148609 
75 


human 
herpesvirus 2 


DNA polymerase 


70 


36 


1241 


gi111214 
83 


Homo sapiens 


mRNA for calsyntenin-2 (CS2 
gene). 


5080 


100 


1241 


gi111215 
06 


Mus musculus 


calsyntenin-2 


4733 


94 


1241 


gi115582 
48 


Gallus gallus 


calsyntenin-1 protein 


2962 


57 


1242 


AAM933 
76 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 2951. 


1034 


100 


1242 


AAW781 
51 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 26 clone 
HT3BE24. 


1034 


100 


1242 


AAY298 
65 


Homo sapiens 


GEMY Human secreted protein 
clone pe213 1. 


1034 


100 


1243 


gi104395 
94 


Homo sapiens 


cDNA: FLJ23033 fis, clone 
LNG02005. 


2982 


99 


1243 


AAB413 
96 


Homo sapiens 


CURA- Human ORFX 
ORF1160 polypeptide sequence 
SEQ ID NO:2320. 


2486 


100 


1243 


gi128530 
18 


Mus musculus 


putative 


2002 


88 


1244 


AAY122 
52 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID NO: 565. 


321 


92 


1244 


AAU163 
32 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 1285. 


320 


92 


1244 


AAU158 
73 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 826. 


320 


92 


1245 


AAE0487 
8 


Homo sapiens 


INCY- Human protease protein- 
5 (PRTS-5). 


1542 


100 


1245 


gi135436 
81 


Homo sapiens 


clone MGC:14793 
IMAGE:4047601, mRNA, 
complete cds. 


1524 


99 


1245 


AAB475 
27 


Homo sapiens 


MILL- Ubiquitin hydrolase-like 
protein -long form. 


1499 


100 


1246 


gi140435 
23 


Homo sapiens 


clone IMAGE:4098694, 
mRNA, partial cds. 


1991 


97 


1246 


gi120608 
22 


Homo sapiens 


serologically defined breast 
cancer antigen NY-BR-16 
mRNA, complete cds. 


1991 


97 


1246 


gi129638 
69 


Mus musculus 


gene trap ankyrin repeat 
containing protein 


1980 


96 


1247 


AAB543 
57 


Homo sapiens 


HUMA- Human pancreatic 
cancer antigen protein sequence 
SEQ ID NO:809. 


301 


100 


1247 


AAY486 
oo 


Homo sapiens 


META- Human breast tumour- 
associated protein 61. 


285 


98 


1247 


gi156129 
59 


Bacillus 
halodurans 


BH0396~unknown conserved 
protein 


63 


32 






1248 


AAM405 
66 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 5497. 


379 


46 


1248 


AAM387 
80 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 1925. 


379 


46 


1248 


gi653960 
6 


Homo sapiens 


metastasis suppressor protein 
mRNA, complete cds. 


379 


46 


1249 


gi139925 
24 


Homo sapiens 


mRNA for type II alpha 
phosphatidylinositol 4-kinase 
gene. 


2546 


100 


1249 


gi131119 
89 


Homo sapiens 


Similar to hypothetical protein 
FLJ11105, clone MGC:4395 
IMAGE:2905670, mRNA, 
complete cds. 


2546 


100 


1249 


gi136607 
55 


Rattus 
norvegicus 


55 kDa type II 

phosphatidylinositol 4-kinase 


2409 


OA 


1250 


AAB954 
25 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17833. 




inn 


1250 


gi104354 
87 


Homo sapiens 


cDNA FLJ13465 fis, clone 
PLACE1003493, weakly similar 
to ENDOTHELIAL CELL 
MULTIMERIN PRECURSOR. 


/ifi71 
4o/ 1 




1250 


AAY363 
00 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 77. 






1251 


gi644932 
6 


Mus musculus 


retinoic acid-responsive protein 
HA1R-62 




68 


1251 


gi647869 
4 


Soybean mosaic 
virus 


P1 protease 


67 


33 


1252 


gi124078 
29 


Homo sapiens 


netrin 4 precursor (NTN4) 
mRNA, complete cds. 


3361 


99 


1252 


AAG664 
49 


Homo sapiens 


ObHO Human beta-netrin. 


JJn / 


99 


1252 


gi111200 
48 


Homo sapiens 


beta-netrin mRNA, complete 
cds. 


"^47 

jj 1 r / 


99 


1253 


gi168780 
83 


Homo sapiens 


enolase 3, (beta, muscle), clone 
MGC:29581 IMAGE:4902149, 
mRNA, complete cds. 




94 

XT 


1253 


gi34789 


Homo sapiens 


H.sapiens mRNA for muscle 
specific enolase (MSE) (EC 
4.2.1.11). 


555 


94 


1253 


gi31170 


Homo sapiens 


Human ENO3 mRNA for beta- 
enolase (EC 4.2.1.11). 


551 


93 


1254 


AAY078 
95 


Homo sapiens 


HUMA- Human secreted protein 
fragment encoded from gene 44. 


537 


100 


1254 


gi171320 
ft? 


Nostoc sp. PCC 
7120 


ORF_ID:alr2988~hypothetical 
protein 


69 


38 


1255 


AAB937 
52 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:13419. 


1000 


99 


1255 


gi104327 
85 


Homo sapiens 


cDNA FLJ11515 fis, clone 
HEMBA1002241, weakly 
similar to PROLIFERATING- 
CELL NUCLEOLAR 
ANTIGEN P120. 


1000 


99 


1255 


gi!56801 
85 


Homo sapiens 


Similar to RIKEN cDNA 
2810405F18 gene, clone 
MGC:22960 IMAGE:4865283, 
mRNA, complete cds. 


875 


99 






1256 


gil41608 
80 


Homo sapiens 


PKCI-I-related HIT protein 
mRNA, complete cds. 


827 


100 


1256 


gil36501 

on. 

CO 


Homo sapiens 


HIT-17kDa mRNA, complete 
cds. 


827 


100 


1256 


AA*vj257 

10 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:1254. 


806 


94 


1257 


gil28492 

77 


Mus musculus 


putative 


793 


93 


1257 


gi642529 
5 


Caenorhabditis 
elegans 


predicted using 

Genefinder~contains similarity 
to Pfam domain: PF00023 (Ank 
repeat), Score=71.3,E- 
value=6.5e-18,N=2 


200 


40 


1257 


gi433575 
6 


Arabidopsis 
thaliana 


putative ankyrin 


195 


44 


1258 


AAdo/o 
09 


Homo sapiens 


GETH Human PRO1890. 


1307 


99 


1258 


AAB733 
09 


Homo sapiens 


UROG- Human C-type lectin 
transmembrane antigen PC- 
LECTIN, SEQ ID NO:2. 


1307 


99 




AAU124 
41 


Homo sapiens 


GETH Human PRO1890 
polypeptide sequence. 


1307 


99 




AAY053 
68 


Homo sapiens 


UYPR- Human HCMV 
inducible gene protein, SEQ ID 
NO 4. 


1682 


97 




A A VrtTA 

AAY070 
36 


Homo sapiens 


LUDW- Breast cancer 
associated antigen precursor 
sequence. 


1682 


97 


1259 


gi995603 

D 


Homo sapiens 


clone CDABP0047 mRNA 
sequence. 


1682 


97 


1260 


AAB189 

06 


Homo sapiens 


INCY- Amino acid sequence of 
a human transmembrane protein. 


1132 


100 


1260 


gil55303 
13 


Homo sapiens 


clone MGC:2853 
IMAGE:2987806, mRNA, 
complete cds. 


1132 


100 


1260 


gil39375 
95 


Homo sapiens 


Similar to RIKEN cDNA 
1810017F10 gene, clone 
MGC:2583 IMAGE:3140820, 
mRNA, complete cds. 


1132 


100 


1261 


AAB189 


Homo sapiens 


INCY- Amino acid sequence of 
a human transmembrane protein. 


926 


85 


1261 


gil55303 
13 


Homo sapiens 


clone MGC:2853 
IMAGE:2987806, mRNA, 
complete cds. 


926 


85 


1261 


gil39375 
95 


Homo sapiens 


Similar to RIKEN cDNA 
1810017F10 gene, clone 
MGC:2583 IMAGE:3140820, 
mRNA, complete cds. 


926 


85 


1262 


AAB944 
34 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 15054. 


1629 


100 


1262 


gil75121 
03 


Homo sapiens 


hypothetical protein FLJ13044, 
clone MGC:20950 
IMAGE:4577143, mRNA, 
complete cds. 


1629 


100 


1262 


gi104348 
74 


Homo sapiens 


cDNA FLJ13044 fis, clone 
NT2RP3001355, weakly similar 
to TRICARBOXYLATE 
TRANSPORT PROTEIN 
PRECURSOR. 


1629 


100 






1263 


gU78640 


Homo sapiens 


Human angiotensinogen mRNA, 
complete cds. 


2366 


96 


1263 


gil 19749 
7 


Homo sapiens 


H.sapiens angiotensinogen gene 
exon 2 (and joined CDS). 


2366 


96 


1263 


AAB673 
50 


Homo sapiens 


UTAH Human angiotensinogen 
protein. 


2363 


96 


1264 


gi240126 
1 


Homo sapiens 


HLA-C gene (HLA-Cw*0701 
allele), complete cds. 


1099 


98 


1264 


gi!52772 
17 


Homo sapiens 


genomic DNA, chromosome 
6p21.3, HLA Class I region, 
section 7/20. 


1099 


98 


1264 


gil 478 19 
7 


Homo sapiens 


H.sapiens mRNA for human 
leukocyte antigen C alpha chain. 


1099 


98 


1265 


gi386775 


Homo sapiens 


Human MHC class I HLA-B8 
chain gene (A1,2; B5,8), 
complete cds. 


1033 


92 


1265 


gi240125 
9 


Homo sapiens 


HLA-B gene (HLA-B*0801 
allele), complete cds. 


1033 


92 


1265 


gil 52772 
16 


Homo sapiens 


genomic DNA, chromosome 
6p21.3, HLA Class I region, 
section 6/20. 


1033 


92 


1266 


AAM237 
60 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1285. 


937 


100 


1266 


gi458664 


Homo sapiens 


Human MHC class I antigen 
HLA-B (HLA-B-0704 allele) 
mRNA, complete cds. 


937 


100 


1266 


gi307221 


Homo sapiens 


Human MHC HLA protein 
(allele B7) complete cds. 


937 


100 


1267 


gi32181 


Homo sapiens 


H.sapiens HLA-Bw57 gene. 


977 


88 


1267 


gi307222 


Homo sapiens 


Human MHC HLA protein, 
allele B57, complete cds. 


977 


88 


1267 


gi674637 
1 


Homo sapiens 


HLA class I histocompatibility 
antigen B-57 (HLA-B57) 
mRNA, complete cds. 


971 


88 


1268 


gil 49705 
74 


Homo sapiens 


HLA-A gene for MHC class I 
antigen, allele HLA-A*68011, 
exons 1-8. 


1 Qfil 


OA 


1268 


gil 72403 
4 


Homo sapiens 


Human HLA class I A locus 
antigen A*68new mRNA, 
complete cds. 


1 TG£ 

1 /yO 




1268 


gi613877 
0 


Homo sapiens 


HLA-A gene for MHC Class I 
antigen, A*68 allele, exons 1-8. 


1792 


93 


1269 


gi307225 


Homo sapiens 


Human MHC HLA protein, 
allele A25, complete cds. 


1 1 An 
I loll 


yo 


1269 


gil42503 
59 


Homo sapiens 


clone MGC:17191 
IMAGE:4157200, mRNA, 
complete cds. 


1160 


96 


1269 


gil 52697 
6 


Homo sapiens 


H.sapiens mRNA for human 
leucocyte antigen, HLA-A25. 


1152 


96 


1270 


gi645336 
5 


Homo sapiens 


mRNA for human leucocyte 
antigen B (HLA-B gene, 
B*1501102N allele). 


314 


88 


1270 


AAY647 
49 


Homo sapiens 


GEST Human 5' EST related 
polypeptide SEQ ID NO:910. 


148 


81 






1270 


AAP7015 
5 


Homo sapiens 


BEHW Sequence encoded by 
genomic DNA encoding human 
histocompatibility antigen HLA- 
B27. 


138 


62 


1271 


gi 825674 


Homo sapiens 


H.sapiens gene encoding HLA- 
Cw6, exons 1-3. 


1120 


99 


1271 


gi297097 


Homo sapiens 


H.sapiens mRNA for HLA- 
Cw*0602. 


1120 


99 


1271 


gil94448 
0 


Homo sapiens 


mRNA for HLA-Cw*0602, 
partial cds. 


1120 


99 


1272 


gi222589 
0 


Homo sapiens 


Human HLA-A26null allele, 
complete cds. 


977 


85 


1272 


gi487909 


Homo sapiens 


mRNA for HLA-A11 antigen 
A11.1, complete cds. 


847 


94 


1272 


gi446825 
6 


Homo sapiens 


mRNA for MHC class I antigen, 
allele A*1103. 


847 


94 


1273 


gi860968 


Homo sapiens 


Human HLA-A1 gene. 


1122 


100 


1273 


gi825024 
5 


Homo sapiens 


HLA-A*0101 gene for MHC 
class I antigen, exons 1-8. 


1122 


100 


1273 


gi386893 


Homo sapiens 


Human MHC class I HLA-A1 
chain gene (A1,2; B8,5), 
complete cds. 


1122 


100 


1274 


AAB944 
86 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15170. 


2982 


100 


1274 


AAM940 
18 


Homo sapiens 


HELI- Human stomach cancer 
expressed polypeptide SEQ ID 
NO 106. 


2982 


100 


1274 


gil40424 
96 


Homo sapiens 


cDNA FLJ14750 fis, clone 
NT2RP3002948, weakly similar 
to RING CANAL PROTEIN. 


2982 


100 


1275 


gil20532 
77 


Homo sapiens 


mRNA; cDNA DKFZp434B227 
(from clone DKFZp434B227); 
complete cds. 


2242 


99 


1275 


gil04403 
05 


Homo sapiens 


cDNA: FLJ23571 fis, clone 
LNG12303. 


2124 


94 


1275 


gill6116 
03 


Macaca 
fascicularis 


hypothetical protein 


2064 


90 


1276 


gi891893 
2 


Mus musculus 


unnamed protein product 


2826 


95 


1276 


gi632981 
2 


Homo sapiens 


mRNA for KIAA1130 protein, 
partial cds. 


2716 


100 


1276 


AAS1459 
5_aal 


Homo sapiens 


MILL- Human cDNA encoding 
a novel glycosyltransferase 
33877. 


1606 


58 


1277 


AAB906 
76 


Homo sapiens 


GEMY Human BV141J2 
protein sequence SEQ ID 28. 


400 


98 


1277 


AAW589 

85 


Homo sapiens 


GEMY Homo sapiens adult 
brain clone BV341_2 encoded 
protein. 


201 


100 


1277 


gi295048 
1 


Schizosaccharom 
yces pombe 


hypothetical protein 


71 


34 


1278 


AAY144 
55 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 45 clone 
HCFBJ91. 


284 


100 


1279 


AAB858 
85 


Homo sapiens 


HELI- Human adenylate kinase 
3 (AK3)-like protein. 


135 


78 


1279 


AAB934 
87 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:12786. 


135 


78 


1279 


AAB930 
66 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:11883. 


135 


78 


1280 


AAB652 
42 


Homo sapiens 


GETH Human PRO1291 
(UNQ659) protein sequence 
SEQ ID NO:291. 


1378 


100 


1280 


AAB875 
55 


Homo sapiens 


GETH Human PRO1291. 


1378 


100 


1280 


AAY667 
19 


Homo sapiens 


GETH Membrane-bound protein 
PRO1291. 


1378 


100 


1281 


AAB956 
82 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 18481. 


1675 


96 


1281 


gil40419 
89 


Homo sapiens 


cDNA FLJ14456 fis, clone 
HEMBB1001915, moderately 
similar to UBIQUITIN 
CARBOXYL-TERMINAL 
HYDROLASE 64E(EC 
3.1.2.15). 


1675 


1 


1281 


gi729543 
6 


Drosophila 
melanogaster 


Ubp64E gene product 


892 


71 


1282 


AAW678 
41 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 35 clone 
HOABG65. 


500 


100 


1282 


AAY122 
38 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID NO: 551. 


423 


100 


1282 


AAY119 
53 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID No: 553. 


276 


94 


1283 


AAM259 
58 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:1473. 


484 


78 


1283 


AAW678 
69 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 63 clone 
HHGDB72. 


484 


78 


1283 


AAY022 
85 


Homo sapiens 


GEMY Secreted protein clone 
. k232 2x polypeptide sequence. 


467 


78 


1284 


gi29963 


Homo sapiens 


Human gene for creatine kinase 
B (EC 2.7.3.2). 


162 


71 


1284 


gi 180570 


Homo sapiens 


Human creatine kinase isozyme 
CK-B gene, exon 8. 


162 


71 


1284 


gi 180555 


Homo sapiens 


Human creatine kinase-B 
mRNA, complete cds. 




71 


1285 


gil28498 
20 


Mus musculus 


putative 


1 I/O 


71 


1285 


AAM253 
89 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:904. 


790 


98 


1285 


AAY249 
17 


Homo sapiens 


INCY- Human phosphatase 
HPA-2. 


550 


39 


1286 


gi!28498 
20 


Mus musculus 


putative 


1456 


85 


1286 


AAY249 
17 


Homo sapiens 


INCY- Human phosphatase 
HPA-2. 


798 


48 


1286 


gi897982 
5 


Homo sapiens 


Human DNA sequence from 
clone RP4-776F14 on 
chromosome 20p12.2-13. 
Contains the 5' end of the 
FKBP1A gene for FK506- 
binding protein 1A (12kD), the 
gene for P47 protein, part of a 
novel member of the PTPNS 
(protein tyrosine phosphatase, 
non-receptor type substrate 1) 
gene family, ESTs, STSs, GSSs 
and two CpG islands, complete 
sequence. 


798 


48 






1287 


gi3 17190 
8 


Homo sapiens 


mRNA for DnaJ protein. 


659 


100 


1287 


gil60418 
37 


Homo sapiens 


DnaJ (Hsp40) homolog, 
subfamily A, member 2, clone 
MGC:9488 IMAGE:3922477, 
mRNA, complete cds. 


659 


100 


1287 


gi 152783 
95 


Homo sapiens 


Similar to DnaJ (Hsp40) 
homolog, subfamily A, member 
2, clone MGC:819 
IMAGE:3505399, mRNA, 
complete cds. 


659 


100 


1288 


gil57412 
21 


Homo sapiens 


gene overexpressed in 
astrocytoma mRNA, complete 
cds. 


3391 


99 


1288 


gil35440 
35 


Homo sapiens 


clone IMAGE:3535476, 
mRNA, partial cds. 


2095 


100 


1288 


gil68781 
87 


Homo sapiens 


Similar to gene overexpressed 
in astrocytoma, clone 
MGC:29809 IMAGE:5017710, 
mRNA, complete cds. 


2079 


100 


1289 


AAY927 

19 


Homo sapiens 


GENZ Human polycystin. 


20114 


99 


1289 


gi904223 


Homo sapiens 


polycystic kidney disease I 
protein (PKD1) mRNA, 
complete cds. 


20114 


99 


1289 


AAW238 
30 


Homo sapiens 


DEKR- Human PKD1 protein. 


20111 


99 


1290 


AAY559 

65 1 


Homo sapiens 


SUGE- Full length human ZC4 
protein. 


1906 


100 


1290 


AAY559 
34 


Homo sapiens 


SUGE- Human ZC4 protein. 


1808 


100 


1290 


gi278017 
3 


Homo sapiens 


Human DNA sequence from 
PAC 82J11 and cosmid U134E6 
on chromosome Xq22. Contains 
NIK like and Thyroxin-binding 
globulin precursor (T4-binding 
globulin, TBG) genes, ESTs and 
STSs. 


1588 


95 


1291 


gil04380 
63 


Homo sapiens 


cDNA: FLJ21868 fis, clone 
HEP02432. 


1605 


99 




rrt 1 0*7*7/1 
gllJZ//4 

43 


Mus musculus 


Unknown (protein for 
MGC:19083) 


1379 


84 


1291 


AAB429 
53 


Homo sapiens 


CURA- Human ORFX 
ORF2717 polypeptide sequence 
SEQ ID NO:5434. 


522 


100 


1292 


gi795926 
3 


Homo sapiens 


mRNA for KIAA1501 protein, 
partial cds. 


1824 


100 


1292 


ABB1748 
8 


Homo sapiens 


HUMA- Human nervous system 
related polypeptide SEQ ID NO 
6145. 


984 


100 


1292 


AAB979 
11 


Homo sapiens 


SHAN- Human G-protein 
activating protein 129 SEQ ID 
NO:2. 


965 


57 






1293 


gil04430 
48 


Homo sapiens 


Human DNA sequence from 
clone RP11-465L10 on 
chromosome 20. Contains 10 
CpG islands, ESTs, STSs and 
GSSs. Contains the gene for a 
novel protein similar to 
Drosophila CG11399, the gene 
for a novel C2H2 type zinc 
finger protein similar to chicken 
FZF-1, a Ferritin light 
polypeptide (FTL) pseudogene, 
the MMP9 gene for matrix 
metalloproteinase 9 (gelatinase 
B, 92kD gelatinase, 92kD type 
IV collagenase) (CLG4B), a 
novel gene, the SLC12A5 gene 
for solute carrier family 12, 
(potassium-chloride transporter) 
member 5 (KIAA1176) and the 
3' end of gene KIAA1637, 
complete sequence. 


3567 


90 


1293 


AAH282 
26 aal 


Homo sapiens 


PFIZ Nucleotide sequence of 
matrix metalloproteinase-9. 


3556 


90 


1293 


AAB204 
91 


Homo sapiens 


SMIK Human matrix 
metalloproteinase-9 (MMP-9). 


3556 


90 


1294 


AAH282 
26 aal 


Homo sapiens 


PFIZ Nucleotide sequence of 
matrix metalloproteinase-9. 


2375 


100 


1294 


AAB204 
91 


Homo sapiens 


SMIK Human matrix 
metalloproteinase-9 (MMP-9). 


2375 


100 


1294 


AAB846 
11 


Homo sapiens 


PFIZ Amino acid sequence of 
matrix metalloproteinase-9. 


2375 


100 


1295 


AAG040 
88 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 8169. 


601 


91 


1295 


gi897761 


Homo sapiens 


H.sapiens mRNA for protein 
phosphatase 5. 


450 


92 


1295 


gi455863 
8 


Homo sapiens 


chromosome 19, BAC 82621 
(CIT-B-139a18), complete 
sequence. 


450 


92 


1296 


AAY647 
86 


Homo sapiens 


GEST Human 5' EST related 
polypeptide SEQ ID NO:947. 


200 


100 


1296 


gil62640 
94 


Sinorhizobium 
meliloti 


HYPOTHETICAL PROTEIN 


63 


35 


1297 


gil26980 
13 


Homo sapiens 


mRNA for KIAA1734 protein, 
partial cds. 


3889 


100 


1297 


gi 1043 86 
94 


Homo sapiens 


cDNA: FLJ22346 fis, clone 
HRC06158. 


Dot / 


QO 
77 


1297 


AAB943 
54 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 14875. 


2421 


99 


1298 


gil45756 
79 


Homo sapiens 


hemicentin mRNA, complete 
cds. 


10314 


89 


1298 


gil65519 
93 


Homo sapiens 


cDNA FLJ31995 fis, clone 
NT2RP7009236, weakly similar 
to BASEMENT MEMBRANE- 
SPECIFIC HEPARAN 
SULFATE PROTEOGLYCAN 
CORE PROTEIN 
PRECURSOR. 


4274 


91 




1298 


gil38728 
13 


Homo sapiens 


partial mRNA for fibulin-6 
(FIBL-6 gene). 


2907 


99 


1299 


gi548084 


Rattus 


olfactory cyclic nucleotide-gated 
channel 


2811 


93 


1299 


gi538129 


Rattus 
norvegicus 


cyclic nucleotide gated cation 
channel 


2811 


93 


1299 


gi908824 


Bos taurus 


alpha subunit of CNG-channel 
expressed in bovine testis and 
retinal cone 


1576 


53 


1300 


AAB419 
63 


Homo sapiens 


CURA- Human ORFX 
ORF1727 polypeptide sequence 
SEQ ID NO:3454. 


514 


100 


1300 


gi 155297 
03 


Homo sapiens 


importin 9 mRNA, complete 
cds. 


514 


100 


1300 


gil51867 
58 


Mus musculus 


RANBP9 isoform 2 


514 


100 


1301 


gil05053 
49 


Homo sapiens 


regulator of G-protein signaling 
(RGS8) mRNA, complete cds. 


926 


100 


1301 


gil73820 
46 


Homo sapiens 


unnamed protein product 


926 


100 


1301 


gi266205 
7 


Rattus 
norvegicus 


RGS8 


921 


98 


1302 


AAB953 
02 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17538. 


1242 


100 


1302 


gi 104349 
69 


Homo sapiens 


cDNA FLJ13105 fis, clone 
NT2RP3002351, weakly similar 
to Human mRNA for NAD- 
dependent methylene 
tetrahydrofolate dehydrogenase 
cyclohydrolase(EC 1.5.1.15). 


1242 


100 


1302 


gi!28347 
26 


Mus musculus 


putative 


873 


94 


1303 


gi546603 


Homo sapiens 


glutamine synthetase [human, 
tumorous liver, mRNA Partial, 
2631 nt]. 


1787 


100 


1303 


gil75120 
38 


Homo sapiens 


clone MGC:20095 
IMAGE:3352740,mRNA, 
complete cds. 


1787 


100 


1303 


gil50801 
57 


Homo sapiens 


glutamate-ammonia ligase 
(glutamine synthase), clone 
MGC:20322 IMAGE:4137547, 
mRNA, complete cds. 


1787 


100 


1304 


gi546603 


Homo sapiens 


glutamine synthetase [human, 
tumorous liver, mRNA Partial, 
2631 nt]. 


432 


89 


1304 


gi31833 


Homo sapiens 


Human mRNA for glutamine 
synthetase (E.C. 6.3.1.2). 


432 


89 


1304 


gi31831 


Homo sapiens 


Human rearranged mRNA for 
glutamine synthase. 


432 


89 


1305 


gil65517 
55 


Homo sapiens 


cDNA FLJ31807 fis, clone 
NT2RI2009215, moderately 
similar to ZINC FINGER 
PROTEIN 165. 


492 


54 






1305 


AAM416 
49 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6580. 


462 


51 


1305 


AAM939 
17 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 4075. 


462 


51 


1306 


AAU162 
46 


Homo sapiens 


HUMA- Human novel secreted 
protein, SeqID 1199. 


1010 


96 


1306 


gil28328 
45 


Mus musculus 


putative 


585 


83 


1306 


AAU162 
40 


Homo sapiens 


HUMA- Human novel secreted 
protein, SeqID 1193. 


342 


95 


1307 


gi979845 
2 


Homo sapiens 


mRNA for putative capacitative 
calcium channel (trp7 gene). 


4470 


100 


1307 


gi532685 
4 


Mus musculus 


receptor-activated calcium 
channel 


4392 


98 


1307 


gi229590 
3 


Homo sapiens 


Human putative calcium influx 
channel (htrp3) mRNA, 
complete cds. 


3529 


81 


1308 


gi611460 
1 


Homo sapiens 


mRNA for stromal antigen 3 
(STAG3 gene). 


281 


74 


1308 


gi309042 
3 


Mus musculus 


stag3 


203 


49 


1308 


gil31951 
63 


Rattus 
norvegicus 


stromal antigen 3 


199 


47 


1309 


gi611460 
1 


Homo sapiens 


mRNA for stromal antigen 3 
(STAG3 gene). 


295 


82 


1309 


gi309042 
3 


Mus musculus 


stag3 


200 


55 


1309 


gil3l951 
63 


Rattus 
norvegicus 


stromal antigen 3 


198 


54 


1310 


gi985856 
2 


Homo sapiens 


Rh type B glycoprotein (RHBG) 
mRNA, complete cds. 


2176 


99 


1310 


gil57184 
71 


Homo sapiens 


Rh type B glycoprotein (RHBG) 
gene, exons 9, and 10 and 
complete cds. 


2176 


99 


1310 


gil43460 
06 


Pan troglodytes 


Rh type B glycoprotein 


2161 


99 


1311 


gi724314 
9 


Homo sapiens 


mRNA for KIAA1384 protein, 
partial cds. 


3377 


100 


1311 


gil28576 
73 


Mus musculus 


putative 


2817 


98 


1311 


gi724297 
3 


Homo sapiens 


mRNA for KIAA1309 protein, 
partial cds. 


913 


33 


1312 


AAY862 
97 


Homo sapiens 


HUMA- Human secreted protein 
HLDCE79, SEQ ID NO:212. 


530 


100 


1312 


AAY216 
23 


Homo sapiens 


REGC Ligand binding domain 
of nuclear receptor hGR. 


74 


32 


1312 


AAP8091 
9 


Homo sapiens 


SALK Sequence of the primary 
protein sequence of human 
glucocorticoid receptor (hGR). 


74 




1313 


gil 10226 
90 


Homo sapiens 


ifpl mRNA for interferon- 
responsive finger protein 1 long 
form, complete cds. 


4302 


99 


1313 


AAB955 
86 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 18252. 


2612 


100 


1313 


gil40428 
69 


Homo sapiens 


cDNA FLJ14970 fis, clone 
THYRO1000501, weakly 
similar to 52 KD RO PROTEIN. 


2612 


100 






1314 


giI786!3 
72 


Homo sapiens 


lysyl oxidase-like 4 mRNA, 
complete cds. 


984 


93 


1314 


gil66601 
31 


Homo sapiens 


lysyl oxidase-like 4 (LOXL4) 
mRNA, complete cds. 


984 


93 


1314 


gi!46694 

/ 4 


Homo sapiens 


lysyl oxidase-related protein C 
(LOXC) mRNA, complete cds. 


984 


93 


1315 


gil65494 
49 


Homo sapiens 


cDNA FLJ30273 fis, clone 
BRACE2002685, moderately 
similar to Homo sapiens 
androgen-regulated short-chain 
dehydrogenase/reductase 1 
(ARSDR1) mRNA. 


1612 


98 


1315 


giI28616 
68 


Mus musculus 


putative 


1374 


84 


1315 


gil67406 
49 


Mus musculus 


Similar to RIKEN cDNA 
A930033N07 gene 


1315 


81 


1316 


gil 42862 
86 


Homo sapiens 


Similar to hypothetical protein 
FLJ20515, clone MGC:2696 
IMAGE:2820596, mRNA, 
complete cds. 


1006 


100 


1316 


AAY530 
23 


Homo sapiens 


GEMY Human secreted protein 
clone qf662 3 protein sequence 
SEQ ID NO:52. 


990 


99 


1316 


AAE0483 
5 


Homo sapiens 


SUGE- Human SGP001 
phosphatase polypeptide. 


931 


95 


1317 


AAN50O 
69_aal 


Homo sapiens 


MITUDNA encoding 
cardiodilatin in plasmid 
pHANF48. 


771 


100 


1317 


AAW981 
93 


Homo sapiens 


CURA- Human atrial natriuretic 
peptide prohormone. 


771 


100 


1317 


AAP5I24 
1 


Homo sapiens 


BIOT- Sequence of pre-pro-atrial 
natriuretic/vasodilator polypeptide 
(ANVP). 


771 


100 


1318 


AAE0618 
3 


Homo sapiens 


HUMA- Human gene 57 
encoded secreted protein 
fragment, SEQ ID NO:245. 


3182 


89 


1318 


AAY872 
06 


Homo sapiens 


HUMA- Human secreted protein 
sequence SEQ ID NO:245. 


3182 


89 


1318 


AAE0609 
7 


Homo sapiens 


HUMA- Human gene 57 
encoded secreted protein 
HRACD80, SEQ ID NO:159. 


2906 


88 


1319 


AAU089 
95 


Homo sapiens 


MILL- Human G protein- 
coupled receptor, GPCR, 45449. 


410 


96 


1319 


gil22142 
87 


Homo sapiens 


Human DNA sequence from 
clone RP3-402H5 on 
chromosome 6p12.3-21.1. 
Contains ESTs, STSs and GSSs. 
Contains the 3' part of a gene for 
a novel 7 transmembrane 
receptor of the rhodopsin family 
and a novel gene, complete 
sequence. 


410 


96 


1319 


gi!57973 
18 


Homo sapiens 


unnamed protein product 


410 


96 


1320 


gi897761 


Homo sapiens 


H.sapiens mRNA for protein 
phosphatase 5. 


387 


100 






1320 


gi455863 


Homo sapiens 


chromosome 19, BAC 82621 
(CIT-B-139a18), complete 
sequence. 


JO/ 


100 


1320 


gil 28050 
33 


Homo sapiens 


protein phosphatase 5, catalytic 
subunit, clone MGC:5260 
IMAGE:3459309, mRNA, 
complete cds. 


Jot 


100 


1321 


AAB566 
13 


Homo sapiens 


ROSE/ Human prostate cancer 
antigen protein sequence SEQ 
ID NO:1191. 


1174 


100 


1321 


gil 82644 


Homo sapiens 


Human FK506-binding protein 
25 (FKBP25) mRNA, complete 
cds. 


1 \69 

l 1U7 


100 


1321 


gil 82626 


Homo sapiens 


Human rapamycin binding 
protein (FK506) mRNA, 
complete cds. 


1169 


100 


1322 


gil 50426 
91 


Homo sapiens 


sorting nexin 18 (SNX18) 
mRNA, complete cds. 


2895 


100 


1322 


gil 55590 
64 


Mus musculus 


SNAG1 


OA A f> 


oo 


1322 


AAW990 
23 


Homo sapiens 


MOUN 17G2 peptide sequence. 


1605 


95 


1323 


gil28048 
03 


Homo sapiens 


clone MGC:4499 
IMAGE:29645o5, mRNA, 
complete cds. 


1266 


100 


1323 


gil 26545 
15 


Homo sapiens 


clone MuC:2o27 
IMAGE:2964565, mRNA, 
complete cds. 


10 AA 




1323 


AAB543 
74 


Homo sapiens 


HUMA- Human pancreatic 
cancer antigen protein sequence 
oc\£ iu rsvJ.ozo. 


IZOl 




1324 


gil 19075 
99 


Homo sapiens 


protein kinase HIPK2 mRNA, 
complete cds. 


6242 


99 


1324 


AAB656 

61 


Homo sapiens 


SUGE- Novel protein kinase, 
SEQ ID NO:188. 


6086 


97 


1324 


gl 172253 
77 


Homo sapiens 


homeodomain interacting 
protein kinase 2 (HIPK2) 
mRNA, complete cds. 


6083 


97 


1325 


AAB656 
61 


Homo sapiens 


SUGE- Novel protein kinase, 

SEQ ID NO:188. 


6124 


99 


1325 


gil 72253 
77 


Homo sapiens 


homeodomain interacting 
protein kinase 2 (HIPK2) 
mRNA, complete cds. 


6121 


99 


1325 


gil 19075 
99 


Homo sapiens 


protein kinase HIPK2 mRNA, 
complete cds. 


6072 


97 


1326 


gil 65522 

70 


Homo sapiens 


cDNA FLJ32230 fis, clone 
PLACE6004464 weakly similar 
to Human placenta (Diff48) 
mRNA. 


3064 


99 


1326 


gil32742 
02 


Homo sapiens 


Human DNA sequence from 
clone RP4-530I15 on 
chromosome 20. Contains the 3' 
end of the PTPN1 gene for 
protein tyrosine phosphatase, 
non-receptor type 1 (EC 
3.1.3.48), the gene for a novel 
protein similar to placental 
protein DIFF40, an RPL36 (60S 
ribosomal protein L36) 
pseudogene, a novel gene, two 
putative novel genes, ESTs, 
STSs and GSSs, complete 
sequence. 


2261 


99 






1326 


gi222471 
3 


Homo sapiens 


Human mRNA for KIAA0386 
gene, complete cds. 


963 


34 


1327 


AAU121 
77 


Homo sapiens 


GETH Human PRO305 
polypeptide sequence. 


1472 


82 


1327 


AAY814 
87 


Homo sapiens 


FUJY Human cathepsin L2. 


1472 


82 


1327 


AAY023 
58 


Homo sapiens 


ONOY Polypeptide identified by 
the signal sequence trap method. 


1472 


82 


1328 


AAU121 
77 


Homo sapiens 


GETH Human PRO305 
polypeptide sequence. 


1698 


84 


1328 


AAY814 
87 


Homo sapiens 


FUJY Human cathepsin L2. 


1698 


84 


1328 


AAY023 
58 


Homo sapiens 


ONOY Polypeptide identified by 
the signal sequence trap method. 


1698 


84 


1329 


AAY873 
29 


Homo sapiens 


INCY- Human signal peptide 
containing protein HSPP-106 
SEQ ID NO:106. 


692 


94 


1329 


gil51454 
28 


Caenorhabditis 
elegans 


Hypothetical protein 
Y22D7AL.14 


74 


23 


1329 


gi361801 
6 


Human 
immunodeficiency 
virus type 1 


nef 


73 


35 


1330 


AAB823 
15 


Homo sapiens 


UYCO Human immunoglobulin 
receptor isoform IRTA2c. 


1120 


99 


1330 


AAB823 
14 


Homo sapiens 


UYCO Human immunoglobulin 
receptor isoform IRTA2b. 


1120 


99 


1330 


AAB823 
13 


Homo sapiens 


UYCO Human immunoglobulin 
receptor isoform IRTA2a. 


1120 


99 


1331 


gil43367 
57 


Homo sapiens 


16p13.3 sequence section 6 of 8. 


1178 


100 


1331 


gil34362 
69 


Homo sapiens 


hypothetical protein FLJ20898, 
clone MGC:10688 
IMAGE:3622114, mRNA, 
complete cds. 


1178 


100 


1331 


AAG814 
30 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:378. 


954 


100 


1332 


gi568955 

7 1 


Homo sapiens 


mRNA for KIAA1110 protein, 
partial cds. 


3820 


99 


1332 


ABB1171 
3 


Homo sapiens 


HYSE- Human KIAA1110 
protein homologue, SEQ ID 
NO:2083. 


3809 


99 


1332 


gi388224 
7 


Homo sapiens 


mRNA for KIAA0763 protein, 
complete cds. 


1587 


44 


1333 


gil33662 
77 


Homo sapiens 


Human DNA sequence from 
clone RP5-998H6 on 
chromosome 20q13.1. Contains 
the gene for the ortholog of rat 
PB-Cadherin, ESTs, STSs, GSS, 
two CpG islands and genomic 
marker D20S17, complete 
sequence. 


4283 


100 






1333 


gi476057 
8 


Mus musculus 


PB-Cadherin 


3918 


92 


1333 


gil39890 
6 


Rattus 
norvegicus 


long type PB-cadherin 


3907 


92 


1334 


gi302304 
4 


Enterococcus 
faecalis 


orfC 


302 


45 


1334 


gi496520 


Streptococcus 
pyogenes 


orfiota 


271 


39 


1334 


git 29570 
24 


Enterococcus 
faecalis 


hypothetical protein 


251 


38 


1335 


gil28384 
81 


Mus musculus 


putative 


491 


71 


1335 


gi207249 
7 


Mus musculus 


perforatorial protein PERF 15 


481 


68 


1335 


gil51952 
3 


Rattus 
norvegicus 


PERF 15 protein 


477 


68 


1336 


gi!04384 
54 


Homo sapiens 


cDNA: FLJ22171 fis, clone 
HRC00654. 


3750 


100 


1336 


gil04381 
50 


Homo sapiens 


cDNA: FLJ21935 fis, clone 
HEP04373. 


3734 


99 


1336 


gi 1 35295 
54 


Mus musculus 


Similar to hypothetical protein 
FLJ21935 


3203 


85 


1337 


gil28034 
45 


Homo sapiens 


clone MGC:2217 
IMAGE:3139026, mRNA, 
complete cds. 


463 


100 


1337 


gi729634 
4 


Drosophila 
melanogaster 


CG4186 gene product 


201 


47 


1337 


gi529133 


Saccharomyces 
cerevisiae 


Yhr116wp 


113 


41 


1338 


gi 104343 
52 


Homo sapiens 


cDNA FLJ12697 fis, clone 
NT2RP1000522, weakly similar 
to UBIQUITIN CARBOXYL- 
TERMINAL HYDROLASE 
DUB-1 (EC 3.1.2.15). 


6400 


99 


1338 


AAB951 
46 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17169. 


6396 


99 


1338 


AAB746 
71 


Homo sapiens 


INCY- Human protease and 
protease inhibitor PPIM-4. 


4021 


99 


1339 


gil08016 
26 


Macaca 
fascicularis 


hypothetical protein 


1668 


98 


1339 


gi 128367 
18 


Mus musculus 


putative 


1439 


84 


1339 


gil04383 
11 


Homo sapiens 


cDNA: FLJ22054 fis, clone 
HEP09634. 


1351 


99 


1340 


gi 175 120 
67 


Homo sapiens 


hypothetical protein 
DKFZp434D0421, clone 
MGC:20807 IMAGE:4330507, 
mRNA, complete cds. 


1903 


100 


1340 


gi 140437 
17 


Homo sapiens 


hypothetical protein 
DKFZp434D0421, clone 
MGC:14446 IMAGE:4304040, 
mRNA, complete cds. 


1903 


100 


1340 


gi!20531 
19 


Homo sapiens 


mRNA; cDNA 

DKFZp434D0421 (from clone 
DKFZp434D0421); complete 
cds. 


1903 


100 



WO 02/070539 PCT/US02/05095 



SEQ 
ID 


Hit ID 


Species 


Description 


S score 


Percent 
identity 


1341 


AAB087 
65 


Homo sapiens 


INCY- A human leukocyte and 
blood related protein (LBAP). 


716 


93 


1341 


AAM409 
91 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 5922. 


508 


93 


1341 


AAB747 
18 


Homo sapiens 


INCY- Human membrane 
associated protein MEMAP-24. 


456 


96 


1342 


AAB955 
63 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:18202. 


3214 


99 


1342 


gi 104359 
04 


Homo sapiens 


cDNA FLJ13782 fis, clone 
PLACE4000489, weakly similar 
to PROTEIN GRAINY-HEAD. 


3214 


99 


1342 


gil28327 
62 


Mus musculus 


putative 


2094 


94 


1343 


AAG893 
36 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 456. 


801 


100 


1343 


AAG813 
52 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:222. 


801 


100 


1343 


AAY914 
23 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 11 
SEQ ID NO:144. 


801 


100 


1344 


AAY949 
78 


Homo sapiens 


GEMY Human secreted protein 
clone pw337_6 2nd protein 
sequence SEQ ID NO:238. 


444 


100 


1344 


gi759453 
4 


Arabidopsis 
thaliana 


putative protein 


79 


31 


1345 


gi430989 
4 


Homo sapiens 


PAC clone RP4-555LI4 from 
7q34-q36, complete sequence. 


818 


100 


1345 


gi 176464 
48 


Mus musculus 


gammaN-crystallin 


724 


83 


1345 


gil76464 
46 


Homo sapiens 


gammaN-crystallin variant 
(CRYGN) mRNA, complete cds. 


600 


100 


1346 


AAY762 
16 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 93. 


225 


97 


1347 


AAY114 
47 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID No 269. 


210 


97 


1347 


gi761976 
9 


Streptomyces 
coelicolor A3(2) 


probable LacI-family 
transcriptional regulatory 
protein. 


66 


53 


1347 


gi755580 


Streptomyces 
lividans 


ORF-RDR; LacI homolog, 
similar to E. coli Lac repressor, 
Swiss-Prot Accession Number 
P03023 


66 


53 


1348 


gil73910 
52 


Homo sapiens 


clone MGC:9915 
IMAGE:3871205, mRNA, 
complete cds. 


2220 


100 


1348 


AAG741 
53 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:4917. 


1072 


98 


1348 


gi525318 


Haematobia 
irritans 


putative ATPase 


937 


44 


1349 


gi173910
52 


Homo sapiens 


clone MGC:9915 
IMAGE:3871205, mRNA, 
complete cds.


1919 


88 


1349 


AAG741 
53 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:4917. 


782 


76 


1349 


gi525318 


Haematobia
irritans


putative ATPase


749


39








1350 


AAB433 
15 


Homo sapiens 


CURA- Human ORFX 
ORF3079 polypeptide sequence 
SEQ ID NO:6158.


1463 


99


1350 


gi668259 
0 


Homo sapiens 


chromosome 14 clones RP11-
111016 and RP11-61F4
containing genes for nuclear 
receptor coactivator NCoA-62 
(nuclear receptor coactivator 
NCoA-62) gene, complete cds; 
and unknown gene, complete 
sequence. 


1463 


99 


1350 


AAW820 
03 


Homo sapiens 


GEMY Human foetal brain 
secreted protein fh3_6 
(alternative sequence). 


1249 


99 


1351 


gi128411
45 


Mus musculus 


putative 


1153 


93 


1351 


gi135292
12 


Homo sapiens 


Similar to RIKEN cDNA 
1810018M11 gene, clone 
MGC:12485 IMAGE:3932127,
mRNA, complete cds. 


1136 


99 


1351 


AAY962 
02 


Homo sapiens 


UYNY IkappaB kinase (IKK) 
binding protein, Y2H56. 


1126 


98 


1352 


gi151267
88 


Mus musculus 


Similar to ferritin heavy chain 


947 


100 


1352 


gi50954 


Mus musculus 


ferritin H subunit


947 


100 


1352 


gi50952 


Mus musculus 


ferritin heavy subunit (AA 1 - 
182) 


947 


100 


1353 


AAB705 
38 


Homo sapiens 


CURA- Human PR08 protein 
sequence SEQ ID NO:16.


2777 


98 


1353 


AAB705 
37 


Homo sapiens 


CURA- Human PR07 protein 
sequence SEQ ID NO:14.


2777 


98 


1353 


gi131857
25 


Homo sapiens 


n 1755 can be A, G, C, or T. 


2777 


98 


1354 


AAB535 
41 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein sequence SEQ 
ID NO:1081.


110 


73 


1354 


AAR729 
86 


Homo sapiens 


GENZ Creatine-kinase subunit 
B. 


110 


73 


1354 


gi29963 


Homo sapiens 


Human gene for creatine kinase 
B (EC 2.7.3.2). 


110 


73 


1355 


gi124074
05 


Homo sapiens 


tripartite motif protein TRIM9 
isoform beta (TRIM9) mRNA, 
complete cds; alternatively 
spliced. 


2831 


100 


1355 


gi167555
24 


Rattus 
norvegicus 


Spring 


2783 


97 


1355 


gi166580
3 


Homo sapiens 


Human mRNA for KIAA0282 
gene, partial cds. 


2575 




1356 


AAY536 
41 


Homo sapiens 


CHIR A bone marrow secreted 
protein designated BMS42. 


346 


98 


1356 


gi966315 
3 


Homo sapiens 


partial mRNA for transport- 
secretion protein 2.2, (TTS-2.2 
gene). 


346 


98 


1356 


gi966315 
1 


Homo sapiens 


partial mRNA for transport- 
secretion protein 2.1 (TTS-2.1 
gene). 


346 


98


1357


gi108011
97 


Homo sapiens 


heparanase-like protein HPA2b 
mRNA, complete cds. 


2785 


100 


1357 


gi151326
69 


Homo sapiens 


unnamed protein product 


2785 


100 


1357 


AAA910 
97_aa1


Homo sapiens 


INSI- Human heparanase, 
hnhpl, coding sequence. 


2626 


88 


1358 


gi633035 
8 


Homo sapiens 


mRNA for KIAA1193 protein,
partial cds. 


2885 


100 


1358 


AAU162 
16 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 1169. 


1216 


88 


1358 


AAY158 
35 


Homo sapiens 


PATE/ A human erl protein. 


751 


43 


1359 


AAB749 
45 


Homo sapiens 


YAMA Human ADAM type 
metal protease MDTS2 protein 
SEQ ID NO:10.


6065 


99 


1359 


gi114935
89 


Homo sapiens 


zinc metalloendopeptidase
(ADAMTS10) mRNA, partial 
cds. 


5940 


99 


1359 


AAB723 
00 


Homo sapiens 


HIRO/ Human ADAMTS-10 
alternative amino acid sequence. 


5484 


97 


1360 


gi173842
56 


Homo sapiens 


partial MUC5AC gene for mucin 
5, clone A. 


1291 


80 


1360 


gi563375 


Homo sapiens 


H.sapiens (JER47) MUC5AC
mRNA for mucin (partial). 


978 


91 


1360 


gi173842
54 


Homo sapiens 


partial mRNA for mucin 5 
(MUC5AC gene). 


905 


75 


1361 


AAM243 
95 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1920. 


634 


100 


1361 


AAY761 
79 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 56. 


634 


100 


1361 


AAB473 
27 


Homo sapiens 


CURA-FCTR4. 


74 


27 


1362 


gi150802
64 


Homo sapiens 


clone MGC:20279 
IMAGE:3949150, mRNA,
complete cds. 


1043 


100 


1362 


gi104390
83 


Homo sapiens 


cDNA: FLJ22623 fis, clone 
HSI05687. 


1043 


100 


1362 


gi173894
37 


Homo sapiens 


hypothetical protein FLJ22623, 
clone MGC:22173
IMAGE:4274089, mRNA, 
complete cds. 


1031 


99 


1363 


AAH787 
30_aa1


Homo sapiens 


HUMA- Human HIBCJ89 
serine/threonine phosphatase 
cDNA sequence. 


1635 


99 


1363 


AAU205 
55 


Homo sapiens 


HUMA- Human secreted 
protein, Seq ID No 547.


1635 


99 


1363 


AAU206 


Homo sapiens 


HUMA- Human secreted 
protein, Seq ID No 655. 


1635 


99 


1364 


AAB957 
00 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:18526.


752 


100 


1364 


gi153418
46 


Homo sapiens 


hypothetical protein FLJ14107, 
clone MGC:21030
IMAGE:4389733, mRNA,
complete cds. 


752 


100 


1364 


gi104364
85 


Homo sapiens 


cDNA FLJ14107 fis, clone
MAMMA1001252. 


752 


100 


1365 


AAY673
16


Homo sapiens


GEMY Human secreted protein
BL341_4 amino acid sequence.


367


100






1365 


AAY086 
25 


Homo sapiens 


GEMY Human secreted protein 
BL341_4.


367 


100 


1365 


AAW420 
18 


Homo sapiens 


JACO/ Clone BL341_4 protein.


363 


98 


1366 


AAU197 
15 


Homo sapiens 


HUMA- Human novel 
extracellular matrix protein, Seq 
ID No 365. 


2225 


99 


1366 


gi613679 
8 


Mus musculus 


synaptotagmin VIdeltaTM2 


2150 


96 


1366 


gi613679 
6 


Mus musculus 


synaptotagmin VIdeltaTM1


2150 


96 


1367 


AAU197 
15 


Homo sapiens 


HUMA- Human novel 
extracellular matrix protein, Seq 
ID No 365. 


2173 


97


1367 


gi613679 
8 


Mus musculus 


synaptotagmin VIdeltaTM2 


2098 


94 


1367 


gi613679 
6 


Mus musculus 


synaptotagmin VIdeltaTM1


2098 


94 


1368 


AAE0517 
5 


Homo sapiens 


INCY- Human drug 
metabolising enzyme (DME-6) 
protein. 


2614 


97 


1368 


AAU122 
25 


Homo sapiens 


GETH Human PRO4404 
polypeptide sequence. 


2614 


97 


1368 


gi119330
56 


Sus scrofa 


cytochrome P450 


1305 


50 


1369 


AAW781 
35 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 10 clone 
HPMGQ80. 


385 


100 


1369 


AAO023 
10 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 16202. 


76 


39 


1369 


AAO087 
72 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 22664. 


74 


35 


1370 


gi100473
01 


Homo sapiens 


mRNA for KIAA1613 protein, 
partial cds. 


3532 


100 


1370 


AAB361 
71 


Homo sapiens 


LEXI- Novel human transporter 
protein SEQ ID NO: 28. 


3412 


100 


1370 


AAB361 
70 


Homo sapiens 


LEXI- Novel human transporter 
protein SEQ ID NO: 26. 


3408 


99 


1371 


AAB416 
73 


Homo sapiens 


CURA- Human ORFX 
ORF1437 polypeptide sequence 
SEQ ID NO:2874.


1221 


96


1371 


AAB616 
11 


Homo sapiens 


PROT- Human protein 
HP03377. 


1220 


100 


1371 


AAE0365 
6 


Homo sapiens 


INCY- Human extracellular 
matrix and cell adhesion 
molecule-20 (XMAD-20). 


1220 


100 


1372 


gi529588 
2 


Mus musculus 


kinesin like protein 9 


3618 




1372 


AAB947 
68 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15849.


3257 


99 


1372 


gi104359
68 


Homo sapiens 


cDNA FLJ13832 fis, clone
THYRO1000666, highly similar
to Mus musculus mRNA for 
kinesin like protein 9. 


3257 


99 


1373 


AAH255 
68_aa1


Homo sapiens 


CURA- Nucleotide sequence of
an interferon omega-1 like
protein NOV2.


3294


100






1373 


AAG675 
23 


Homo sapiens 


SMIK Amino acid sequence of a 
human secreted polypeptide. 


3294 


100 


1373 


AAB844 
69 


Homo sapiens 


CURA- Amino acid sequence of 
an interferon omega-1 like 
protein NOV2. 


3294 


100 


1374 


gi592399 
2 


Homo sapiens 


Human DNA sequence from 
clone RP5-1043E3 on 
chromosome 6p21.1-21.2 
Contains part of a novel gene, an 
transcription factor E2F4
pseudogene, ESTs, STSs and 
GSSs, complete sequence. 


945 


99 


1374 


gi142457
13 


Giardia 
intestinalis 


kinesin-like protein 9 


543 


44 


1374 


gi150223
94 


Leishmania 
major 


possible kinesin-like protein 


531 


42 


1375 


AAG747 
79 


Homo sapiens 


HUMA- Human colon cancer
antigen protein SEQ ID 
NO:5543. 


1405 


99 


1375 


gi798126 
1 


Homo sapiens 


Human DNA sequence from 
clone RP1-50O24 on 
chromosome 1p35.1-35.3.
Contains the 3' end of the gene 
for a novel protein similar to C. 
elegans K07B1.7 (Tr:O01886), 
the gene for a novel protein 
(translation of cDNA 
NT2RM2001100 
(Em:AK001211)), the SFN gene
for stratifin (14-3-3 protein 
sigma), the gene for a novel 
protein with DHHC zinc finger 
domain, the gene for a novel 
protein (translation of cDNA 
KAT07271 (Em:AK000484)) 
and the gene for B120 (C1orf4)
(ARID DNA binding domain 
containing protein). Contains 
ESTs, STSs, GSSs and six 
putative CpG islands, complete 
sequence. 


1122 


67 


1375 


AAG812 
54 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:26.


1118 


72 


1376 


gi805223 
7 


Homo sapiens 


ASCL3 gene, CEGP1 gene,
C11orf14 gene, C11orf15 gene,
C11orf16 gene and C11orf17
gene. 


5605 


100


1376 


gi805219
0 




Cegp1 protein


5054 


89 


1376 


AAG675 
29 


Homo sapiens 


SMIK Amino acid sequence of a 
human secreted polypeptide. 


3226 


61 


1377 


gi104370
45 


Homo sapiens 


cDNA: FLJ21044 fis, clone
CAE11659. 


1663 


100 


1377 


gi420638 
6 


Mus musculus 


rig-1 protein 


1543 


72 


1377 


AAB570 
88 


Homo sapiens 


ROSE/ Human prostate cancer
antigen protein sequence SEQ
ID NO:1666.


1518


98






1378 


AAY026 
97 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 48 clone 
HTNBR95. 


165 


100 


1379 


AAY733 
86 


Homo sapiens 


INCY- HTRM clone 3279329 
protein sequence. 


529 


100 


1379 


AAB631 
62 


Homo sapiens 


ROSE/ Human secreted protein 
sequence encoded by gene 29 
SEQ ID NO:88.


363 


100 


1379 


AAB951 
24 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17122.


83 


32 


1380 


gi101981
15 


Homo sapiens 


2P domain potassium channel 
TREK2 (KCNK10) mRNA, 
complete cds. 


2760 


100 


1380 


gi845290
0


Rattus 
norvegicus 


potassium channel TREK-2 


2555 


95 


1380 


gi458479 
9 


Mus musculus 


TREK-1 K+ channel subunit 


1238 


64 


1381 


gi132766
55 


Homo sapiens 


mRNA; cDNA
DKFZp761M0423 (from clone
DKFZp761M0423); complete 
cds. 


2626 


99 


1381 


AAE0436 
1 


Homo sapiens 


INCY- Human kinase (PKIN)-2. 


2588 


97 


1381 


gi183616
1 


Rattus sp. 


Ca2+/calmodulin-dependent
protein kinase IV kinase 
isoform; CaM-kinase kinase 
alpha 


2468 


93


1382 


gi123827
81 


Homo sapiens 


OSBP-related protein 4 mRNA, 
complete cds. 


1124 


100 


1382 


gi133592
01 


Homo sapiens 


mRNA for KIAA1664 protein, 
partial cds. 


1036 


100 


1382 


gi142098
40 


Homo sapiens 


oxysterol binding protein 2 
(OSBP2) gene, complete cds. 


919 


100 


1383 


gi128055
53 


Mus musculus 


Unknown (protein for 
MGC:7583) 


792 


99 


1383 


gi128586
56 


Mus musculus 


putative 


787 


98 


1383 


AAM238 
65 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1390. 


672 


83 


1384 


gi300292 
5 


Homo sapiens 


T cell receptor beta chain 
(TCRBV13S1-TCRBJ2S1)
mRNA, complete cds. 


679 


73 


1384 


gi298250 
8 


Homo sapiens 


mRNA for TCR beta chain, 
specific for Mage 3/HLA-A2. 


667 


71 


1384 


AAM240 
51 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1576. 


655 


100 


1385 


gi138792
62 


Mus musculus 


RIKEN cDNA 0610011E17
gene 


710 


97


1385 


gi128503
53 


Mus musculus 


putative 


710 


97 


1385 


AAB429 
05 


Homo sapiens 


CURA- Human ORFX 
ORF2669 polypeptide sequence 
SEQ ID NO:5338.


582 


79 


1386 


AAB950 
62 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 16896. 


741 


99 


1386 


AAM241
27


Homo sapiens


HYSE- Human EST encoded
protein SEQ ID NO: 1652.


681


100


1386



AAV873
28



Homo sapiens



INCY- Human signal peptide
containing protein HSPP-105
SEQ ID NO:105.



681



100



1387 



AAM235 
76 



Homo sapiens 



HYSE- Human EST encoded 
protein SEQ ID NO: 1101. 



691 



100 



99 



ZYMO Human zalpha48.



306 



68 



AAE0658 
0 



Homo sapiens 



SAGA Human protein having 
hydrophobic domain, HP10786. 



306 



68 



AAB718 
63 



Homo sapiens 



MILL- Human hl5571 GPCR. 



6511 



97 



gi159874
91 



Homo sapiens 



tumor endothelial marker 5 
precursor (TEM5) mRNA, 
complete cds. 



6511 



97 



gi159874
99 



Mus musculus



tumor endothelial marker 5 
precursor 



5693 



85 



AAE0135 
4 



Homo sapiens 



HUMA- Human gene 3 encoded 
secreted protein HOHBL42, 
SEQ ID NO:76.



3747 



99 



gi431420 



Mus musculus



MPS1 protein 



2734 



77 



gi505204 
8 



AAM242 
00 



Rattus 
norvegicus 



Mpg-1 protein 



2672 



Homo sapiens 



AAY195 



Homo sapiens 



HYSE- Human EST encoded 
protein SEQ ID NO: 1725. 



329 



HUMA- Amino acid sequence 
of a human secreted protein. 



329 



75 



100 



100 



AAG742 
49 



Homo sapiens 



HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:5013. 



98 



gi128351
18 



Mus musculus



putative 



884 



95 



gi730176
3 



Drosophila 
melanogaster 



CG11900 gene product



497 



59 



gi128032
69 



Homo sapiens 



Similar to CG10641 gene
product, clone MGC:3052 
IMAGE:3343900, mRNA,
complete cds. 



701 



100 



gi104419
42 



Homo sapiens 



clone PP3051 unknown mRNA. 



701 



100 



AAB954 

96 

AAY125 



Homo sapiens 



HELI- Human protein sequence 
SEQ ID NO:18039.



698 



99 



12 



Homo sapiens 



AAB948 
33 



Homo sapiens 



GEST Human 5' EST secreted 
protein SEQ ID NO:543.



320 



gi104362
42 

AAB428 



Homo sapiens 



HELI- Human protein sequence 
SEQ ID NO: 15998. 



2735 



cDNA FLJ13941 fis, clone
Y79AA1000850. 



2735 



98 



100 



100 



18 



Homo sapiens 



AAB948 
33 



Homo sapiens 



CURA- Human ORFX 
ORF2582 polypeptide sequence 
SEQ ID NO:5164.



1115 



HELI- Human protein sequence 
SEQ ID NO:15998.



2713 



99 



97 



gi104362
42 



Homo sapiens 



cDNA FLJ13941 fis, clone
Y79AA1000850. 



2713 



97 



AAB428 
18 



Homo sapiens 



CURA- Human ORFX
ORF2582 polypeptide sequence
SEQ ID NO:5164.



1093



94






1396 


gi726392 
8 


Homo sapiens 


Human DNA sequence from 
clone RP1-61A9 on 
chromosome 1p35.2-36.13
Contains part of the EPHB2 
gene for tyrosine-protein kinase, 
the gene KIAA0478 for a C2H2 
type zinc finger gene, ESTs, 
STSs, GSSs and three putative 
CpG Islands, complete sequence. 


5326 


99 


1396 


AAE0436 
2 


Homo sapiens 


INCY- Human kinase (PKIN)-3. 


5308 


99 


1396 


AAU006 
91 


Homo sapiens 


CURA- Ephrin type-A receptor 
8-like protein.


5259 


99 


1397 


gi104376
26 


Homo sapiens 


cDNA: FLJ21511 fis, clone
COL05748. 


3713 


99 


1397 


gi167414
00 


Mus musculus 


Similar to hypothetical protein 
FLJ21511


3125 


82 


1397 


gi145889
31 


Saccharomyces 
cerevisiae 


hypothetical protein 


690 


29 


1398 


AAB947 
13 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15714.


2869 


100 


1398 


gi104357
45 


Homo sapiens 


cDNA FLJ13664 fis, clone
PLACE1011649. 


2869 


100 


1398 


gi168772
91 


Homo sapiens 


Similar to hypothetical protein 
24432, clone MGC:21034 
IMAGE:4400396, mRNA, 
complete cds. 


2843 


99 


1399 


AAB829 
40 


Homo sapiens 


UYNY Human androgen 
receptor trapped protein 5 
(ARTS). 


1429 


100 


1399 


AAB560 
85 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 9 
SEQ ID NO:179.


1429 


100 


1399 


gi104392
04 


Homo sapiens 


cDNA: FLJ22709 fis, clone 
HSI13338. 


1429 


100 


1400 


gi104396
25 


Homo sapiens 


cDNA: FLJ23056 fis, clone
LNG03287. 


1190


100


1400 


gi729573 
2 


Drosophila 
melanogaster 


ft gene product 


185 


27 


1400 


gi157409


Drosophila 
melanogaster 


fat protein 


185 


27 


1401 


gi139362
85 


Mus musculus 


TRH4 


1332 


61 


1401 


gi128455
40 


Mus musculus 


putative 


1330 


61


1401 


AAU007 


Homo sapiens 


INCY- Human apoptosis
protein, Arur-z.


1092 


65 


1402 


AAU004 
75 


Homo sapiens 


MILL- Human INTERCEPT 
394 alternative form protein. 


4272 


97 


1402 


AAU004 
73 


Homo sapiens 


MILL- Human INTERCEPT 
394 protein. 


4089 


99 


1402 


gi104384
50 


Homo sapiens 


cDNA: FLJ22169 fis, clone 
HRC00632. 


3505 


99 


1403 


gi107988
04 


Homo sapiens 


HCMOGT-1 mRNA for sperm 
antigen, complete cds. 


3737 


98 


1403 


ABB1229
7


Homo sapiens


HYSE- Human secreted protein
homologue, SEQ ID NO:2667.


2811


100






1403 


AAM252 
55 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:770.


2778 


98 


1404 


AAB954 
25 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17833.


4871 


100 


1404 


gi104354
07


Homo sapiens 


cDNA FLJ13465 fis, clone
PLACE1003493, weakly similar 
to ENDOTHELIAL CELL 
MULTIMERIN PRECURSOR. 


4871 


100 


1404 


AAY363 
00 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 77. 


2472 


98 


1405 


gi100472
11 


Homo sapiens 


mRNA for KIAA1573 protein, 
partial cds. 


6270 


100 


1405 


gi143883
34 


Macaca 
fascicularis 


hypothetical protein 


5174 


99 


1405 


AAB958 
83 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:18994.


3679 


100 


1406 


gi147146
04 


Homo sapiens 


clone MGC:17248
IMAGE:4215164, mRNA,
complete cds. 


3291 


96 


1406 


AAW803 
18 


Homo sapiens 


SMIK Neurodegenerative 
polypeptide HHPDZ65var. 


2893 


100 


1406 


4 


Homo sapiens 


mRNA for putative acid-sensing 
ion channel (ASIC4 gene). 


2893 


100 


1407 


AAB656 
97 


Homo sapiens 


SUGE- Novel protein kinase, 
SEQ ID NO: 225. 


1647 


100 


1407 


gi140439
28 


Homo sapiens 


clone IMAGE:4139786,
mRNA, partial cds. 


1117 


100 


1407 


AAG024 
79 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6560. 


442 


100 


1408 


AAR499 
43 


Homo sapiens 


YAMA/ Human hippocampal 
cholinergic neurotrophic peptide 
precursor. 


881 


89 


1408 


gi704465 


Homo sapiens 


H.sapiens mRNA for 
phosphatidylethanolamine 
binding protein. 


881 


89 


1408 


gi435638 


Homo sapiens 


Human mRNA for human 
homologue of rat
phosphatidylethanolamine 
binding protein, complete cds. 


881 


89 


1409 


AAB955 
17 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:18089.


4520 


99 


1409 


gi104357
76 


Homo sapiens 


cDNA FLJ13687 fis, clone
PLACE2000061. 


4520 


99 


1409 


gi726465 
3 


Mus musculus 


Kiaa0575 


1867 


48 


1410 


gi170283
41 


Homo sapiens 


hypothetical protein FLJ21820, 
clone MGC:l4x?2
IMAGE:3611020, mRNA,
complete cds. 


1732 


100


1410 


gi104379
97 


Homo sapiens 


cDNA: FLJ21820 fis, clone
HEP01232. 


1732 


100 


1410 


gi167697
18 


Drosophila 
melanogaster 


LP01162p 


437 


33 


1411 


gi150797
29 


Homo sapiens 


hypothetical protein FLJ21125,
clone MGC:14948
IMAGE:4303449, mRNA,
complete cds.


1530


99






1411 


gi104371
58 


Homo sapiens 


cDNA: FLJ21125 fis, clone
CAS06077. 


1530 


99 


1411 


AAY946 
74 


Homo sapiens 


ZYMO Human zsig83 mature 
protein sequence. 


142 


34 


1412 


gi104381
26 


Homo sapiens 


cDNA: FLJ21918 fis, clone 
HEP04006. 


3799 


100 


1412 


AAY844 
40 


Homo sapiens 


INCY- Amino acid sequence of 
a human RNA-associated 
protein. 


2085 


59 


1412 


gi702009
4 


Homo sapiens 


cDNA FLJ20171 fis, clone
COL09761. 


1246 


66 


1413 


gi140310
72 


Homo sapiens 


Human DNA sequence from 
clone RP3-331H24 on 
chromosome 6 Contains a 
putative novel gene, part of the 
gene for hypothetical protein 
FLJ21079, similar to opioid 
growth factor receptor, ESTs, 
STSs, GSSs and a CpG island, 
complete sequence. 


1307 


99 


1413 


gi104370
94 


Homo sapiens 


cDNA: FLJ21079 fis, clone 
CAS02253.


1307 


99


1413 


gi128434
68 


Mus musculus 


putative 


922 


73 


1414 


AAB943 
98 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14968.


3638 


100 


1414 


gi104347
85 


Homo sapiens 


cDNA FLJ12987 fis, clone
NT2RP3000068, weakly similar
to SON OF SEVENLESS 
PROTEIN HOMOLOG 1. 


3638 


100 


1414 


AAB956 
39 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 18376.


3001 


100 


1415 


gi136234
91 


Homo sapiens 


clone MGC:13125
IMAGE:4111572, mRNA, 
complete cds. 


3054 


100 


1415 


gi165537
93 


Homo sapiens 


cDNA FLJ25103 fis, clone
CBR01405. 


1586 


74 


1415 


AAM662 
79 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 26585. 


1301 


100 


1416 


ABB1167
5 


Homo sapiens 


HYSE- Human secreted protein 
homologue, SEQ ID NO:2045. 


1238 


99 


1416 


gi159289
21 


Homo sapiens 


hypothetical protein FLJ14393, 
clone MGC:17935
IMAGE:3916148, mRNA, 
complete cds. 


1238 


99 


1416 


AAY452 
72 


Homo sapiens 


HUMA- Human secreted protein
encoded from gene 16. 


1236 


99 


1417 


gi136232
49 


Homo sapiens 


Similar to RIKEN cDNA 
31 10082117 gene, clone 
MGC:11257 IMAGE:3941780, 
mRNA, complete cds. 


945 


95 


1417 


gi128520
07 


Mus musculus 


putative 


466 


61 


1417 


AAW679 
36 


Homo sapiens 


HUMA- Fragment of human
secreted protein encoded by
gene 57.


329


98


1418


gi139382
74


Homo sapiens


clone MGC:15548
IMAGE:3051320, mRNA,
complete cds.


3136 


99 


1418 


AAU163 
73 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 1326. 


963 


100 


1418 


AAU159 
22 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 875. 


963 


100 


1419 


AAB930 
81 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:11914.


1812 


98 


1419 


gi140424
19 


Homo sapiens 


cDNA FLJ14712 fis, clone
NT2RP3000825, weakly similar 
to NEUROGENIC LOCUS 
NOTCH 3 PROTEIN. 


1812 


98 


1419 


AAY727 
13 


Homo sapiens 


HUMA- HWAAQ40 clone 
human attractin-like protein. 


1212 


99 


1420 


AAU124 
18 


Homo sapiens 


GETH Human PRO1275
polypeptide sequence. 


643 


98 


1420 


AAY993 
79 


Homo sapiens 


GETH Human PRO1275
(UNQ645) amino acid sequence
SEQ ID NO:136.


643 


98 


1420 


AAB256 
83 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 19 
SEQ ID NO:72.


643 


98 


1421 


gi104387
12 


Homo sapiens 


cDNA: FLJ22358 fis, clone
HRC06415. 


3025 


100 


1421 


gi142111
39 


Homo sapiens 


NADPH oxidase 5 gamma 
mRNA, complete cds. 


3019 


99 


1421 


gi142111
37 


Homo sapiens 


NADPH oxidase 5 alpha mRNA, 
complete cds. 


3019 


99 


1422 


gi126583
05 


Homo sapiens 


kappa B and V(D)J 
recombination signal sequences 
binding protein (KRC) mRNA, 
complete cds. 


8934 


99 


1422 


gi100471
75 


Homo sapiens 


mRNA for KIAA1555 protein,
partial cds. 


8588 


99 


1422 


gi137788
6 


Mus musculus 


DNA binding protein Rc 


6216 


76 


1423 


gi173892
08 


Homo sapiens 


clone MGC:16889
IMAGE:3883022, mRNA,
complete cds. 


2465 


100 


1423 


gi152781
67 


Homo sapiens 


differentiation-related DIF14
long form (DIF14) mRNA, 
complete cds, alternatively 
spliced. 


2448 


99 


1423 


gi965122 
0 


Mus musculus 


LMBR1 long form 


2391 


96 


1424 


gi104370
78 


Homo sapiens


cDNA: FLJ21069 fis, clone 
CAS01594. 


2523 


99 


1424 


gil59297 
78 


Homo sapiens 


hypothetical protein FLJ21069, 
clone MGC:21026 
IMAGE:4431888, mRNA,
complete cds. 


2517 


99 


1424 


gi128597
74 


Mus musculus 


putative 


2182 


86 


1425 


AAM937 
35 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3701.


1364 


99


1425


gi128553
79 


Mus musculus 


putative 


1332 


87 


1425 


gi163593
63 


Mus musculus 


Similar to hypothetical protein 
MGC12921 


652 


53 


1426 


gi165536
18 


Homo sapiens 


cDNA FLJ33140 fis, clone
UTERU1000160, moderately 
similar to ZINC FINGER 
PROTEIN 191. 


2173 


99 


1426 


gi104400
85 


Homo sapiens 


cDNA: FLJ23407 fis, clone
HEP19601. 


1146 


100 


1426 


gi142501
46 


Homo sapiens 


hypothetical protein FLJ23407, 
clone MGC:14819 
IMAGE:4248596, mRNA, 
complete cds. 


1143 


99 


1427 


gi127341
04 


Homo sapiens 


Human DNA sequence from 
clone RP11-371L19 on 
chromosome 20. Contains two 
novel genes, the gene for a novel 
protein similar to 40S ribosomal 
protein S10 (RPS10), ESTs, 
STSs, GSSs and five CpG 
islands, complete sequence. 


2452 


100 


1427 


gi155241
16 


Homo sapiens 


unnamed protein product 


2431 


98 


1427 


gi146024
88 


Homo sapiens 


clone MGC:10698
IMAGE:3689286, mRNA, 
complete cds. 


2395 


98 


1428 


AAG675 
09 


Homo sapiens 


SMIK Amino acid sequence of a 
human secreted polypeptide. 


4286 


100 


1428 


gi156208
67 


Homo sapiens 


mRNA for KIAA1904 protein,
partial cds. 


4272 


99 


1428 


gi319197 
5 


Homo sapiens 


Human DNA sequence from 
clone RP1-63G5 on 
chromosome 22q12.3-13.1
Contains the 3' part of the 
PSCD4 gene for a human SEC7 
homolog B2-1 (cytohesin-2,
Arno, ARF exchange factor) 
LIKE protein, a novel gene and 
the gene coding for a Leucine 
rich protein. Contains ESTs, 
STSs, GSSs and three putative 
CpG islands, complete sequence. 


953 


100 


1429 


AAG023 
49 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6430. 


468 


100 


1429 


gi177046
2 


Homo sapiens 


H.sapiens mRNA for M-phase 
phosphoprotein, mpp6. 


468 


100 


1429 


gi150296
28 


Homo sapiens 


Similar to M-phase 
phosphoprotein 6, clone 
MGC:13538 IMAGE:4287267, 
mRNA, complete cds. 


468 


100 


1430 


AAB883 
77 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone 
PSEC0113. 


239 


100 


1430 


AAB089 
04 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 14 
SEQIDNO:61. 


239 


100 


1430 


gi142726
08


Homo sapiens


unnamed protein product


239


100










1431 


gi999289 
3 


Homo sapiens 


phosphoinositol 3-phosphate 
binding protein-1 (PEPP1) 
mRNA, complete cds. 


3906 


95 


1431 


AAB420 
86 


Homo sapiens 


CURA- Human ORFX 
ORF1850 polypeptide sequence 
SEQ ID NO:3700.


427 


71 


1431 


gi458958 
2 


Homo sapiens 


mRNA for KIAA0969 protein, 
complete cds. 


256 


31 


1432 


gi999289 
3 


Homo sapiens 


phosphoinositol 3-phosphate 
binding protein-1 (PEPP1) 
mRNA, complete cds. 


4152 


99 


1432 


AAB420 
86 


Homo sapiens 


CURA- Human ORFX 
ORF1850 polypeptide sequence 
SEQ ID NO:3700.


672 


99 


1432 


AAO125
92 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 26484. 


475 


100 


1433 


AAY949 
47 


Homo sapiens 


GEMY Human secreted protein 
clone cw1292_8 protein
sequence SEQ ID NO: 100. 


408 


100 


1433 


AAB651 
95 


Homo sapiens 


GETH Human PRO830 
(UNQ470) protein sequence 
SEQ ID NO:175.


215 


64 


1433 


AAY666 
72 


Homo sapiens 


GETH Membrane-bound protein 
PRO830. 


215 


64 


1434 


gi165538
18 


Homo sapiens 


cDNA FLJ25124 fis, clone
CBR06414. 


1573 


100 


1434 


AAG021 
37 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6218. 


554 


98 


1434 


AAH755 
22_aa1


Homo sapiens 


SHAN- Human myosin heavy 
chain 74 encoding cDNA. 


275 


22 


1435 


AAF8417 
1_aa1


Homo sapiens 


CHUG- Human OATP-B coding 
sequence. 


188 


92 


1435 


AAZ9240 
3_aa1


Homo sapiens 


SCHE cDNA encoding human 
DC-PGT. 


188


92 


1435 


AAC618 
83_aa1


Homo sapiens 


CHIR cDNA encoding a human 
secreted protein. 


188 


92 


1436 


gi112304
87 


Rattus 
norvegicus 


NTPDase6 


501 


96 


1436 


AAB722 
42 


Homo sapiens 


HYSE- Mature human CD39 
like protein CD39-L2 amino 
acid sequence. 


414 


80 


1436 


AAB722 
41 


Homo sapiens 


HYSE- Human CD39 like 
protein CD39-L2 amino acid 
sequence. 


414 


80 


1437 


gi724322 
9 


Homo sapiens 


mRNA for KIAA1424 protein, 
partial cds. 


6604 


99 


1437 


AAR070 
11 


Homo sapiens


SHAN- Human G-protein 
activating protein 129 SEQ ID 
NO:2. 


6021 


99 


1437 


AAB416 
60 


Homo sapiens 


CURA- Human ORFX 
ORF1424 polypeptide sequence 
SEQ ID NO:2848.


4377 


99 


1438 


AAB088 
94 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 4 
SEQ ID NO:51.


211


69 


1438 


gi156262
57


Buffalopox virus


p8 protein homologue


69


31










1438 


gi583067 
8 


variola minor 
virus 


A14L protein 


Do 


9*7 


1439 


AAG017 
13 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 5794. 


o/u 


yy 


1439 


gi120019
70 


Homo sapiens 


clone 015hl2 My015 protein 
mRNA, complete cds. 


495 


96 


1439 


gi996391 
0 


Xenopus laevis 


Churchill protein 


495 


71 


1440 


gi140178
31 


Homo sapiens 


mRNA for KIAA1807 protein, 
partial cds. 


1751 


100 


1440 


gi104388
85 


Homo sapiens 


cDNA: FLJ22479 fis, clone 
HRC10831. 


1524 


100 


1440 


gi144245
58 


Homo sapiens 


KIAA0239 protein, clone 
IMAGE:4301096, mRNA, 
partial cds. 


157 


28 


1441 


AAB436 
17 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO: 1062. 


1481 


93 


1441 


AAP9191 
3 


Homo sapiens 


BEHW Anticoagulative PP4X. 


1481 


93 


1441 


gi189617


Homo sapiens


Human protein PP4-X mRNA, 
complete cds. 


1481 


93 


1442 


AAE0379 
0 


Homo sapiens 


HUMA- Human gene 9 encoded 
secreted protein fragment, SEQ 
ID NO:60. 


391 


100 


1442 


AAE0378 
5 


Homo sapiens 


HUMA- Human gene 9 encoded 
secreted protein HMWDW68, 
SEQ ID NO:55.


391 


100 


1442 


AAY734 
25 


Homo sapiens 


GEMY Human secreted protein 
clone yj3_l protein sequence 
SEQ ID NO:72.


391 


100 


1443 


gi388232 
9 


Homo sapiens 


mRNA for KIAA0804 protein, 
partial cds. 


6282 


100 


1443 


AAB943 
56 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 14879. 


4093 


99 


1443 


gi104346
28 


Homo sapiens 


cDNA FLJ12883 fis, clone
NT2RP2003981, weakly similar 
to VACUOLAR PROTEIN 
SORTING-ASSOCIATED 
PROTEIN VPS8. 


4093 


99 


1444 


gi125396
15 


Homo sapiens 


AKAP-associated sperm protein 
(ASP) mRNA, complete cds. 


1215 


99 


1444 


gi157790
77 


Homo sapiens 


AKAP-associated sperm 
protein, clone MGC:26950 
IMAGE:4820798, mRNA, 
complete cds. 


1212 


99 


1444 


gil 15/82 

18 


Mus musculus 


cAMP-dependent protein kinase
regulatory subunit


937 


78 


1445 


AAB435 
99 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO: 1044. 


1283 


100 


1445 


gi141249
28 


Homo sapiens 


clone MGC:3644 
IMAGE:2966331,mRNA, 
complete cds. 


1219 


100 


1445 


gi140438
53 


Homo sapiens 


thymidine kinase 1, soluble, 
clone MGC:14441


1219 


100 
IMAGE:4303880, mRNA,
complete cds. 






1446 


AAE0040 
4 


Homo sapiens 


ZYMO Human 
phosphodiesterase zcytorl3 
protein.


2733 


100 


1446 


gi139223


Homo sapiens 


unnamed protein product 


2733 


100 


1446 


AAM255 
48 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO: 1063. 


2584 


100 


1447 


gi376644 
3 


Homo sapiens 


QDPR gene, exon 1 and joined 
CDS. 


1069 


87 


1447 


gi30819 


Homo sapiens 


Human mRNA for 
dihydropteridine reductase 
(hDHPR). 


1069 


87 


1447 


gi181553


Homo sapiens 


Human dihydropteridine 
reductase (hDHPR) mRNA, 
complete cds. 


1069 


87 


1448 


gi132766
31 


Homo sapiens 


mRNA; cDNA DKFZp761F241
(from clone DKFZp761F241); 
complete cds. 


747 


100 


1448 


gi128448
72 


Mus musculus 


putative 


650 


87 


1448 


AAY597 
95 


Homo sapiens 


META- Human normal ovarian 
tissue derived protein 72, 


554 


100 


1449 


AAB429 
06 


Homo sapiens 


CURA- Human ORFX 
ORF2670 polypeptide sequence 
SEQIDNO:5340. 


834 


100 


1449 


gi131951
51 


Homo sapiens 


transcription factor TZP (TZP) 
mRNA, complete cds. 


534 


54


1449 


gi102414
61 


Homo sapiens 


Human DNA sequence from 
clone RP5-1121G12 on 
chromosome 20 Contains the 3' 
end of a gene encoding the 
hepatocellular carcinoma- 
associated antigen 58 (HCA58), 
the SCAND1 gene encoding the 
domain-containing 1 protein, a 
novel gene, 2 CpG islands, 
ESTs, STSs and GSSs, complete 
sequence. 


534 


54 


1450 


AAY120 

01 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID NO: 334. 


265 


97 


1450 


gi962621 

0 
0 


Beet curly top 
virus 


ORF20.1 > [Beet curly top 


63 


27 


1451 


AAG018 
78 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 5959. 


348 


92 


1451


58 


Homo sapiens 


HUMA- Human pancreatic 
cancer antigen protein sequence
SEQ ID NO:610.


225 


91 


1452 


AAO083 
54 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 22246. 


1451 


89 


1452 


AAY733 
84 


Homo sapiens 


INCY- HTRM clone 2284580 
protein sequence. 


1451 


89 


1452 


gi136999
02 


Homo sapiens 


mRNA for nucleolar 
phosphoprotein Nopp34, 
complete cds. 


1451 


89 


1453 


gi142499


Homo sapiens 


Similar to hypothetical protein 


2387 


100 
53




FLJ22376, clone MGC:16044 
IMAGE:3610443, mRNA, 
complete cds. 






1453 


gi167405
59 


Homo sapiens 


clone MGC13247 
IMAGE:4040497, mRNA,
complete cds. 


1067 


100 


1453 


gi165517
33 


Homo sapiens 


cDNA FLJ31791 fis, clone
NT2RI2008749, weakly similar 
to SPLICEOSOME 
ASSOCIATED PROTEIN 49. 


1023 


53 


1454 


AAM663 
21 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 26627. 


883 


50 


1454 


AAM539 
33 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded 
protein SEQ ID NO: 26038. 


883 


50 


1454 


gi449038 
8 


Felis silvestris


polyprotein 


672 


44 


1455 


AAB948 
15 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15961.


719 


100 


1455 


gi104362
05 


Homo sapiens 


cDNA FLJ13920 fis, clone
Y79AA1000420. 


719 


100 


1455 


gi128475
56 


Mus musculus 


putative 


169 


46 


1456 


gi33044 


Homo sapiens 


Human mRNA for insulin-like 
growth factor II (clone P2 1 ). 


742 


97 


1456 


gi182528


Homo sapiens 


Human preproinsulin-like 
growth factor II (IGF-II) variant 
mRNA, complete cds. 


717 


78 


1456 


AAY703 
64 


Homo sapiens 


UYLO- Insulin-like growth 
factor II. 


714 


78 


1457 


AAY993 
51 


Homo sapiens 


GETH Human PRO1481
(UNQ750) amino acid sequence 
SEQ ID NO:41.


1725 


100 


1457 


AAB102 
59 


Homo sapiens 


GEMY Human fetal placenta 
protein fragment B Al 76_1 i. 


1631 


88 


1457 


AAB102 
51 


Homo sapiens 


GEMY Human adult testes 
protein fragment AJ142 li. 


761 


97 


1458 


AAB436 
07 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO: 1052. 


664 


88 


1458 


gi695360 


Homo sapiens 


nuclear-encoded mitochondrial 
cytochrome c oxidase Va 
subunit mRNA, complete cds. 


658 


87 


1458 


gi128585
80 


Mus musculus 


putative 


544 


73 


1459 


gi754922 
3 


Mus musculus 


PALS1 


3386 


96 


1459 


AAB941 
80 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14494. 


2590 


99 


1459 


gi104342
10 


Homo sapiens 


cDNA FLJ12615 fis, clone
NT2RM4001629, weakly similar 
to MAGUK P55 SUBFAMILY 
MEMBER 3. 


2590 


99 


1460 


gi126979
87 


Homo sapiens 


mRNA for KIAA1721 protein, 
partial cds. 


3859 


99 


1460 


AAB944 


Homo sapiens 


HELI- Human protein sequence 


3853 


99 
35




SEQ ID NO:15056.






1460 


gi104348
78 


Homo sapiens 


cDNA FLJ13046 fis, clone


3853 


99 


1461 


gi152144
23 


Homo sapiens 


clone IMAGE:4563921, 
mRNA, partial cds. 


2603 


100 


1461 


oi|70A17 

49 


Homo sapiens 


unnamed protein product 


2603 


100 


1461 


gi167407


Mus musculus


Similar to hexokinase 1 


2411 


91 


1462 


AAG755 

7Q 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:6343. 


714 


85 


1462 


AAB435 
66 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO:1011.


714 


85 


1462 


gi239865 

7 


Homo sapiens 


H.sapiens mRNA translocon-
associated protein delta subunit 
precursor. 


714 


85 


1463


A Af3QQ1 
AAUoyl 

28 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 248. 


2076 


100 


1463 


gi163068
50 


Homo sapiens 


hypothetical protein FLJ22637, 
clone MGC:2443 
IMAGE:2821972, mRNA, 
complete cds. 


2076 


100 


1463


04 


Homo sapiens 


cDNA: FLJ22637 fis, clone
HSI06677. 


2076 


100 


1464


A A W/I/IO 

A A W C4z 

62 


Homo sapiens 


BGHM Human neutrophil 
elastase. 


1326 


96 




A A oomi 
AAroUJJ 

5 


Homo sapiens 


TORA ) TORAY IND INC 
(AOKI/ Sequence of serine 
protease (SP) of human myeloid 
cell origin and leader peptide.


1326 


96 


1464 




Homo sapiens 


Human neutrophil elastase gene; 
exon 5. 


1326 


96 


1465 


AAY1 n 
85 


Homo sapiens 


GEST Human 5' EST secreted
protein SEQ ID No 207. 


220 


100 


1465 


AAB677 
83 


Homo sapiens 


INRM Amino acid sequence of a 
human thyroid NADPH-oxidase. 


75 


33 


1465 


AAM245 


Homo sapiens 


CORI- Colon tumour related 
amino acid sequence for C799P. 


75 


33 


1466


gi289575 
8 


Bos taurus 


phosphatidic acid-preferring
phospholipase A1


4245 


91 




gnzoy /i/ 
55 


Homo sapiens 


mRNA for KIAA1705 protein, 
partial cds.


2582 


99 


j tuy 


84 


Homo sapiens 


cDNA FLJ25408 fis, clone
TST02965, highly similar to Bos
taurus phosphatidic acid-
preferring phospholipase A1
mRNA.


2378 


100 


1467 


gi147900
25 


Homo sapiens 


clone MGOSm 
IMAGE:3876839,mRNA, 
complete cds. 


1488 


100 


1467 


gi167686
82 


Drosophila 
melanogaster 


HL02815p


1155 


49 


1467 


gi107269
44 


Drosophila 
melanogaster 


CG11306 gene product


1155 


49 


1468 


AAB747 


Homo sapiens 


HUMA- Human secreted protein 


902


91 
60




sequence encoded by gene 18 
SEQ ID NO:69.






1468 


AAB747 
59 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 18 
SEQ ID NO:68.


902 


91 


1468 


AAB747 
50 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 18 
SEQ ID NO:59.


902 


91 


1469 


AAB747 
60 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 18
SEQ ID NO:69.


1006 


96 


1469 


AAB747 
59 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 18 
SEQ ID NO:68.


1006 


96 


1469 


AAB747 
50 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 18 
SEQ ID NO:59.


1006 


96 


1470 


AAW131 
08 


Homo sapiens 


ONYX- Human 14-3-3 beta or 
HS1 1054. 


1225 


95 


1470 


gi279155 
2 


Homo sapiens 


Human DNA sequence from 
clone RP1-148E22 on 
chromosome 20q12-13.12
Contains the YWHAB gene
encoding tyrosine 3-
monooxygenase/tryptophan 5-
monooxygenase activation
protein, beta polypeptide, a 
novel gene similar to PABPC1 
(poly (A)-binding protein, 
cytoplasmic 1), 2 CpG islands, 
ESTs, STSs and GSSs, complete 
sequence. 


1225 


95 


1470 


gi23114


Homo sapiens 


H.sapiens mRNA for HS1 
protein. 


1225 


95 


1471 


AAB948 
34 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:16000.


2638 


100 


1471 


gi104362
44 


Homo sapiens 


cDNA FLJ13942 fis, clone
Y79AA1000962, weakly similar 
to MYOSIN HEAVY CHAIN, 
NON-MUSCLE. 


2638 


100 


1471 


gi142905
66 


Homo sapiens 


hypothetical protein FLJ13942,
clone MGC:9884 
IMAGE:3867690,mRNA, 
complete cds. 


1501 


100 


1472 


AAB105 
50 


Homo sapiens 


HOFM/ Human aspartate 
protease psl 4 protein. 


1925 


100 


1472 


AAB088 
60 


Homo sapiens 


INCY- Amino acid sequence of 
a human secretory protein. 


1925 


100 


1472 


AAB089 
71 


Homo sapiens 


HUMA- Human secreted protein
sequence encoded by gene 25
SEQ ID NO:128.


1090 


yy 


1473 


AAG892 
62 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 382. 


231 


100 


1473 


AAY307 
21 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


231 


100 


1473 


AAB236 
15 


Homo sapiens 


ALPH- Human secreted protein 
SEQ ID NO: 30. 


222 


97 


1474 


gi702033 


Homo sapiens 


cDNA FLJ20318 fis, clone


2962 


100 
6




HEP08704. 






1474 


AAM406 
51 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 5582. 


1804 


47 


1474 


AAM388 
65 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2010. 


1804 


47 


1475 


gi144956
21 


Homo sapiens 


hypothetical protein FLJ22578, 
clone MGC:14892
IMAGE:3506508, mRNA, 
complete cds. 


816 


100 


1475 


gi104390
14 


Homo sapiens 


cDNA: FLJ22578 fis, clone
HSI02546. 


802 


100 


1475 


AAM728 
25 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 33131. 


616 


100 


1476 


gi222469 
7 


Homo sapiens 


Human mRNA for KIAA0378 
gene, partial cds. 


4017 


100 


1476 


gi668158 
3 


Homo sapiens 


ELKS mRNA, complete cds. 


3463 


72 


1476 


gi134457
84 


Mus musculus


Rab6-interacting protein 2 
isoform A 


3423 


70 


1477 


gi155303
23 


Homo sapiens 


clone MGC:4131 
IMAGE:2961417, mRNA, 
complete cds. 


3200 


99 


1477 


gi163075
02 


Mus musculus 


Unknown (protein for 
MGC:11530)


3076 


95 


1477 


gi152772
34 


Homo sapiens 


genomic DNA, chromosome 
6p21.3, HLA Class I region,
section 12/20. 


2227 


99 


1478 


AAY117 
94 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID No: 394. 


375 


100 


1478 


AAB949 
77 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:16558.


68 


35 


1478 


gi104331
41 


Homo sapiens 


cDNA FLJ11800 fis, clone
HEMBA1006252. 


68 


35 


1479 


AAB087 
32 


Homo sapiens 


UYCO Amino acid sequence of 
a human OLD-35 polypeptide. 


3498 


98 


1479 


gi128358
17 


Mus musculus 


putative 


2439 


89 


1479 


AAB926 
84 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:11065.


2369 


99 


1480 


gi144245
68 


Homo sapiens 


Mov10 (Moloney leukemia
virus 10, mouse) homolog, clone 
MGC:15000 IMAGE:4109453,
mRNA, complete cds. 


4983 


100 


1480 


gi128034
47 


Homo sapiens 


Similar to Moloney leukemia 
virus 10, clone MGC:2948 
IMAGE:3138543,mRNA, 
complete cds. 


4983 


100 


1480 


gi100473
39 


Homo sapiens 


mRNA for KIAA1631 protein, 
partial cds. 


4983 


100


1481 


AAU055 
84 


Homo sapiens 


OXFO- Human breast cancer 
membrane protein 81, BCMP- 
81. 


718 


100 


1481 


AAU257 
27 


Homo sapiens 


OXFO- Breast cancer-associated 
membrane protein (BCMP) 81. 


718 


100 


1481 


AAW857 
38 


Homo sapiens 


SAGA Polypeptide with 
transmembrane domain. 


718 


100 


1482 


gi140179
57 


Homo sapiens 


mRNA for KIAA1870 protein, 
partial cds. 


1496 


94 


1482 


AAB420 
00 


Homo sapiens 


CURA- Human ORFX 
ORF1764 polypeptide sequence 
SEQ ID NO:3528.


1 1 AO 

1302 




1482 


AAB938 
66 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 13743. 


OOU 


inn 


1483 


gi140179
57 


Homo sapiens 


mRNA for KIAA1870 protein,
partial cds. 


loOo 


inn 
IUU 


1483 


AAB420 
00 


Homo sapiens 


CURA- Human ORFX 
ORF1764 polypeptide sequence 
SEQ ID NO:3528.


1414 


100 


1483 


AAB938 
66 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:13743.


860 


100 


1484 


gi499184


Felis catus 


neuronal protein 


617 


96 


1484 


AAB950 
41 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:16804. 


507 


77 


1484 


gi795925 
1 


Homo sapiens 


mRNA for KIAA1495 protein, 
partial cds. 


507 


77 


1485 


AAB604 
57 


Homo sapiens 


INCY- Human cell cycle and 
proliferation protein CCYPR-5, 
SEQ ID NO:5.


928 


100 


1485 


gi150805
50 


Homo sapiens 


hypothetical protein FLJ23467,
clone MGC:21000 
IMAGE:4509736, mRNA, 
complete cds. 


928 


100 


1485 


gi104401
66 


Homo sapiens 


cDNA: FLJ23467 fis, clone 
HSI11213. 


925 


99 


1486 


AAB933 
01 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 12369. 


4341 


99 


1486 


gi140426
07 


Homo sapiens 


cDNA FLJ14812 fis, clone 
NT2RP4002081, weakly similar 
to TRANSCRIPTION 
INITIATION FACTOR IIA 
ALPHA AND BETA CHAINS. 


4341 


99 


1486 


gi100471
79 


Homo sapiens 


mRNA for KIAA1557 protein, 
partial cds. 


4168 


99 


1487 


AAB948 
04 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15939.


2050 


100 


1487 


gi150825
06 


Homo sapiens 


hypothetical protein FLJ13910, 
clone MGC:20406 
IMAGE:4636136, mRNA, 
complete cds. 


2050 


100


1487 


gi104361
89 


Homo sapiens 


cDNA FLJ13910 fis, clone 
Y79AA1000131. 


2050


100


1488 


gi128460
13 


Mus musculus 


putative 


lo/o 


97 


1488


gl /ojyjj 

9 


Homo sapiens


rj\v m kin a } compiece cas. 


17RO 


yo 


1488 


gi136041
69 


Homo sapiens 


ARG147 mRNA, complete cds. 


1575 


99 


1489 


gi126979
35 


Homo sapiens 


mRNA for KIAA1695 protein, 
partial cds. 


2124 


100 


1489 


gi104386
24 


Homo sapiens 


cDNA: FLJ22297 fis, clone 
HRC04521. 


2124 


100 


1489 


AAB424 
21 


Homo sapiens 


CURA- Human ORFX 
ORF2185 polypeptide sequence 


1571 


100 
SEQ ID NO:4370.






1490


A/\o4/D 

62 


Homo sapiens 


INCY- Protease PRTS-4. 


4321 


99 


1490


85 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3804. 


4317 


99 


1490


gll 28363 


Mus musculus 


putative 


4152 


95 


1491


guo553o 
16 


Homo sapiens 


cDNA FLJ25123 fis, clone
CBR06154. 


1752 


93 


1491


A A rvi i o 

34 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 25726. 


1347 


98 


1491


AAM257 
94 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:1309.


919 


99 


1400 


gl 140433 
13 


Homo sapiens 


clone IMAGE:3609599, 
mRNA, partial cds. 


780 


100 


1492 


AAY122 
25 


Homo sapiens 


GEST Human 5' EST secreted
protein SEQ ID NO: 538. 


511 


97 


1492 


AAG005 
45 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4626. 


500 


97 




AAM934 
50 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3100. 


2693 


99 


1493 


AAY077 
54 


Homo sapiens 


HUMA- Human secreted protein 
fragment encoded from gene 11.


1723 


100 


1493 


AAW790 
94 


Homo sapiens 


GEMY Human secreted protein 
do568 11. 


1699 


98 


1494 


AAG648 
94 


Homo sapiens 


BIOD- Human phosphoenol 
pyruvate carboxylase 81. 


3851 


100 


1494 


AAB952 
50 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17415.


1589 


100 


1494 


gi132766
65 


Homo sapiens 


mRNA; cDNA 

DKFZp761K1524 (from clone 
DKFZp761K1524); complete 
cds. 


1493 


100 


1495 


AAB437 

37 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO:1182.


817 


89 


1495 


AAR592 
88 


Homo sapiens 


SHIO Human reg protein. 


817 


89 


1495


gl 5 76455 
5 


Homo sapiens 


lithostathine (REG1A) mRNA, 
complete cds. 


817 


89 


1496


gl 126525 
61 


Homo sapiens 


Similar to cytochrome b-561,
clone MGC:3308 
IMAGE:3509626, mRNA, 
complete cds. 


1129 


96 




crJ 1 0 A040 

35 


Homo sapiens 


Similar to cytochrome b-561, 
clone MGC:2190 
IMAGE:3535771, mRNA,
complete cds.


1126 


95 


1496 


gi939707 


Homo sapiens 


Human cytochrome b561 gene, 
exon 5 and complete cds. 


1124 


95 


1497 


gi104370
90 


Homo sapiens 


cDNA: FLJ21077 fis, clone
CAS02152. 


2182 


99 


1497 


gi104372
11 


Homo sapiens 


cDNA: FLJ21159 fis, clone
CAS09969. 


1885 


100 


1497 


AAB639 
60 


Homo sapiens 


LUDW- Human prostate cancer 
associated antigen protein 
sequence SEQ ID NO: 13 22 


904 


96 
1498


AAB409 
96 


Homo sapiens 


CURA- Human ORFX ORF760 
polypeptide sequence SEQ ID 

NO: 1520. 


01 A1 




1498 


gi104369
63 


Homo sapiens 


cDNA: FLJ20986 fis, clone 
CAE01156. 


3137 


99 


1498 


AAM935 
25 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3259. 


2627 


99 


1499 


AAB998 
91 


Homo sapiens 


CHUG- Human RNA helicase 
gene helicain B protein sequence 
SEQ ID NO:4. 


3818 


100 


1499 


gi165665
50 


Homo sapiens 


DEAD/DEXH helicase DDX31 
mRNA, complete cds. 


3817 


99 


1499 


gi152152
73 


Homo sapiens 


hypothetical protein FLJ13633, 
clone MGC:14872 
IMAGE:3941452, mRNA, 
complete cds. 


3455 


100 


1500 


gi120055
11 


Homo sapiens 


HT027 mRNA, complete cds. 


744 


100 


1500 


gi104368
44 


Homo sapiens 


cDNA: FLJ20886 fis, clone 
ADKA03257. 


739 




1500 


gi110369
73 


Homo sapiens 


HSP22-like protein interacting 
protein 17 mRNA, complete cds. 


459 


100 


1501 


AAB952 
61 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17444. 


906 


100 


1501 


gi104347
55 


Homo sapiens 


cDNA FLJ12967 fis, clone 
NT2RP2005806. 


906 


100 


1501 


AAB942 
67 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 14682. 


587 


95 


1502 


gi165534
61 


Homo sapiens 


cDNA FLJ33132 fis, clone
UMVEN2000133, weakly 
similar to RABPHILIN-3A. 


2594 


yy 


1502 


gi104386
90 


Homo sapiens 


cDNA: FLJ22344 fis, clone 
HRC06080. 


1661 


AA 

yy 


1502 


AAB935 
62 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:12957.






1503 


gi165493
30 


Homo sapiens 


cDNA FLJ30165 fis, clone
BRACE2000698, weakly similar 
to ANKYRIN 2. 


2280 


98 


1503 


gi126527
41 


Homo sapiens 


clone MGC:3130
IMAGE:3352851, mRNA,
complete cds. 


2267 


78 


1503 


AAO018 
50 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 15742.


1975 


89 


1504 


AAB662 
95 


Homo sapiens 


ICOS- Human tankyrase2 
TANK2-SHORT SEQ ID NO: 
135. 


6088 


99 


1504 


AAB662 

OA 


Homo sapiens 


ICOS- Human tankyrase2 
133. 


6088 


99


1504 


AAB662 
90 


Homo sapiens 


ICOS- Human tankyrase2 clone 
consensus protein SEQ ID NO: 
107. 


6088 


99 


1505 


gi568942 
7 


Homo sapiens 


mRNA for KIAA1045 protein, 
partial cds. 


2087 


99 


1505 


gi133586
52 


Macaca 
fascicularis 


hypothetical protein 


1205 


96 


1505 


ABB1680 


Homo sapiens 


HUMA- Human nervous system 


370 


100 




6 




related polypeptide SEQ ID NO 
5463. 






1506 


" gi568942 
7 


Homo sapiens 


mRNA for KIAA1045 protein, 
partial cds. 


2052 


94 


1506 


gi133586
52 


Macaca 
fascicularis 


hypothetical protein 


1205 


96 


1506


ABB1680
6 


Homo sapiens 


HUMA- Human nervous system
related polypeptide SEQ ID NO 
5463. 


370 


100 


1507 


gi104390
66 


Homo sapiens 


cDNA: FLJ22612 fis, clone
HSI04965.


2767 


100 


1507 


gi145298
86 


Mus musculus 


bM14504.1 (novel protein) 


2276 


78 


1507 


gi128553
00 


Mus musculus 


putative 


2276 


78 


1508 


gi114933
65 


Homo sapiens 


Human DNA sequence from 
clone RP5-1009E24 on 
chromosome 20 Contains the SN 
gene encoding sialoadhesin, a 
novel gene similar to 
KIAA0417, the CENPB gene for 
centromere protein B, the 
CDC25B gene for Cell division 
cycle protein 25B, three novel 
genes, the 5' end of gene
KIAA1271, nine CpG islands, 
ESTs, STSs and GSSs, complete 
sequence. 


6334 


99 


1508 


gi126561
30 


Homo sapiens 


sialoadhesin mRNA, complete 
cds. 


6330 


99 


1508 


gi104404
38 


Homo sapiens 


mRNA for FLJ00055 protein, 
partial cds. 


5046 


99 


1509 


AAY765 
39 


Homo sapiens 


META- Human ovarian tumor 
EST fragment encoded protein 
35. 


261 


98 


1510 


AAE0488 
4 


Homo sapiens 


INCY- Human protease protein-
11 (PRTS-11).


424 


100 


1510 


AAB732
63 


Homo sapiens 


UYAL- Human triacylglycerol 
hydrolase, TGH. 


209 


51 


1510 


gi180950


Homo sapiens 


Human carboxylesterase mRNA, 
complete cds. 


209 


51 


1511


AAB944 

05


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 14983. 


4412 


100 


1511


gi104347
98 


Homo sapiens 


cDNA FLJ12994 fis, clone
NT2RP3000207, weakly similar
to GLUCOAMYLASE S1/S2
PRECURSOR (EC 3.2.1.3). 


4422 1 


100 


1511 


gi104371
27 


Homo sapiens 


cDNA: FLJ21104 fis, clone
CAS04958. 


978 


100 


1512 


AAB621 
75 


Homo sapiens 


PLAC Human p110FYVE
protein. 


4028 


99 


1512 


AAF5740 
3 aal 


Homo sapiens 


PLAC Human p110FYVE
protein encoding DNA. 


4027 


99 


1512 


gi113449
51 


Homo sapiens 


FYVE-finger-containing Rab5 
effector protein Rabenosyn-5 
mRNA, complete cds. 


4027 


99 


1513 


gi795926 


Homo sapiens


mRNA for KIAA1500 protein,


4061 


100 
1




partial cds. 






1513 


gi983742 
7 


Lytechinus 
variegatus 


embryonic blastocoelar 
extracellular matrix protein 
precursor 


1085 


34 


1513 


AAG733 
54 


Homo sapiens 


HUMA- Human gene 9-encoded 
secreted protein HETAM53, 
SEQ ID NO:125.


517 


100 


1514 


gi165501
08 


Homo sapiens 


cDNA FLJ30829 fis, clone 
FEBRA2001790, highly similar 
to Xenopus laevis RRM- 
containing protein SEB-4 
mRNA. 


914 


100 


1514 


gi136244
61 


Homo sapiens 


Human DNA sequence from 
clone RP1-259A10 on 
chromosome 6p22.1-23 Contains 
the gene for an ssDNA binding 
protein (SEB4D), ESTs, STSs, 
GSSs and a CpG island, 
complete sequence. 


914 


100 


1514 


gi889569 
8 


Xenopus laevis


RRM-containing protein SEB-4 


790 


oo 
OO 


1515 


gi165515
80 


Homo sapiens 


cDNA FLJ31673 fis, clone 
NT2RI2005061. 


3158 


99 


1515 


AAB949 
29 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:16399.


2275 


100 


1515 


gi104328
47 


Homo sapiens 


cDNA FLJ11565 fis, clone
HEMBA1003229. 


2275 


1 


1516 


gi394768
8 


Homo sapiens 


mRNA for Sec24 protein 
(Sec24A isoform), partial. 




QQ 


1516 


AAM791 
11 


Homo sapiens 


HYSE- Human protein SEQ ID
NO 1773. 






1516 


gi394769 
0 


Homo sapiens 


mRNA for Sec24 protein
(Sec24B isoform). 






1517 


AAY120 
49 


Homo sapiens 


GEST Human 5' EST secreted
protein SEQ ID NO: 362. 


253 


92 


1517 


gi415506 
3 


Helicobacter 
pylori J99 


putative 


75 


29 


1517 


gi573852 
2 


Schizosaccharom 
yces pombe 


putative pre-mrna splicing factor 
atp-dependent rna helicase 


O? 


jj 


1518 


AAB203 
49 


Homo sapiens 


UYRQ Human vomeronasal-like 
receptor hVLR1 (long form).


1 Q<Q 
lOJy 


QO 

77 


1518 


AAG642 
95 


Homo sapiens 


HELI- Human GTP-binding 
protein-coupled receptor 
GPRv31. 




OO 


1518 


gi998858 
5 


Homo sapiens 


putative pheromone receptor 
V1RL1 long form (V1RL1) 
mRNA, complete cds. 




OQ 

yy 


1 CI Q 

i5iy 


09 


Homo sapiens


rlnnp 1 74 
cione iivij\\jxii*j Jvy * /*t, 

mRNA, partial cds. 


2759 


90 


1519 


gi142499
07 


Homo sapiens 


clone IMAGE:3506145, 
mRNA, partial cds. 


2759 


90 


1519 


AAY993 
55 


Homo sapiens 


GETH Human PRO1295
(UNQ664) amino acid sequence 
SEQ ID NO:54.


1265 


100 


1520 


gi160416
86 


Homo sapiens 


hypothetical protein FLJ22393, 
clone MGC:16798
IMAGE:3916157, mRNA, 


1470 


100 
complete cds.






1520


gnU4Jo/ 
63 


Homo sapiens 


cDNA: FLJ22393 fis, clone 
HRC07880. 


1463 


99 




gi128534
19 


Mus musculus


putative 


1446 


98 


1521 


A AT>1CA 

AAR350 
no 


Homo sapiens 


UYPR- Human t-complex 

associated testes expressed 
protein 1. 


2576 


97 


1521 


gi201910


Mus musculus 


Tcte-1 peptide 


1883 


74 


1521 


gi730028 
5 


Drosophila 
melanogaster 


CG14325 gene product


348 


27 


1522 


gi128517
62


Mus musculus 


putative 


689 


88 


1522 


AAG022
98


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6379. 


382 


100 


1522 


gi167690
34 


Drosophila 
melanogaster 


LD15209p 


294 


38 


1523 


AAQ905 
26 aal 


Homo sapiens 


OKLA- Human SIII 15kDa
subunit cDNA. 


426 


100 


1523 


AAW138 
50 


Homo sapiens 


OKLA- Human RNA 
polymerase transcription factor 
elongin 15kDa subunit. 


426 


100 


1523 


AAR750 
87 


Homo sapiens 


OKLA- Human SIII 15 kDa
subunit. 


426 


100 


1524 


gi128556
72 


Mus musculus 


putative 


2165 


85 


1524 


AAU174 
29 


Homo sapiens 


HUMA- Novel signal 
transduction pathway protein, 
Seq. ID 994.


987 


98 


1524 


AAG040 
81 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 8162. 


507 


99 


1525 


AAW330 
85 


Homo sapiens 


SAGA Human E2 ubiquinone 
binding enzyme. 


667 


88 


1525 


gi130971
95 


Homo sapiens 


ubiquitin-conjugating enzyme 
E2N (homologous to yeast 
UBC13), clone MGC:5063 
IMAGE:2900313,mRNA, 
complete cds. 


667 


88 


1525 


gi126532
55 


Homo sapiens 


ubiquitin-conjugating enzyme 
E2N (homologous to yeast 
UBC13), clone MGC:8489 
IMAGE:2822013,mRNA, 
complete cds. 


667 


88 


1526 


AAY872 

71 


Homo sapiens 


INCY- Human signal peptide
containing protein HSPP-48
SEQ ID NO:48.


471 


86 


1526


gi J /IZiSU 

86 


Corynebacterium 
glutamicum 


cdsA 


70 


27 


1526 


gi125442
26 


Corynebacterium 
glutamicum 


RXA01894 


70 


27 


1527 


gi133589
42 


Macaca 
fascicularis 


hypothetical protein 


2660 


97 


1527 


AAB875 
87 


Homo sapiens 


GETH Human PRO1693.


2647 


100 


1527 


AAU124 
39 


Homo sapiens 


GETH Human PRO1693
polypeptide sequence. 


2647 


100 


1528 


AAB425 


Homo sapiens 


CURA- Human ORFX 


2137 


99 
73




ORF2337 polypeptide sequence 
SEQ ID NO:4674.






1528 


gi128458
23 


Mus musculus 


putative 


1792 


95 


1528 


gi449506 
3 


Homo sapiens 


Human DNA sequence from 
clone 108K11 on chromosome 
6p21 Contains SRP20 (SR 
protein family member), Ndr 
protein kinase gene similar to 
yeast suppressor protein SRP40,
EST and GSS, complete 
sequence. 


1468 


99 


1529 


gi158236
36 


Homo sapiens 


ALS2 mRNA, complete cds, 
long form. 


8660 


99 


1529 


gil6076g 
12 


Homo sapiens 


alsin mRNA, complete cds. 


8646 


99 


1529 


gi158236
40 


Mus musculus 


Als2 


8005 


91 


1530 


AAG641 
71 


Homo sapiens 


TAKE Human profilin IIL. 


750 


100 


1530 


gi128042
13 


Homo sapiens 


profilin 2, clone MGC:1684 
IMAGE:3533907, mRNA, 
complete cds. 


750 


100 


1530 


gi109525
20 


Homo sapiens 


profilin IIa (PFN2) mRNA,
complete cds, alternatively 
spliced. 


750 


100 


1531 


AAG641 
71 


Homo sapiens 


TAKE Human profilin IIL. 


636 


87 


1531 


gi128042
13 


Homo sapiens 


profilin 2, clone MGC:1684 
IMAGE:3533907, mRNA, 
complete cds. 


636 


87 


1531 


gi109525
20 


Homo sapiens 


profilin IIa (PFN2) mRNA,
complete cds, alternatively 
spliced. 


636 


87 


1532 


AAB949 
52 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:16482.


1830 


99 




gI17yU/7 
91 


Homo sapiens 


TAIP-2 mRNA for TGF-beta
induced apoptosis protein 2,
complete cds. 


1830 


99 


1532 


gi104330

JO 


Homo sapiens 


cDNA FLJ11703 fis, clone
HEMBA1005075.


1830 


99 


1533 


gi141332
JLo 


Homo sapiens 


mRNA for KIAA0876 protein, 
partial cds. 


4559 


100 


1533 


gi691056 

5 


Homo sapiens 


chromosome 19, BC335474 
(CIT-HSPC__482H14), complete 
sequence. 


4370 


99 


1533 


gi139380


Mus musculus 


Similar to KIAA0677 gene 
product


3313 


73 


1534 


gi724319 
1 


Homo sapiens 


mRNA for KIAA1405 protein, 
partial cds. 


3986 


99 


1534 


gi123136
47 


Mus musculus 


MmKIF17 


3319 


77 


1534 


gi410218
4 


Homo sapiens 


KIF3-related motor protein 
(KIF3X) mRNA, partial cds. 


1084 


90 


1535 


gi120533
11 


Homo sapiens 


mRNA; cDNA DKFZp434K229 
(from clone DKFZp434K229); 
complete cds. 


1600 


100 



1535


gi104389
86 


Homo sapiens 


cDNA: FLJ22557fis, clone 

ijoTni/ioi 
rlMU1483. 


1405 


100 


1535 


gi152145
57 


Mus musculus 


RIKEN cDNA 2410042D21
gene 


1369 


86 


1536 


gi122749
33 


Homo sapiens 


mRNA for alanine:glyoxylate
aminotransferase 2 homolog 1,
splice form I (AGXT2L1 gene).


2018 


100 


1536 


gi128367
24 


Mus musculus 


putative 


1689 


83 


1536 


gi158596
90 


Homo sapiens 


unnamed protein product 


1189 


66 


1537 


gi140178
47 


Homo sapiens 


mRNA for KIAA1815 protein,
partial cds. 


2117 


100 


1537 


gi123141
59 


Homo sapiens 


Human DNA sequence from 
clone RP11-207C16 on 
chromosome 9p23-24.3. 
Contains the 3' end of the gene 
for a novel protein similar to C 
elegans R06F6.8 (Sw:Q09417) 
(contains KIAA1432), the J end 
of the gene for a novel protein 
similar to predicted yeast, plant 
and worm proteins, ESTs, STSs 
and GSSs, complete sequence. 


2117 


100 




nil A/ISOO 

48 


Homo sapiens 


cDNA: FLJ23309 fis, clone 
HEP11618. 


1725 


99 




cri 104 "271 

87 


Homo sapiens 


cDNA: FLJ21144 fis, clone
CAS07955. 


1919 


99 


1 SIR 


60 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17664.


1912 


99 


1 K"IQ 
I J JO 


10 


Mus musculus 


putative 


1672 


84 


j joy 


6 


Homo sapiens 


mRNA for KIAA0611 protein,
partial cds. 


4702 


100 


1539 


gil 39053 

CO 


Mus musculus 


Similar to ATPase, class II, type 
9A 


3961 


98 


1539 


gi643496 
g 


Mus musculus 


putative E1-E2 ATPase 


3942 


98 


1540 


AAB939 
76 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14051.


2580 


100 


1540 


gil04338 
68 


Homo sapiens 


cDNA FLJ12401 fis, clone
MAMMA1002796.


2580 


100 


1540 


AAM698 
00 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 30106. 


1108 


100 


1541 


AAB949 
01 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:16290.


1317 


99 


1541 


gil04327 
16 


Homo sapiens 


cDNA FLJ11457 fis, clone
HEMBA1001522. 


1317 


99


1541 


gil 28531 
91 


Mus musculus 


putative 


887 


62 


1542 


AAB735 
07 


Homo sapiens 


INCY- Human transferase 
HTFS-14, SEQ ID NO:14.


1698 


99 


1542 


gil 65524 
96 


Homo sapiens 


cDNA FLJ32390 fis, clone 
SKMUS1000177, weakly 
similar to PROTEIN-L- 
ISOASPARTATE O- 


1698 


99
METHYLTRANSFERASE (EC
2.1.1.77). 






1542 


gill323l 
92 


Homo sapiens 


Human DNA sequence from 
clone RP5-1022E24 on 
chromosome 20 Contains the 3' 
end of the OPRL1 gene
encoding Opiate receptor-like 1 
protein, the GPR8 gene encoding 
a G protein-coupled receptor, the 
KIAA0835 gene encoding a 
protein similar to the myelin 
transcription factor 1 (MYT1), a 
novel gene, 7 CpG islands, 
ESTs, STSs and GSSs, complete 
sequence. 


1156 


69 


1543 


AAB946 
44 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15536.


1941 


100 


1543 


gi 146258 
75 


Homo sapiens 


mRNA for putative autophagy- 
related cysteine endopeptidase 
(AUTL1 gene). 


1941 


100 


1543 


gi 140426 
98 


Homo sapiens 


cDNA FLJ14867 fis, clone 
PLACE1002319.


1941 


100 


1544 


gi996720 
4 


Macaca 
fascicularis 


hypothetical protein 


2621 


100 


1544 


AAB883 
51 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone 
PSEC0076. 


2616 


99 


1544 


gi 142725 
56 


Homo sapiens 


unnamed protein product 


2616 


99 


1545 


gi 120054 
29 


Homo sapiens 


homeobox-containing 
transcription factor HOXD1
(HOXD1) mRNA, complete cds, 


1726 


100 


1545 


gill 0956 
18 


Homo sapiens 


HOX D1 protein (HOXD1)
gene, complete cds. 


1726 


100 


1545 


gi 156802 
45 


Homo sapiens 


homeo box D1, clone
MGC:23144 IMAGE:4869019,
mRNA, complete cds. 


1718 


99 


1546 


AAB953 
66 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17677. 


2511 


100 


1546 


gi 140425 
38 


Homo sapiens 


cDNA FLJ14773 fis, clone
NT2RP3004566, weakly similar 
to ZINC FINGER PROTEIN 84. 


2511 


100 


1546 


gi 104386 
30 


Homo sapiens 


cDNA: FLJ22301 fis, clone 
HRC04777. 


2511 


100 


1547 


gil55596 
28 


Homo sapiens 


mitochondrial ribosomal protein 
S5, clone MGC:20735 
IMAGE:4561399, mRNA, 
complete cds. 


2262 


99 


1547 


gil362U5 
81 


Homo sapiens 


MRPS5 mRNA for

mitochondrial ribosomal protein 
S5, complete cds. 




99


1547


gi 1 36208 
83 


Mus musculus


mitochondrial ribosomal protein 
S5 


1821 


79 


1548 


gi 126979 
11 


Homo sapiens 


mRNA for KIAA1683 protein,
partial cds. 


1772 


72 


1548 


gi 120532 
39 


Homo sapiens 


mRNA; cDNA DKFZp4340194 
(from clone DKFZp4340194); 
complete cds. 


1690 


49


1548


AATI17d 
84 


Homo sapiens 


HUMA- Novel signal
transduction pathway protein,
Seq ID 1049.


1659 


100 


1549 


gi!0438l 


Homo sapiens 


cDNA: FLJ21945 fis, clone
HEP04702. 


3738 


99 


1549 


gi!044l9 

56 


Homo sapiens 


clone PP384 unknown mRNA. 


1396 


99 




AAUZZZ 

52 


Homo sapiens 


HUMA- Human cardiovascular
system antigen polypeptide SEQ 
ID No 1026. 


480 


98 


1550 


gi!28524 

521 
01 


Mus musculus 


putative 


1049 


56 


1550 


AAB189 
oo 


Homo sapiens 


INCY- Amino acid sequence of 
a human transmembrane protein. 


922 


89 


1550 


AAB427 

01 


Homo sapiens 


CURA- Human ORFX 
ORF2545 polypeptide sequence 
SEQ ID NO:5090.


513 


100 


1551


1 


Homo sapiens 


mRNA for KIAA1199 protein,
partial cds. 


2521 


97 


1551


A AV057 

93 


Homo sapiens 


HUMA- Human secreted protein
fragment encoded from gene 12. 


1919 


96 


1551


glOJ 1 0 1 O 

8 


Homo sapiens 


mRNA full length insert cDNA 
clone EUROIMAGE 2155535. 


1352 


95 


1552


17 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15016.


1027 


99 


1552


gl 1 U4 j*lo 

35 


Homo sapiens 


cDNA FLJ13018 fis, clone
NT2RP3000685. 


1027 


99 




gl IZOO/S 

67 


Mus musculus 


putative 


972 


92 


1553 


gi 159294 
11 


Homo sapiens 


clone IMAGE:4040789, 
mRNA, partial cds. 


1369 


100 




a xrni/i 
90 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7571. 


450 


96 


1553 


gi 128458 

Zo 


Mus musculus 


putative 


376 


68 


1554 


gi998895 
8 


Homo sapiens 


F-box protein FBX30 mRNA, 
complete cds. 


1231 


100 


1 JJ'T 


44 


Homo sapiens 


Similar to F-box only protein 6, 
clone MGC:14140
IMAGE:4054414, mRNA,
complete cds. 


1211 


100 


1554 


89 


— : 

Homo sapiens 


CURA- Human ORFX 
ORF2753 polypeptide sequence 

SEQ ID NO:5506.


675 


99 


1555 


gi938026 


Homo sapiens 


Human mRNA for RanBP1
(Ran-binding protein 1),
complete cds. 


916 


100 


1555 


AAB566 
19 


Homo sapiens 


ROSE/ Human prostate cancer 
antigen protein sequence SEQ 
ID NO:1197.




99


1555 


gi620083 


Homo sapiens 


H.sapiens mRNA for RanBP1.


904


99 


1556 


gi!33832 
65 


Homo sapiens 


mRNA for actin related protein, 
complete cds. 


1962 


100 


1556 


gil65500 
55 


Homo sapiens 


cDNA FLJ30784 fis, clone
FEBRA2000881, moderately 
similar to ACTIN 6. 


1950 


99 


1556 


gi!39383 


Homo sapiens 


clone MGC:15664


1949 


99
19




IMAGE:3349184, mRNA, 
complete cds. 






1557 


gi623545 


Oryctolagus 
cuniculus 


sarcoplasmic reticulum 
glycoprotein 


2366 


C\£L 

yo 


1557 


gil64861 


Oryctolagus 
cuniculus


sarcolumenin precursor 


2307 


97 


1557 


gi496325 


Gallus gallus 


53 kDa glycoprotein 


2160


87 


1558 


AAB946 
41 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15526.


1379 


98 


1558 


AAG644 
03 


Homo sapiens 


SHAN- Human paneth cell 
enhanced expression-like 
protein. 


1379 


98 


1558 


AAM940 
28 


Homo sapiens 


HELI- Human stomach cancer 
expressed polypeptide SEQ ID 
NO 126. 


1379 


98 


1559 


gi874531
5 


Homo sapiens 


putative GTP-binding protein 
(GTPBP2) mRNA, partial cds. 


2742 


99 


1559 


gil35610 
07 


Homo sapiens 


Human DNA sequence from 
clone RP11-22I24 on
chromosome 6 Contains the 3'
part of the POLH gene for DNA 
directed polymerase eta and the 
GTPBP2 gene for GTP binding 
protein 2, ESTs, STSs, GSSs and 
a CpG island, complete 
sequence. 


2742 


99 


1559 


gi874531 
7 


Mus musculus 


putative GTP-binding protein 


2731 


99 


1560 


AAG026 
53 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6734. 


425 


100 


1560


gi 150804 
59 


Homo sapiens 


clone MGC:9017 
IMAGE:3860059, mRNA, 
complete cds. 


425 


100 


1560 


gi233792 
0 


Homo sapiens 


Human syntaxin 7 mRNA, 
complete cds. 


421 


98 


1561 


AAB944 
68 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15128.


4375 


99 


1561 


gil04349 
44 


Homo sapiens 


cDNA FLJ13089 fis, clone
NT2RP3002108, weakly similar 
to DEC 1 PROTEIN. 


4375 


99 


1561 


gi730058 
1 


Drosophila
melanogaster 


CG4845 gene product 


1083 


31 


1562 


AAU121 
77 


Homo sapiens 


GETH Human PRO305 
polypeptide sequence. 


226 


100 


1562 


AAY647 
34 


Homo sapiens 


GEST Human 5' EST related 
polypeptide SEQ ID NO:895.


226 


100 


1562 


AAY814 
87 


Homo sapiens 


FUJY Human cathepsin L2. 


226 


100 


1563 


gil20527 
26 


Homo sapiens 


mRNA; cDNA 

DKFZp761N0411 (from clone 
DKFZp761N0411); complete 
cds. 


2381 


99 


1563 


gil42509 
20 


Homo sapiens 


mRNA for SMC6 protein. 


2374 


99 


1563 


gil42509 
22 


Mus musculus 


SMC6 protein 


2163 


88 


1564 


gil49705 


Homo sapiens 


mRNA for WDR9 protein 


524 


98 



62




(WDR9 gene), form A.






1564


gi 149705 
91 


Mus musculus 


WDR9 protein, form A 


363 


79 


1564 


AAB344 
87 


Homo sapiens


HUMA- Human secreted protein 
BLAST search protein SEQ ID 
NO: 105. 


169 


52 


1565 


gi680832 
9 


Homo sapiens 


mRNA; cDNA

DKFZp434K0410 (from clone
DKFZp434K0410); partial cds. 


787 


100 


1565 


AAB931 
88 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:12140.


528 


51 


1565 


AAB927 
02 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:11102.


528 


51 


1566 


gi!28038 
41 


Homo sapiens 


Similar to retinoic acid induced 
12, clone MGC:3373 
IMAGE:3629369, mRNA,
complete cds. 


1576 


99 


1566


AAB267 
93 


Homo sapiens 


UYFU- Human melanoma 
growth related factor-1 amino 
acid sequence. 


1199 


95 


1566 


giI32777 
02 


Mus musculus 


retinoic acid induced 12 


1182 


75 


1567 


AAB534 
05 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein sequence SEQ 
ID NO:945.


1288 


93 


1567 


gi29587 


Homo sapiens 


Human mRNA for carbonic 
anhydrase II (EC 4.2.1.1). 


1288 


93 


1567 


gi 179795 


Homo sapiens 


Human carbonic anhydrase II
mRNA, complete cds. 


1288 


93 


1568 


gil82146 


Homo sapiens 


eosinophil peroxidase (EPP) 
gene, exon 12 and complete cds. 


3757 


100 


1568 


gi31183 


Homo sapiens 


Human mRNA for eosinophil 
peroxidase. 


3549 


97 


1568 


gil 77737 
8 


Mus musculus 


eosinophil peroxidase 


3376 


89 


1569 


AAB941 
83 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14500.


2405 


100 


1569 


gil 04342 
15 


Homo sapiens 


cDNA FLJ12618 fis, clone
NT2RM4001666, weakly similar 
to HYPOTHETICAL 48.6 KD 
PROTEIN IN ALP A-GABP 
INTERGENIC REGION. 


2405 


100 


1569 


gil35439 
55 


Homo sapiens 


Similar to hypothetical protein 
FLJ12618, clone MGC:12994
IMAGE:3504996, mRNA,
complete cds. 


2082 


94 


1570


AAB939 
04 


Homo sapiens 


HELI- Human protein sequence 
cpn m Kin* 1 1 qao 


2499 


99 


1570 


gi 142862 
14 


Homo sapiens 


hypothetical protein FLJ12150, 
clone MGC:15043
IMAGE:3634992, mRNA,
complete cds. 


2499 


99 


1570 


gil04335 
59 


Homo sapiens 


cDNA FLJ12150 fis, clone
MAMMA1000422.


2499 


99 


1571 


gi768460 
5 


Mus musculus 


smoothelin B


640 


54 


1571 


gi768460 


Mus musculus 


smoothelin A 


640 


54
4










1571 


gi754725 
8 


Mus musculus 


smoothelin small isoform SI 


640 


54 


1572 


AAB950 
33 


Homo sapiens 


HELI- Human protein sequence
SEQ ID NO:16784.


1174 


100 


1572 


gil04334 
42 


Homo sapiens 


cDNA FLJ12056 fis, clone 
HEMBB1002050. 


1174 


100 


1572 


gi530876 


Chlamydomonas 
reinhardtii 


amino acid feature: Rod protein 
domain, aa 266 .. 468; amino 
acid feature: globular protein 
domain, aa 32 .. 265 


142 


26 


1573 


gil28572 
47 


Mus musculus 


putative 


2111 


92 


1573 


gil07279 
09 


Drosophila 
melanogaster 


CG6169 gene product 


688 


48 


1573 


gi238891 
1 


Schizosaccharom 
yces pombe 


hypothetical PSU1-like protein


585 


47 


1574 


AAB419 
54 


Homo sapiens 


CURA- Human ORFX 
ORF1718 polypeptide sequence 
SEQ ID NO:3436.


1765 


97 


1574 


AAB427 
73 


Homo sapiens 


CURA- Human ORFX 
ORF2537 polypeptide sequence 
SEQ ID NO:5074. 


1134 


93 


1574 


gil75122 
54 


Homo sapiens 


hypothetical protein FLJ21156,
clone MGC:29459 
IMAGE:5020837, mRNA, 
complete cds. 


1089 


100 


1575 


AAY400 
90 


Homo sapiens 


HUMA- Peptide sequence 
derived from a human secreted 
protein. 


918 


98 


1575 


gil28546 
39 


Mus musculus 


putative 


443 


69 


1575 


gi 170661 
07 


Homo sapiens 


partial TTN gene for titin. 


86 


25 


1576 


gil04384 
73 


Homo sapiens 


cDNA: FLJ22184 fis, clone 
HRC00983. 


3291 


99 


1576 


gil02417 
12 


Homo sapiens 


mRNA; cDNA 

DKFZp761K0816 (from clone
DKFZp761K0816).


1238 


99 


1576 


gi600118 


Zea mays 


extensin-like protein 


666 


33 


1577 


gi 165492 
61 


Homo sapiens 


cDNA FLJ30107 fis, clone
BNGH41000198, weakly similar 
to TETRACYCLINE 
RESISTANCE PROTEIN, 
CLASS E. 


1453 


100 


1577 


gil47150 
55 


Homo sapiens 


Similar to RIKEN cDNA 
1110002C08 gene, clone
MGC:9564 IMAGE:3872267,
mRNA, complete cds. 


1453 


100 


1577 


gil28338 
45 


Mus musculus 


putative 


1339 


90 


1578 


gil04376 
69 


Homo sapiens 


cDNA: FLJ21551 fis, clone 
COL06266. 


1925 


99 


1578 


AAE0179 
1 


Homo sapiens 


HUMA- Human gene 22 
encoded secreted protein 
HOHDF66, SEQ ID NO:112.


1840 


99 


1578 


AAB417 


Homo sapiens 


CURA- Human ORFX 


1473 


99
ii
i i 




ORF1475 polypeptide sequence
SEQ ID NO:2950.






1579 


gil43279 
15 


Homo sapiens 


clone MGC:11186
IMAGE:3844322, mRNA,
complete cds. 


3446 


100 


1579 


gil40431 


Homo sapiens 


clone MGC:15388 
IMAGE:3350378, mRNA,
complete cds. 


3446 


100 


1579


AAB947 
22 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15741.


3439 


99 


1580 


gil34461 
90 


Homo sapiens 


Human DNA sequence from 
clone RP4-717M23 on 
chromosome 20 Contains the 
gene encoding a CRP2 binding 
protein (CRP2BP), a 
pseudogene, ESTs, STSs, GSSs
and CpG islands, complete 
sequence. 


3955 


97 


1580 


gil-40431 
03 


Homo sapiens 


clone MGC:15388 
IMAGE:3350378, mRNA,
complete cds. 


3952 


97 


1580 


gil43279 
15 


Homo sapiens 


clone MGC:13186
IMAGE:3844322, mRNA, 
complete cds. 


3281 


96 


1581 


gil65516 

1 A 


Homo sapiens 


cDNA FLJ31697 fis, clone
NT2RI2005851, weakly similar 
to PLECTIN. 


1911 


99 


1581


A A \KCC A 

AAMoo4 
35 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 26741. 


588 


100 




A A K A C A A 

AAM540 
44 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded 
protein SEQ ID NO: 26149. 


588 


100 


1582


A AT JOfM 

43 


Homo sapiens 


HUMA- Human secreted 
protein, Seq ID No 435. 


940 


94 


1582


A AX/fOIT 

13 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3654. 


756 


100 


1582


ml 71 /I OA 

gn /i4yu 
39 


Homo sapiens 


MTO1-like protein gene,
complete cds; nuclear gene for 
mitochondrial product 


756 


100 


1583 


ARB 1999 

0 


Homo sapiens 


HYSE- Human peroxisomal Ca- 
dependent solute carrier 
homologue, SEQ:2590. 


344 


100 


1583 


gil28536 
85 


Mus musculus 


putative 


168 


52 


1583 


61 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3707. 


165 


55 


1584 


gil 43491 
25 


Homo sapiens 


mRNA for alpha2- 
glucosyltransferase (ALG10 
gene). 


/IV 


0*2 

Jo 


1584 


gi351345 
1 


Rattus 
norvegicus 


potassium channel regulator 1 


681 


90 


1584 


AAB257 
15 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 3 
SEQ ID NO:104.


617 


93


1585 


gil26979 
39


Homo sapiens 


mRNA for KIAA1697 protein, 
partial cds. 


1904 


100


1585


gi 1 04402 
49 


Homo sapiens 


cDNA: FLJ23529 fis, clone
LNG06042. 


1897 


99 


1585 


gi729341 
5 


Drosophila 
melanogaster 


Dhc16F gene product


786 


44 


1586 


AAZ3583 
4_aal 


Homo sapiens 


INCY- Human vesicle 
trafficking protein 2 encoding 
cDNA. 


804 


84 


1586 


AAB936 
64 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:13188.


804 


84 


1586 


AAY499 
59 


Homo sapiens 


INCY- Human vesicle 
trafficking protein 2. 


804 


84 


1587 


AAY765 
61 


Homo sapiens 


META- Human ovarian tumor 
EST fragment encoded protein 
57.


623 


92 


1587 


gi 136235 
85 


Homo sapiens 


Similar to RIKEN cDNA
1500034E06 gene, clone 
MGC:14151 IMAGE:3690202, 
mRNA, complete cds. 


623 


92 


1587 


gi 1 28586 
76 


Mus musculus


putative 


595 


88 


1588 


AAU206 
47 


Homo sapiens 


HUMA- Human secreted 
protein, Seq ID No 639. 


927 


99 


1588 


AAU205 
23 


Homo sapiens 


HUMA- Human secreted 
protein, Seq ID No 515. 


927 


99 


1588 


AAB954 
32 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17854.


927 


99 


1589 


AAB591 
91 


Homo sapiens 


UYCO Human NADE. 


293 


57 


1589 


gi845289 
4 


Homo sapiens 


p75NTR-associated cell death 
executor (NADE) mRNA, 
complete cds. 


293 


57 


1589 


gi 189379 


Homo sapiens 


Human unknown protein from 
clone pHGR74 mRNA, 
complete cds. 


293 


57 


1590 


AAG017 
16 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 5797. 


894 


99 


1590 


AAY824 
73 


Homo sapiens 


LLTL- Human APG12 protein 
sequence. 


724 


100 


1590 


gi411573 
1 


Homo sapiens 


mRNA for Apg12, complete cds.


724 


100 


1591 


AAW877 
01 


Homo sapiens 


INCY- A human membrane 
fusion protein designated 
SYTAXL 


1357 


99 


1591 


gi420024 
1 


Homo sapiens 


H.sapiens gene from PAC 42616,
similar to syntaxin 7. 


1325 


100 


1591 


gil47150 
19 


Mus musculus


Unknown (protein for 
MGC:6471) 


1280 


93 


1592 


AAA540 
89 aal 


Homo sapiens 


GETH PRO211 cDNA.


1944 


87 


1592 


AAB530 
75 


Homo sapiens 


GETH Human angiogenesis-
associated protein PRO211, SEQ
ID NO:57.


1944 


87 


1592 


AAB612 
31 


Homo sapiens 


MILL- Human TANGO 331
protein. 


1944 


87 


1593 


gi668159 
2 


Homo sapiens 


HSJ2 mRNA for DnaJ homolog, 
complete cds. 


1567 


93 


1593 


gi 128032 


Homo sapiens 


MRJ gene for a member of the


1567 


93
63




DnaJ protein family, clone
MGC:1152 IMAGE:3346070,
mRNA, complete cds.






1593 


AAW940 
66 


Homo sapiens 


INCY- Human DnaJ-like
protein, HSPJ2. 


1516 


98 


1594 


gil 12308 
58 


Homo sapiens 


mRNA for HMG-box

transcription factor TCF-3,
complete cds. 


3096 


100 


1594 


oft 19363 
8 


Mus musculus


TCF-3 protein 


2934 


95 


1594 


oi14?7QQ 

82 


Xenopus laevis 


T-cell factor XTCF-3 


2195 


77 


1595 


pi163fl7fl 

74 


Homo sapiens 


hypothetical protein FLJ22724, 
clone MGC:16793 
IMAGE:3900548, mRNA, 
complete cds. 


932 


100 


1595 


oil 043 Q9 
25 


Homo sapiens 


cDNA: FLJ22724 fis, clone
HSI14868. 


932 


100 


1595 


gil28543 
96 


Mus musculus 


putative 


618 


68 


1596 


gil00472 
29 


Homo sapiens 


mRNA for KIAA1577 protein, 
partial cds. 


3874 


100 


1596 


gi795928 
3 


Homo sapiens 


mRNA for KIAA1513 protein,
partial cds. 


3002 


74 


1596


gil04404 

X 0 


Homo sapiens 


mRNA for FLJ00044 protein,
partial cds. 


2010 


63 


1597 


AAY131 
17 


Homo sapiens 


GEST Human secreted protein 
encoded by 5' EST SEQ ID NO: 
131. 


268 


100 


1597 


oil 381 £4 

07 


Sulfolobus
solfataricus 


Dehydrogenase, putative 


66 


43 


1597 


gil51400 
82 


Sinorhizobium 
meliloti 


HYPOTHETICAL PROTEIN 


66 


37 


1598 


oil £0437 

20 


Homo sapiens 


mRNA for SOX7 protein. 


2106 


100 


1598 


14 


Homo sapiens 


cDNA FLJ30994 fis, clone
HLUNG1000076, highly similar 
to Mus musculus mRNA for 
mSox7. 


2106 


100 


1598 


gil32791 
64 


Homo sapiens 


Similar to SRY-box containing 
gene 7, clone MGC:10895 
IMAGE:3622936, mRNA,
complete cds. 


2106 


100 


1599


gil76461 
46 


Homo sapiens 


B lymphocyte activation-related 
protein mRNA, complete cds. 


1111 


97 


1599 


gil20062 
23 


Homo sapiens 


NPD017 mRNA, complete cds. 


1111 


97 


1599


gil51267 
45 


Homo sapiens 


hypothetical protein FLJ21374,
clone MGC:5372
IMAGE:3445403, mRNA,
complete cds. 


1105 


97


1600 


gil43288 
79 


Homo sapiens 


CUB domain containing protein 
1 (CDCP1) mRNA, complete
cds. 


4394 


100 


1600 


gi!04395 . 
15 


Homo sapiens


cDNA: FLJ22969 fis, clone
KAT10759. 


4385 


99 


1600 


AAY914


Homo sapiens 


HUMA- Human secreted protein


3633 


38
56




sequence encoded by gene 6 
SEQ ID NO:129.






1601 


gil20528 
46 


Homo sapiens 


mRNA; cDNA 

DKFZp564K2464 (from clone 
DKFZp564K2464); complete 
cds. 


2152 


100 


1601 


gil26527 
23 


Homo sapiens 


clone MGC:3295 
IMAGE:3508204, mRNA, 
complete cds. 


2035 


100 


1601 


AAW790 
88 


Homo sapiens 


GEMY Human secreted protein 
bi!29 2. 


1115 


100 


1602 


gi508161 
0 


Mus musculus


huntington yeast partner C 


4295 


94 


1602 


gil 56368 
98 


Gallus gallus 


formin binding protein 11- 
related protein 


2507 


55 


1602 


gi508160 
8 


Mus musculus


formin binding protein 11


2505 


54 


1603 


gi508161 
0 


Mus musculus


huntington yeast partner C 


4046 


90 


1603 


gi680803
8 


Homo sapiens 


mRNA; cDNA 

DKFZp434H2121 (from clone 
DKFZp434H2121); partial cds. 


2341 


100 


1603 


gil 56368 
98 


Gallus gallus 


formin binding protein 11-
related protein


2339 


52 


1604 


AAB950 
53 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 16855.


680 


100 


1604 


gil04335 
25 


Homo sapiens 


cDNA FLJ12122 fis, clone 
MAMMA1000129. 


680 


100 


1604 


AAE0609 
6 


Homo sapiens 


HUMA- Human gene 56 
encoded secreted protein 
HRABA80, SEQ ID NO: 158.


151 


48 


1605 


AAB943 
09 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14777.


2318 


99 


1605 


gil04345 
01 


Homo sapiens 


cDNA FLJ12800 fis, clone 
NT2RP2002079, weakly similar 
to HISTONE H1, GONADAL.


2318 


99 


1605 


gil73912 
25 


Mus musculus


Similar to hypothetical protein 
FLJ12800


1515 


70 


1606 


AAG645 
02 


Homo sapiens 


UYFU- Human EPIP2 
(endometrial progesterone- 
induced protein). 


1914 


100 


1606 


gil 73902 
89 


Homo sapiens 


Similar to phosphoserine 
aminotransferase, clone 
MGC:9290 IMAGE:3883843,
mRNA, complete cds. 


1914 


100 


1606 


gil34360 
74 


Homo sapiens 


Similar to phosphoserine 
aminotransferase, clone 
MGC:10519 IMAGE:3938160,
mRNA, complete cds. 


1914 


100 


1607 


gi532680 
2 


Homo sapiens 


phosphoserine aminotransferase 
(PSA) mRNA, complete cds. 


1673 


100 


1607 


AAG645 
02 


Homo sapiens 


UYFU- Human EPIP2 
(endometrial progesterone- 
induced protein). 


1616 


87 


1607 


gil73902 
89 


Homo sapiens 


Similar to phosphoserine 
aminotransferase, clone 
MGC:9290 IMAGE:3883843,


1616


87
mRNA, complete cds.






1608 


gi!34477 

01 


Homo sapiens 


cystatin and DUF19 domain- 
containing protein 1 
(CSDUFD1) mRNA, complete 
cds. 


736 


100 


1608 


AAG745 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:5277. 


570 


100 


1608


AAB937 
98 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:13564.


281 


47 


1609


1 Ait il AO 

gi 104402 

i(\ 
j\) 


Homo sapiens 


cDNA: FLJ23514 fis, clone
LNG04628. 


2005 


100 


1609 


gi 128529 
is 


Mus musculus


putative 


1509 


69 


1609 


gil24077 


Arabidopsis 
thaliana 


initiation factor 3a 


138 


22


1610 


AAY649 
94 


Homo sapiens 


GEST Human 5' EST related 
polypeptide SEQ ID NO:1155.


372 


100 


1610 


AAM008 
52 


Homo sapiens 


HYSE- Human bone marrow 
protein, SEQ ID NO: 215. 


69 


39 


1610 


gi332565 


Orf virus 


ORF2 


68 


42 


1611


gi 1043 97 
05 


Homo sapiens 


cDNA: FLJ23121 fis, clone 
LNG07996. 


3137 


100 


1611 


AAB949 
96 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 16625. 


1618 


99 


1611


gi 104332 
57 


Homo sapiens 


cDNA FLJ11889 fis, clone
HEMBA1007251, weakly 
similar to Homo sapiens F-box 
protein FBX29 (FBX29) mRNA. 


1618 


99 


1612


AAB952 
34 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17375. 


3584 


100 


1612 


gil04346 
74 


Homo sapiens 


cDNA FLJ12911 fis, clone
NT2RP2004425, highly similar 
to Mus musculus axotrophin 
mRNA. 


3584 


100 


1612 


gi505203 
i 


Mus musculus 


axotrophin 


2983 


85 


1613 


gi965095 

A 


Mus musculus 


beta-1,6-N-
acetylglucosaminyltransferase B


1254 


73 


1613 


AAV160 
OOjaal 


Homo sapiens 


UOL- Human beta-1,6-N-
acetylglucosaminyltransferase
(IGnT) encoding cDNA. 


1044 


65 


1613 


AAQ892 
ui aal 


Homo sapiens 


LTOL- I-branching enzyme 
cDNA. 


1044 


65 


1614 


gil00473 
11 


Homo sapiens 


mRNA for KIAA1617 protein,
partial cds. 


4792 


100 


1614


3 


Homo sapiens 


RU1 (RU1) mRNA, complete
cds. 


2467 


55


1614 


gi!57790 
95 


Homo sapiens 


Similar to RU1, clone
MGC:3342 IMAGE:3029598,
mRNA, complete cds. 


2467 


55 


1615 


gi468069 
3 


Homo sapiens 


CGI-27 protein mRNA, 
complete cds. 


1414 


92 


1615 


gil75117 
62 


Homo sapiens 


CGI-27 protein, clone 
MGC:31852 IMAGE:4851517,
mRNA, complete cds. 


1414 


92 


1615 


gil70463 


Homo sapiens 


C21orf19-like protein mRNA,


1414 


92
03




complete cds. 






1616 


gil00472 
69 


Homo sapiens 


mRNA for KIAA1597 protein,
partial cds. 


4638 


99 


1616 


gil36470 
69 


Mus musculus 


synaptotagmin-like protein 2-a 
delta 2S-III 


3717 


81 


1616 


gil36470 
09 


Mus musculus 


synaptotagmin-like protein 2-a 


3666 


77 


1617 


AAM937 
72 


Homo sapiens 


HELI- Human polypeptide, SEQ
ID NO: 3778. 


1945 


98 


1617 


ABB1173 
1 


Homo sapiens 


HYSE- Human granuphilin-a 
homologue, SEQ ID NO:2101.


1945 


98 


1617 


gil59302 
18 


Homo sapiens 


synaptotagmin-like 2, clone 
MGC:9588 IMAGE:3887570,
mRNA, complete cds. 


1945 


98 


1618 


gi776873 
9 


Homo sapiens 


genomic DNA, chromosome 
21q, section 89/105. 


3747 


100 


1618 


gil28573 
81 


Mus musculus 


putative 


1233 


78 


1618 


gi488438 
6 


Homo sapiens 


mRNA; cDNA 

DKFZp586F0422 (from clone
DKFZp586F0422); partial cds. 


870 


80 


1619 


AAR225 
46 


Homo sapiens 


NEUR- Truncated Dopamine D1
receptor encoded by pseudogene 
clone GL-39. 


1157 


93 


1619 


AAR210 
82 


Homo sapiens 


NEUR- Dopamine D1 receptor
encoded by clone GL-30. 


1028 


85 


1619 


gi32049 


Homo sapiens 


Human HD5DR gene for D5 
dopamine receptor. 


1028 


85 


1620 


AAE1032 
9 


Homo sapiens 


INCY- Human transporter and 
ion channel-6 (TRICH-6) 
protein. 


1339 


92 


1620 


AAM258 
77 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:1392.


1284 


100


1620 


gil28565 
98 


Mus musculus 


putative 


382 


64


1621 


AAB942 
78 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14707.


1465 


100 


1621 


gi!35434 
48 


Homo sapiens 


hypothetical protein FLJ12750, 
clone MGC:4691 
IMAGE:3533384, mRNA,
complete cds. 


1465 


100 


1621 


gil04344 
28 


Homo sapiens 


cDNA FLJ12750 fis, clone
NT2RP2001168, weakly similar 
to VERPROLIN. 


1465 


100 


1622 


AAY026 
69 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 20 clone 
HMKAH10. 


288 


100 


1622 


gi751042 
1 


Caenorhabditis 
elegans 


hypothetical protein YoUo.l - 
Caenorhabditis elegans > 


OO 


^7 

3 / 


1623 


AAB953 
93 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17745.


795 


100 


1623 


gil04352 
17 


Homo sapiens 


cDNA FLJ13265 fis, clone
OVARC1000937. 


795 


100 


1623 


AAB570 
19 


Homo sapiens 


ROSE/ Human prostate cancer 
antigen protein sequence SEQ 
ID NO: 1597. 


275 


91 


1624 


AAG892 


Homo sapiens 


GEST Human secreted protein, 


936 


100
81




SEQ ID NO: 401. 






1624 


AAU159 
32 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 885.


936 


100 


1624 


gil 04391 
93 


Homo sapiens 


cDNA: FLJ22700 fis, clone
HSI12073.


936 


100 


1625 


gil28579 

64 


Mus musculus 


putative 


1533 


86 


1625 


giI02413 
97 


Homo sapiens 


Human DNA sequence from 
clone RP3-336K20 on 
chromosome 6 Contains parts of 
2 genes for novel proteins, ESTs, 
STSs and GSSs, complete 
sequence. 


964 


100 


1625 


gil 65523 
03 


Homo sapiens 


cDNA FLJ32234 fis, clone
PLACE6004687. 


721 


97 


1626 


gil04381 
58 


Homo sapiens 


cDNA: FLJ21940 fis, clone
HEP04512. 


3307 


99 


1626 


AAG737 
12 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:4476. 


455 


98 


1626 


gil04406 
14 


Oryza sativa


putative ATP-dependent RNA 
helicase 


452 


32 


1627 


gil04378 
37 


Homo sapiens 


cDNA: FLJ21687 fis, clone
COL09385. 


1466 


100 


1627 


gi618017 
8 


Homo sapiens 


transcription factor IGHM 
enhancer 3, JM11 protein, JM4
protein, JM5 protein, T54 
protein, JM10 protein, A4 
differentiation-dependent 
protein, triple LIM domain 
protein 6, and synaptophysin 
genes, complete cds; and L-type 
calcium channel alpha-1 subunit
gene, partial cds, complete 
sequence. 


1182 


98 


1627 


ABB1156 
1 


Homo sapiens 


HYSE- Human JM10 protein 
homologue, SEQ ID NO:1931. 


947 


100 


1628 


AAM252 
27 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:742.


2069 


100 


1628 


AAB853 
62 


Homo sapiens 


INCY- Human phosphatase (PP)
(clone ID 2522707CD1). 


2021 


100 


1628 


gil50805 
05 


Homo sapiens 


Similar to RIKEN cDNA
5730568A12 gene, clone
MGC:17651 IMAGE:3857480,
mRNA, complete cds. 


1907 


100 


1629 


gi656284 
5 


Rattus 
norvegicus 


type A/B hnRNP p40 


1661 


91 




dill 770*7 

9 


Rattus 
norvegicus 


AIF-C1


1654 


91 


1629 


gil 81427 
4 


Homo sapiens 


Human apobec-1 binding protein 
1 mRNA, complete cds. 


1631 


92 


1630 


AAY360 
83 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO. 
468. 


430 


98 


1630 


AAG005 
83 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4664. 


414 


100 


1630 


gi358259 


Cnemidophorus 


NADH dehydrogenase subunit 4 


66 


37
4


tigris 








1631 


AAB102 
84 


Homo sapiens 


GEMY Human fetal placenta 
protein fragment AG 175 2i. 


852 


94 


1631 


gi398346 
3 


Homo sapiens 


microfibril-associated 
glycoprotein 2 (MAGP2) gene, 
exon 10 and complete cds. 


852 


94 


1631 


gi135434
86 


Homo sapiens 


Microfibril-associated 
glycoprotein-2, clone 
MGC:14490 IMAGE:4247343, 
mRNA, complete cds. 


852 


94 


1632 


gi161984
56 


Homo sapiens 


Similar to RIKEN cDNA 
0610040E02 gene, clone 
MGC:17973 IMAGE:3919892,
mRNA, complete cds. 


1339 


99 


1632 


AAY026 
61 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 12 clone 
HFTCU19. 


1142 


99


1632 


gi 167406 
89 


Mus musculus 


RIKEN cDNA 0610040E02 
gene 


1059 


77 


1633 


AAB530 
94 


Homo sapiens 


GETH Human angiogenesis-
associated protein PRO826, SEQ
ID NO:158.


510 


100 


1633 


AAB509 
16 


Homo sapiens 


GETH Human PRO826 protein.


510 


100


1633 


AAB652 
04 


Homo sapiens 


GETH Human PRO826
(UNQ467) protein sequence
SEQ ID NO:201.


510 


100 


1634 


AAB530 
94 


Homo sapiens 


GETH Human angiogenesis-
associated protein PRO826, SEQ
ID NO:158.


413


OJ 


1634 


AAB509 
16 


Homo sapiens 


GETH Human PRO826 protein.


A 1 1 


OJ 


1634 


AAB652 
04 


Homo sapiens 


GETH Human PRO826
(UNQ467) protein sequence
SEQ ID NO:201.






1635 


gi104390
08 


Homo sapiens 


cDNA: FLJ22573 fis, clone 
HSI02387. 


1937 


100 


1635 


AAM878 
76 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO:15469.


lo/' 




1635 


gi133464
3 


Xenopus laevis 


APEG precursor protein 


93 


37 


1636 


gi104390
08 


Homo sapiens 


cDNA: FLJ22573 fis, clone 
HSI02387.


578 


100 


1636 


AAM878 
76 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 15469. 


155 


94 


AOjO 


guy^ijo 
5 


Homo sapiens


T-R PT <ypnp py fin 1 1 and 

complete cds. 


79 


37 


1637 


AAM938 
71 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3980. 


3761 


99 


1637 


gil433II 
31 


Homo sapiens 


scinderin mRNA, complete cds. 


3749 


99 


1637 


AAR804 
81 


Homo sapiens 


NAKA/ Recombinant human 
adseverin. 


3527 


92 


1638 


gi104369
70 


Homo sapiens 


cDNA: FLJ20991 fis, clone 
CAE02103. 


239 


67 



1638


gi646024
0


Deinococcus 
radiodurans 


DNA-binding response regulator 


86 


37 




gl 10/548 
77 


Cyprinus carpio 


Smad4type4 


79 


27 


1639


AAY849 
01 


Homo sapiens 


INCY- A human proliferation 
and apoptosis related protein. 


2821 


95 


1639 


25 


Homo sapiens


mRNA; cDNA 

DKFZp434P2235 (from clone 
DKFZp434P2235); complete 
cds. 


2806 


95 


1639 


gi37330 


Homo sapiens 


H.sapiens mRNA for tre 
oncogene (clone 210). 


2053 


78 


1640 


AAY849 
01 


Homo sapiens 


INCY- A human proliferation 
and apoptosis related protein. 


2846 


96 


1640 


gi120532
25 


Homo sapiens 


mRNA; cDNA 

DKFZp434P2235 (from clone 
DKFZp434P2235); complete 
cds. 


2834 


95 


1640 


gi37330 


Homo sapiens 


H.sapiens mRNA for tre 
oncogene (clone 210). 


2050 


77 


1641


gi139362
85 


Mus musculus


TRH4 


1522 


77 


1641


gi128455
40 


Mus musculus


putative 


1520 


77 


1641 


AAU007 
82 


Homo sapiens 


INCY- Human apoptosis 
protein, APOP-2. 


1345 


98 


1642 


gi 172253 
31 


Homo sapiens 


MY0876G05 protein (MY876) 
mRNA, complete cds. 


1209 


100 


1642


gi120020
42 


Homo sapiens 


brain my048 protein mRNA, 
complete cds. 


1209 


100 


1642 


gi176461
46 


Homo sapiens 


B lymphocyte activation-related 
protein mRNA, complete cds. 


911 


78 


1643


gi 104373 
07 


Homo sapiens 


cDNA: FLJ21240 fis, clone
COL01132. 


2090 


100 


1643 


AAB747 
30 


Homo sapiens 


INCY- Human membrane 
associated protein MEMAP-36. 


856 


42 


1643 


AAY949 
06 


Homo sapiens 


GEMY Human secreted protein 
clone rb649 3 protein sequence 
SEQ ID NO:18.


853 


42 


1644


AAY144 
48 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 38 clone 
HFGAH44. 


316 


100 


1645


gi103344
43 


Homo sapiens 


Human DNA sequence from 
clone RP11-291L22 on 
chromosome 10 Contains the 3' 
end of the HSD17B7
(hydroxysteroid (17-beta) 
dehydrogenase 7) gene, part of a 
gene similar to CDC10 (cell 
division cycle 10, S. cerevisiae, 
homolog), part of a novel gene, a 
novel pseudogene, STSs, GSSs 
and a CpG Island, complete 
sequence. 


256 


100 


1645


gi560623 


human, fetal
lung, mRNA,
2314 nt]. [Homo
sapiens


hCDC10=CDC10 homolog


236


72








1645 


gi286460 
6 


Mus musculus 


CDC10 


236 


72 


1646 


AAB189 
69 


Homo sapiens 


INCY- Amino acid sequence of 
a human transmembrane protein. 


1707 


95 


1646 


AAB495 
04 


Homo sapiens 


HUMA- Clone HNTMH27, 


1370 


94 


1646 


gi163075
93 


Mus musculus 


RIKEN cDNA 2210021G21
gene 


1325 


90 


1647 


gi100472
31 


Homo sapiens 


mRNA for KIAA1578 protein,
partial cds. 


2083 


95 


1647 


gi684119 
4 


Homo sapiens 


HSPC272 


281 


81 


1647 


gi108003
75 


Caenorhabditis 
elegans 


Hypothetical protein Y67D8C.5 


192 


21 


1648 


gi143493
55 


Homo sapiens 


hypothetical protein FLJ23323, 
clone MGC:14873 
IMAGE:3948222, mRNA, 
complete cds. 


1771 


100 


1648 


gi104399
69 


Homo sapiens 


cDNA: FLJ23323 fis, clone
HEP12456. 


1771 


100 


1648 


gi128525
02 


Mus musculus 


putative 


1540 


65 


1649 


AAY413 
60 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 53 clone 
HJPAD75. 


490 


100 


1649 


AAM244 
06 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1931. 


335 


100 


1649 


AAY414 
70 


Homo sapiens 


HUMA- Fragment of human 
secreted protein encoded by 
gene 53. 


84 


100 


1650 


AAY413 
60 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 53 clone 
HJPAD75. 


267 


63 


1650 


AAM244 
06 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1931.


199 


90 


1650 


AAY414 
70 


Homo sapiens 


HUMA- Fragment of human 
secreted protein encoded by 
gene 53. 


84 


100 


1651 


gi165503
12 


Homo sapiens 


cDNA FLJ30993 fis, clone
HLUNG1000064, weakly 
similar to KARYOGAMY 
PROTEIN KAR4. 


2449 


100 


1651 


gi163068
92 


Homo sapiens 


clone MGC:2902 
IMAGE:3010654, mRNA,
complete cds. 


2449 


100 


1651 


gi139385
95 


Homo sapiens 


Similar to CG7818 gene
product, clone MGC:4531
IMAGE:3010654, mRNA, 
complete cds. 


2449 


100 


1652 


AAG671 
58 


Homo sapiens 


MILL- Amino acid sequence of 
a human 20685 transporter 
polypeptide. 


1586 


100 


1652 


gi132741
22 


Homo sapiens 


Human DNA sequence from 
clone RP1-55C23 on 
chromosome 6q22.3-23.3 
Contains the VNN1 and VNN2
genes for vanin 1 and 2, the gene
for vanin 3 (VNN3), a HLF
(hepatic leukemia factor)
pseudogene, a CCNG1 (cyclin
G1) pseudogene, the 3' part of a
novel gene, ESTs, GSSs, and
STSs, complete sequence.


1586


100






1652 


gi157958
17 




unnamed protein product


1586 


100 


1653 


AAB541 
63 


Homo sapiens 


HUMA- Human pancreatic 
cancer antigen protein sequence 
SEQ ID NO:615.


322 


63 


1653 


gi 180886 


Homo sapiens


Human colipase mRNA,
complete cds. 


322 


63 


1653 


gi173897
64 




colipase, pancreatic, clone 
MGC:23801 IMAGE:4251084,
mRNA, complete cds.


322 


63 


1654 


AAU171 
98 


Homo sapiens 


HUMA- Novel signal 
transduction pathway protein,
Seq ID 763. 


1359 


100 


1654 


gi 159874 
93 


Homo sapiens 


tumor endothelial marker 6
(TEM6) mRNA, complete cds.


1359 


100 


1654 


gi 143257 
70 


Homo sapiens


mRNA for thyroid specific PTB
domain protein, complete cds. 


1359 


100 


1655 


AAG011
18 




GEST Human secreted protein,
SEQ ID NO: 5199. 


360 


100 


1655 


gi154189
66


Xenopus laevis




66 


44 


1656


gi104370
31 




bL/ri/\. rL»J/llfj4 lis, Clone 

CAE09073. 


1403 


100 


1656 


gi163071
59 


Mus musculus 


Unknown (protein for
IMAGE:3493084) 


1060 


50 


1656 


git 3 8795 
51 


Mus musculus 


Unknown (protein for
IMAGE'370900*tt 


1060 


50 


1657 


AAM800 
47 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3693. 


820 


91 


1657 


AAM790 
63 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1725. 


820 


89 


1657 


ABB1214
4


Homo sapiens 


HYSE- Human HSPP-29 protein 
homologue, SEQ ID NO:2514. 


820 


91 


1658 


gi581382 
3 


Homo sapiens 


own i5Uiug iriiviN/v, complete 
cds. 


470 


86 


1658 


gi450281 


Homo sapiens 


SUI1 isolog mRNA, complete cds.




86 


1658 


gi142505
20 


Homo sapiens 


putative translation initiation 
factor clone Mnr* 1 $6 R4 

IMAGE:3350981, mRNA,
complete cds. 


470 


86 


1659 


gi168771
87 


Homo sapiens 


clone MGC:17299 
IMAGE:3845811,mRNA, 
complete cds. 


1094 


100 


1659 


AAY129 
52 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted peptide. 


362 


98 


1659 


gi239437 
6 


Cercopithecus 
aethiops 


thromboxane A2 receptor; 
TBXA2R 


94 


29 


1660 


gi104389
46 


Homo sapiens 


cDNA: FLJ22527 fis, clone
HRC12820. 


1017 


91 


1660


gi165493
12


Homo sapiens


cDNA FLJ30149 fis, clone
BRACE2000280, weakly similar
to MNN4 PROTEIN.


565


90






1660 


gi729920 
7 


Drosophila 
melanogaster 


CG16789 gene product


519 


42 


1661 


AAB955 
72 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:18221.


2231 


100 


1661 


gi104359
27 


Homo sapiens 


cDNA FLJ13798 fis, clone
THYRO1000124. 


2231 


100 


1661 


gi128516
20 


Mus musculus 


putative 


1745 


77 


1662 


AAY482 
56 


Homo sapiens 


META- Human prostate cancer- 
associated protein 42. 


242 


75 


1662 


gi382085 
7 


Euglena 
spirogyra 


maturase-like protein 


81 


30


1662 


gi115596
49 


Leuconostoc 
mesenteroides 


dextransucrase Dsrb742 


79 


39 


1663 


gi140177
83 


Homo sapiens 


mRNA for KIAA1783 protein, 
partial cds. 


2287 


100 


1663 


AAU171 
93 


Homo sapiens 


HUMA- Novel signal 
transduction pathway protein, 
Seq ID 758. 


1725 


100 


1663 


AAY578 
95 


Homo sapiens 


INCY- Human transmembrane 
protein HTMPN-19. 


1634 


100 


1664 


gi104369
92 


Homo sapiens 


cDNA:FLJ21007 fis, clone 
CAE03871. 


3436 


100 


1664 


gi137849
43 


Mus musculus 


Unknown (protein for 
MGC:11761)


2930 


84 


1664 


gi178628
68 


Drosophila 
melanogaster 


RE01471p 


308 


28 


1665 


AAG933 
18 


Homo sapiens 


MSG- Human protein HP10505. 


465 


100 


1665 


gi163068
68 


Homo sapiens 


mitochondrial ribosomal protein
S21, clone MGC:2680 
IMAGE:2819715, mRNA, 
complete cds. 


465 


100 


1665 


gi136209
11 


Homo sapiens 


MRPS21 mRNA for 
mitochondrial ribosomal protein
S21, complete cds. 


465 


100 


1666 


AAU276 
52 


Homo sapiens 


ZYMO Human protein 
AFP213641. 


1484 


100 


1666 


gi158624
70 


Homo sapiens 


unnamed protein product 


1484 


100 


1666 


AAE0607 
1 


Homo sapiens 


HUMA- Human gene 31 
encoded secreted protein 
HBJLF01, SEQ ID NO:133.


1284 


100 


1667 


AAW781 
32 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 7 clone 

rlrJbrJlJoD. 


246 


100


1668 


AAM502 
13 


Homo sapiens 


CURA- Human interleukin-1 1- 
like AMF7 C-terminal 
polypeptide. 


2219 


99 


1668 


gi165489
21 


Homo sapiens 


unnamed protein product 


2219 


99 


1668 


AAB948 
03 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 15937. 


1358 


100 


1669 


gi120533
65 


Homo sapiens 


mRNA; cDNA
DKFZp586O0222 (from clone
DKFZp586O0222); complete
cds.


2747


100






1669 


gi936853 
8 


Homo sapiens 


mRNA full length insert cDNA 
clone EUROIMAGE 1987170. 


2687 


98 


1669 


gi996590 
5 


Mus musculus 


synembryn 


2383 


86 


1670


80 


u^«,„ : 


_T"\X T 1 - TIT MA>>> » 

MuL\/\: n^w* us, clone 
LNG09359. 


3757 


100 


1670 


gi128552
47 


Mus musculus 


putative 


1339 


64 


1670 


gi146026
09 


Homo sapiens 


hypothetical protein FLJ23554, 
clone MGC:14866
IMAGE:3946091, mRNA,
complete cds. 


1236 


99 


1671 


gi104417
32 


Homo sapiens 


leucine-rich repeat-containing G 
protein-coupled receptor 6 
(LGR6) mRNA, partial cds. 


4286 


99 


1671 


gi173828
94 


Homo sapiens 


unnamed protein product 


3899 


93 


1671 


gi173828
82 


Mus musculus 


unnamed protein product 


3477 


84 


1672 


AAB941 
18 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14362.


1936 


100 


1672 


gi104341
08 


Homo sapiens 


cDNA FLJ12552 fis, clone
NT2RM4000712, moderately
similar to Homo sapiens
ubiquitin hydrolyzing enzyme I
(UBH1) mRNA.


1936 


100 


1672 


AAB958 
06 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:18793.


1930 


99 


1673 


AAB482 
93 


Homo sapiens 


UYYA Human ZF5 protein. 


1407 


80 


1673 


gi645611 
4 


Mus musculus 


F-box protein FBX16 


1407 


80 


1673 


ABB1559 
0 


Homo sapiens 


HUMA- Human nervous system 
related polypeptide SEQ ID NO 
4247. 


831 


90 


1674 


gi100471
63 


Homo sapiens 


mRNA for KIAA1549 protein, 
partial cds. 


7563 


100 


1674 


AAM791 
57 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1819. 


948 


28 


1674 


AAM801 
41 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3787.


941 


30 


1675 


gi 167686 
54 


Drosophila 
melanogaster 


HL01494p 


911 


39 


1675 


gi729229 
9 


Drosophila 
melanogaster 


CG1271 gene product 


888 


38 


1675 


5 


Thermotoga
maritima 


glycerol kinase 


846 


38 


1676 


gi128528
37 


Mus musculus 


putative 


892 


69 


1676 


gi322823 
7 


Homo sapiens 


UHS KerB gene.


871 


71 


1676 


gi200962 


Mus musculus 


serine 1 ultra high sulfur protein 


827 


65


1677 


gi173901
82 


Homo sapiens 


clone IMAGE:4797244, 
mRNA, partial cds. 


1694 


99 


1677 


AAY761
77


Homo sapiens


HUMA- Human secreted protein
encoded by gene 54.


863


99






1677


AAiUnJ 

06 


Homo sapiens


HUMA- Human secreted protein
encoded by gene 14. 


328 


93 


1678 


gi568941 

n 
1 


Homo sapiens 


mRNA for KIAA1040 protein,


2793 


99 


1678 


gi107286


Drosophila
melanogaster


CG8683 gene product 


2736 


48 


1678 


AAY023 

CI 
Of 


Homo sapiens 


ONOY Polypeptide identified by 
the signal sequence trap method.


2663 


99 


1679 


gi104399
64 


Homo sapiens 


cDNA: FLJ23320 fis, clone 


3605 


99 


1679 


AAG744 
99 


Homo sapiens 


HUMA- Human colon cancer
antigen protein SEQ ID
NO:5263.


623 


95 


1679 


gi 128306 
79 


Drosophila 
helvetica 


putative irunspocKioc 


220 


24 


1680 


gi 1043 82 
77 


Homo sapiens 


-nxiA. 171 linfYJR fie r»lr»nf» 
CUINA. rluJZZU^o IIS, UlUIiC 

HEP08589. 


2454 


100 


1680 


AAB736 
81 


Homo sapiens 


INCY- Human oxidoreductase
protein ORP-14. 


2337 


100 


1680 


gi729865
9 


Drosophila 
melanogaster 


(~\ 1 A*7 1 1 nana r\rr\/"1 1 lf+ 

VjOJv/— l gene prouuoi 


882 


43 


1681 


AAB943 
93 


Homo sapiens 


HELI- Human protein sequence
SEQ ID NO:14957.


3231 


99 


1681 


gi 104347 
65 


Homo sapiens 


cDNA FLJ12973 fis, clone
NT2RP2006023. 


3231 


99 


1681 


gi128604
50 


Mus musculus


putative 


1177 


64 


1682 


gi 104377 
48 


Homo sapiens 


cDNA: FLJ21615 fis, clone 
COL07393. 


876 


100 


1682 


gi131951
51 


Homo sapiens 


transcription factor TZP (TZP) 
mRNA, complete cds. 


362 


47 


1682 


gi102414
61 


Homo sapiens 


Human DNA sequence from 

nlnna DTK 1 11 1 m O /vn 

clone Kr j-i 1 J. J u i jl on 
chromosome 20 Contains the 3'
end of a gene encoding the
hepatocellular carcinoma-
associated antigen 58 (HCA58),
the SCAND1 gene encoding the
domain-containing 1 protein, a
novel gene, 2 CpG islands,
ESTs, STSs and complete
sequence.


362 


47 


1683 


AAB439 
00 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ
ID NO:1345.


483 


87 


1683 


gi J 6849 1 
7 


Homo sapiens 


Bruton's tyrosine kinase (BTK),
alpha-D-galactosidase A (GLA), 
L44-like ribosomal protein 
(L44L) and FTP3 (FTP3) genes, 
complete cds. 


483 


87 


1683 


gi128047
05 


Homo sapiens 


ribosomal protein L44, clone 
MGC:2064 IMAGE:3353669, 
mRNA, complete cds. 


483 


87 


1684 


AAY079 
31 


Homo sapiens 


HUMA- Human secreted protein 
fragment encoded from gene 80. 


213 


100 


1684 


gi593200
3


Mus musculus


neuronal apoptosis inhibitory
protein-rs6


68


50






1684 


gi593200 
8 


Mus musculus


neuronal apoptosis inhibitory 
protein 


68 


50 


1685 


gi140304
07


Mus musculus


keratin-associated protein 16.4 


380 


77 


1685 


gi140304
09 


Mus musculus 


keratin-associated protein 16.5 


309 


64 


1685 


gi140304
01 


Mus musculus 


keratin-associated protein 16.1 


302 


67 


1686 


gi153417
94 


Homo sapiens 


hypothetical protein FLJ12787, 
clone MGC:16870 
IMAGE:3048487, mRNA, 
complete cds. 


1433 


100 


1686 


AAB951 
82 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17250.


1426 


99 


1686 


gi104344
81 


Homo sapiens 


cDNA FLJ12787 fis, clone
NT2RP2001943. 


1426 


99 


1687 


AAY384 
01 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 16. 


230 


88 


1687 


AAB256 
87 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 23 
SEQ ID NO:76.


66 


38 


1688 


gi127449
21 


Homo sapiens 


tethering factor SEC34 (SEC34) 
mRNA, complete cds. 


4223 


100 


1688 


gi 145496 
69 


Homo sapiens 


vesicle docking protein SEC34 
mRNA, complete cds. 


4212 


99 


1688 


gi152915
37 


Drosophila 
melanogaster 


GH25768p 


1691 


43 


1689 


AAB907 
46 


Homo sapiens 


GEMY Human DF989J protein 
sequence SEQ ID 192. 


545 


97 


1689 


AAW644 
71 


Homo sapiens 


GEMY Human secreted protein 
from clone DF989 3. 


545 


97 


1689 


gi282930 
2 


Homo sapiens 


mRNA for Efs1, complete cds.


74 


37 


1690 


AAW136 
58 


Homo sapiens 


UYMC- Human cytidine 
deaminase. 


657 


87 


1690 


gi598149 


Homo sapiens 


cytidine deaminase (CDA) 
mRNA, complete cds. 


657 


87 


1690


gi432179 
3 


Homo sapiens 


cytidine deaminase gene, exon 4 
and complete cds. 


657 


87 


1691 


gi126980
79 


Homo sapiens 


mRNA for KIAA1767 protein, 
partial cds. 


4013 


99 


1691 


AAM255 
78 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO: 1093. 


3955 


99 


1691


AAE0618 
6 


Homo sapiens 


HUMA- Human gene 58 
encoded secreted protein 
fragment, SEQ ID NO:248.


3521 


99 


1692 


0 


Homo sapiens


chaperonin containing t-complex
polypeptide 1, eta subunit (Ccth) 
mRNA, complete cds. 


890 


100 


1692 


gi141983
88 


Mus musculus 


chaperonin subunit 7 (eta) 


879 


98 


1692


gi468504 


Mus musculus 


CCTeta, eta subunit of the 
chaperonin containing TCP-1 
(CCT) 


879 


98 


1693 


gi165520
36 


Homo sapiens 


cDNA FLJ32028 fis, clone
NTONG1000257. 


918 


100


SEQ
ID


Hit ID


Species


Description


S score


Percent
identity


1693 


AAB747 
68 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 13 
SEQ ID NO:77. 




7^ 


1693 


AAB747 
45 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 13 
SEQ ID NO:54.


592 


99 


1694 


gi 100471 
57 


Homo sapiens 


mRNA for KIAA1546 protein,
partial cds. 




100 


1694 


gi126978
97 


Homo sapiens 


mRNA for KIAA167o protein,
partial cds. 


010 




1694 


gi729228 
3 


Drosophila
melanogaster 


CG2083 gene product 


Jj*r 


jj 


1695 


AAG000 
78 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4159. 


lt>4 




1695 


gi237020 
2 


Homo sapiens 


mRNA for procollagen alpha 
2(V).


164


fin 
5U 


1695 


gi179698


Homo sapiens 


Human collagen type V alpha-2 
mRNA, 5' end. 


164


fin 


1696 


gi165525
96 


Homo sapiens 


cDNA FLJ32466 fis, clone 
SKNMC2000065. 


2609 




1696 


gi140178
27 


Homo sapiens 


mRNA for KIAA1805 protein,
partial cds. 


2609 




1696 


gi152079
87 


Macaca 
fascicularis 


hypothetical protein 


2588 


99


1697 


AAB928 
79 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 11475. 


oiy 


10 


1697 


AAM414 
35 


Homo sapiens 


HYSE- Human polypeptide SEQ
ID NO 6366. 


ODy 


n*i — 

13 


1697 


AAM396 
49 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2794. 


639 


73 


1698 


AAM254 
87 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO: 1002. 


586 


100 


1698 


AAG036 
67 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7748. 


jo/ 


99


1698 


gi128500
50 


Mus musculus 


putative 


DD/ 




1699 


AAE0477 
4 


Homo sapiens 


INCY- Human vesicle 
trafficking protein-17 (VETKT- 
17) protein. 


748 


100 


1699 


AAB416 
37 


Homo sapiens 


CURA- Human ORFX
ORF1401 polypeptide sequence
SEQ ID NO:2802.




100


1699 


gi331995 
3 


Homo sapiens 


mRNA for TOM1 protein.


638 


82 


1700


gi141401
00 


Homo sapiens 


OTT gene for one twenty two 
proteins (spliced and unspliced 
forms). 


4797 


99 


1700


A/VDJOl 
11 


Homo sapiens


HELI- Human protein sequence
SEQ ID NO:17093.


4779 


99 


1700 


gi141613
69 


Homo sapiens 


putative RNA-binding motif 
protein 15 short form (RBM15) 
mRNA, complete cds, 
alternatively spliced. 


4779 


99 


1701 


AAB938 
79 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 13792. 


2228 


100 


1701 


AAG667 
10 


Homo sapiens 


BIOD- Human cell growth 
inhibition protein 48. 


2228 


100


1701 


27 


Homo sapiens 


clone MGC:15047
IMAGE:3535485, mRNA,
complete cds. 


2228 


100 


1702 


AAE1178 

A 
V 


Homo sapiens 


INCY- Human kinase (PKIN)- 
14 protein. 


4186 


100 


1702 


gi140418

17 


Homo sapiens 


gklp mRNA for kinase-like
protein splice variant 1,
complete cds. 


4186 


100 


1702 


/irvDUJO 

79 


Homo sapiens 


SUGE- Novel protein kinase, 
SEQ ID NO: 207.


4158 


97 


1703 


A AT T 1 77 
53 


Homo sapiens 


GETH Human PRO5774
polypeptide sequence. 


440 


74 


1703 


34 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


258 


96 


1703 


gin 0 100 

9 


Saccharomyces 
cerevisiae 


Tel2p 


75 


24 


1704


cm 1 04107 

62 


Homo sapiens 


cDNA: FLJ23164 fis, clone
LNG09764. 


3205 


100 


1704 


oi 1 flAAOl 
gl I J 

12 


Homo sapiens 


cDNA: FLJ23577 fis, clone
LNG12640. 


1755 


98 


1704


gi126980
85 


Homo sapiens 


mRNA for KIAA1770 protein, 
partial cds. 


1614 


99 


1705


AAM418
06


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6737.


1194 


77 


1705 


AAM400 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 3165. 


1194 


77 


1705 


ABB1223 
i 


Homo sapiens 


HYSE- Human novel protein, 
SEQ ID NO:2601.


1194 


77 


1706 


gi 152775 
65 


Mus musculus 


RIKEN cDNA 2510039O18
gene 


3091 


91 


1706


n't 1 OO/l/Tn 

32 


Mus musculus 


putative 


3088 


91 


1706


AAB430
28


Homo sapiens 


CURA- Human ORFX 
ORF2792 polypeptide sequence 
SEQ ID NO:5584.


2246 


95 


1707 


gi160411
36 


Macaca 
fascicularis 


hypothetical protein 


702 


92 


1707 


AAB652 

IV/ 


Homo sapiens 


GETH Human PRO1004
(UNQ488) protein sequence
SEQ ID NO:227.


569 


92 


1707 


AAY666
93


Homo sapiens


GETH Membrane-bound protein
PRO1004.


569 


92 


1708


A ARO<< 

36 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:18369.


2429 


100 


1708 


oil fulfil 
57 


Homo sapiens 


cDNA FLJ14009 fis, clone 
Y79AA1002431, weakly similar 
to TRANSDUCIN-LIKE 
ENHANCER PROTEIN 2. 


2429 


100 


1708 


gi503043 
9 


Homo sapiens 


chromosome 19, cosmid 
R26610, complete sequence. 


1569 


80 


1709 


gil57051 
43 


Mus musculus 


suppressor of cytokine signalling 
4 


2071 


86 


1709 


AAM007 
59 


Homo sapiens 


HYSE- Human bone marrow 
protein, SEQ ID NO: 122.


3712 


100 


1709


AAM008
72


Homo sapiens


HYSE- Human bone marrow
protein, SEQ ID NO: 348.


1215


99


WO 02/070539
PCT/US02/05095


1710


gi104387
85


Homo sapiens


cDNA: FLJ22408 fis, clone
HRC08416.


1934


100






1710 


AAM399 
17 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 3062. 


837 


45 


1710 


AAM417 
03 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6634. 


836 


45 


1711 


gi104374
28 


Homo sapiens 


cDNA: FLJ21343 fis, clone
COL02679. 


1836 


100 


1711 


gi120531
45 


Homo sapiens 


mRNA; cDNA 

DKFZp434A0926 (from clone 
DKFZp434A0926); complete 
cds. 


1388 


99 


1711 


gi568953 
7 


Homo sapiens 


mRNA for KIAA1100 protein,
complete cds. 


1261 


68 


1712 


AAG032 
54 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7335.


438 


89 


1712 


gi730038 
3 


Drosophila 
melanogaster 


CG7671 gene product 


366 


27 


1712 


AAB945 
51 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15312.


309 


100 


1713 


gi144956
58 


Homo sapiens 


hypothetical protein FLJ12687, 
clone MGC:15791 
IMAGE:3504468, mRNA, 
complete cds. 


2639 


99 


1713 


AAB942 
41 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 14627. 


2631 


99 


1713 


gi104343
33 


Homo sapiens 


cDNA FLJ12687 fis, clone
NT2RM4002532, weakly similar 
to PROTEIN HOM1. 


2631 


99 


1714 


AAC623 
51_aa1


Homo sapiens 


CELL- Nucleotide sequence of 
lysophosphatidic acid 
acyltransferase-beta. 


834 


100 


1714 


AAA392 
92_aa1


Homo sapiens 


CELL- Human lysophosphatidic 
acid acyltransferase beta 
encoding cDNA. 


834 


100 


1714 


AAB306 
23 


Homo sapiens 


CELL- Amino acid sequence of 
lysophosphatidic acid 
acyltransferase-beta. 


834 


100 


1715 


AAB958 
47 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 18896. 


675 


100 


1715 


gi104367
63 


Homo sapiens 


cDNA FLJ14326 fis, clone
PLACE4000247. 


675 


100 


1715 


gi165529
00 


Homo sapiens 


cDNA FLJ32711 fis, clone
TESTI2000707, weakly similar
to DOUBLESEX PROTEIN,
MALE-SPECIFIC. 


90 


35 


1716 


AAB530 
94 


Homo sapiens 


GETH Human angiogenesis-
associated protein PRO826, SEQ
ID NO:158.


278 


100 


1716 


AAB509 
16 


Homo sapiens 


GETH Human PRO826 protein.


278 


100 


1716 


AAB652 
04 


Homo sapiens 


GETH Human PRO826
(UNQ467) protein sequence
SEQ ID NO:201.


278 


100 


1717 


AAB949 
15 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:16349.


961 


100 


1717 


gi146026
23 


Homo sapiens 


hypothetical protein FLJ11526,
clone MGC:15059
IMAGE:3937610, mRNA,
complete cds.


961


100






1717 


gi 104327 
97 


Homo sapiens 


cDNA FLJ11526 fis, clone
HEMBA1002555, weakly
similar to Homo sapiens 
mSin3A associated polypeptide 


961 


100 


1718 


AAY124 
39 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID NO:470.


453 


94 


1718


oin 4771 

83 


Homo sapiens 


Similar to hypothetical protein 
FLJ20859, clone MGC:12940 
IMAGE:2822127, mRNA, 
complete cds. 


453 


94 


1718 


gi128308
10 


Homo sapiens 


false p73 target protein gene, 
complete cds. 


453 


94 


1719 


gi159289


Homo sapiens 


hypothetical protein FLJ11354,
clone MGC:22961 
IMAGE:4865798, mRNA, 
complete cds. 


3522 


100 


1719


08 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:13299.


3514 


99 


1719


gIIU4JZD 

95 


Homo sapiens 


cDNA FLJ11354 fis, clone
HEMBA1000129, weakly
similar to HYPOTHETICAL 
HELICASE C8A4.08C IN 
CHROMOSOME I. 


3514 


99 


1720 


gi147173
96 


Homo sapiens 


potassium-dependent Na/Ca
exchanger NCKX3 (SLC24A3)
mRNA, partial cds.


3108 


97 


1720 


gi125974
41 


Mus musculus


K+-dependent Na/Ca exchanger 


3027 


94 


1720 


pi 1 7flf)ftt 

97 


Rattus
norvegicus 


potassium-dependent sodium- 
calcium exchanger NCKX3 


3025 


94 


1721 


gi150724
54 


Mus musculus


von Willebrand factor A-related 
protein 


1614 


72 


1721 


AAB425 
81 


Homo sapiens 


CURA- Human ORFX 
ORF2345 polypeptide sequence 
SEQ ID NO:4690.


1358 


93 


1721 


AAB883 
40 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone 
rbbC0053. 


1288 


98 


1722 


AAE0382 
2 


Homo sapiens 


HUMA- Human gene 5 encoded 
secreted protein HETKL27, SEQ 
ID NO: 68. 


935 


100 


1722 


AAB825 
97 




HUMA- Human transmembrane
protein encoded by cDNA clone 
HNALE36. 


935 


100 


1722 


gi157064
37 


Homo sapiens 


clone MGC:17366 
IMAGE:3860009, mRNA,
complete cds. 


935


100


1723 


AAG665
03


Homo sapiens 


BIOD- Human ATP-dependent 
helicase 31.


1441 


100 


1723 


AAM257 
80 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:1295.


1441 


100 


1723 


gi136763
56 


Homo sapiens 


clone MGC:2679 
IMAGE:2819663, mRNA,
complete cds. 


1434 


99


1724 


AAG673 
94 


Homo sapiens 


SUGE- Amino acid sequence of
human protein kinase SGK269. 




100 


1724 


gi 104371 
81 


Homo sapiens 


cDNA: FLJ21140 fis, clone
CAS07548. 




100 


1724 


AAG673 
93 


Homo sapiens 


SUGE- Amino acid sequence of 
human protein Kinase buKzzj. 


952 


46 


1725 


AAF2449 
8 aal 


Homo sapiens 


GEST Human PG-3 coding 
sequence. 


4362 


99 


1725 


AAB354 
01 


Homo sapiens 


GEST Human PG-3. 


4355 


OQ 


1725 


gi133968
64 


Homo sapiens 


unnamed protein product 


4355 


on 


1726 


AAB427 
84 


Homo sapiens 


CURA- Human ORFX 
ORF2548 polypeptide sequence 
SEQ ID NO:5096.


817 


99 


1726 


gi122248
87 


Homo sapiens 


mRNA; cDNA DKFZp547H027 
(from clone DKFZp547H027); 
complete cds. 


817 


99 


1726 


gi104384
59 


Homo sapiens 


cDNA: FLJ22174 fis, clone 
HRC00767. 


817 


99


1727 


gi167405
66 


Homo sapiens 


Similar to hypothetical protein 
FLJ13087, clone MGC:15009 
IMAGE:3536735, mRNA,
complete cds. 


1854 


100


1727 


AAB952 
97 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17525.


807 


95 


1727 


gi104349
41 


Homo sapiens 


cDNA FLJ13087 fis, clone 
NT2RP3002099. 


807 


95 


1728 


AAB940 
75 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 14267. 


3506 


97 


1728 


AAM939 
95 


Homo sapiens 


HELI- Human stomach cancer 
expressed polypeptide SEQ ID 
NO 59. 


3506 


97 


1728 


gi140421
45 


Homo sapiens 


cDNA FLJ14550 fis, clone
NT2RM2001696.


3506 


97 


1729 


AAG008 
97 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4978. 


392 


93 


1729 


AAG008 
98 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4979. 


i in 


1 fin 


1729 


AAB425 
97 


Homo sapiens 


CURA- Human ORFX 
ORF2361 polypeptide sequence 
SEQ ID NO:4722.


130 


100 


1730 


ABB1161
3 


Homo sapiens 


HYSE- Human sorting nexin 7 
homologue, SEQ ID NO:1983.




inn 


1730 


AAG741 
74 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:4938. 


2309 


99 


1730


1 


Homo sapiens


(from clone DKFZp564F052); 
partial cds. 


2148 


99 


1731 


gi134456
60 


Homo sapiens 


MP19 (LIM2) mRNA, complete 
cds, alternatively spliced. 


933 


100 


1731 


gi111775
46 


Homo sapiens 


LIM2 (LIM2) and natural killer 
group 7 (NKG7) genes, 
complete cds. 


933 


100 


1731 


gi134456
58 


Homo sapiens 


MP19ins (LIM2) mRNA,
complete cds, alternatively
spliced.


880


80


1732 


AAB405 
91 


Homo sapiens 


CURA- Human ORFX ORF355 
polypeptide sequence SEQ ID 
NO:710. 


1056 


100 


1732


gi156174
58 


Oryctolagus 
cuniculus


Rab11 family interacting protein


869


48 




AAY294 
88 


Homo sapiens


CORI- Human lung tumour
protein LT86-7 predicted amino 
acid sequence. 


55? 


87 


1733 


gi104377
50 


Homo sapiens 


cDNA: FLJ21616 fis, clone
COL07477. 


1680 


99 


1733 


gi143493
60 


Homo sapiens 


Similar to hypothetical protein
FLJ21616, clone MGC:14941
IMAGE:3947903, mRNA,
complete cds. 


1443 


99 


1733 


gi128054
73 


Mus musculus 


Unknown (protein for 
IMAGE:3490304) 


1410 


97 


1734 


gi104377
50 


Homo sapiens 


cDNA: FLJ21616 fis, clone
COL07477. 


1645 


92 


1734 


gi143493
60 


Homo sapiens 


Similar to hypothetical protein 
FLJ21616, clone MGC:14941
IMAGE:3947903, mRNA,
complete cds. 


1565 


99 


1734 


gi128054
73 


Mus musculus 


Unknown (protein for 
IMAGE:3490304) 


1412 


96 


1735 


AAB950 
36 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:16791.


863 


100 


1735 


gi104334
48 


Homo sapiens 


cDNA FLJ12060 fis, clone
HEMBB1002142. 


863 


100 


1735 


gi153419
04 


Homo sapiens 


clone MGC:21051 
IMAGE:4476886, mRNA, 
complete cds.


751 


99 


1736 


gi140437
83 


Homo sapiens 


clone MGC:14256
IMAGE:4129368, mRNA,
complete cds. 


2232 


100 


1736 


gi104368
57 


Homo sapiens 


cDNA: FLJ20897 fis, clone 
ADKA03573. 


2232 


100 


1736 


gi126537
85 


Homo sapiens 


clone IMAGE:3349601, 
mRNA, partial cds. 


1783 


99 


1737 


gi142498
50 


Homo sapiens 


clone MGC:15062 
IMAGE:2959567, mRNA,
complete cds. 


1535 


99 


1737 


AAM795 
39 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3185. 


1523 


53 


1737 


AAM785 
55 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1217. 


1523 


53 


1738 


gi131833
38 


Homo sapiens 


calneuron 1 (CALN1) mRNA, 
complete cds. 


881 


100 


1738 


gi131833
40 


Mus musculus 


calneuron 1 


880 


99 


1738 


gi767034 
4 


Mus musculus 


unnamed protein product 


880 


99 


1739 


gi499592 
7 


Xenopus laevis 


p33 ringo 


545 


49 


1739 


gi102802
85 


Xenopus sp. 


unnamed protein product 


545 


49 


1739 


gi446879 


Xenopus laevis 


speedy protein 


540 


48


1740 


gi152815
53 


Homo sapiens 


solute carrier family 12 member 
8 (SLC12A8) gene, partial cds. 


1905 


oU 


1740 


gi172249
40 


Mus musculus 


cation-chloride cotransporter 


1761 


74 


1740 


gi104397
94 


Homo sapiens 


cDNA: FLJ23188 fis, clone 
LNG12038. 


1613 


99 


1741 


gi633016 
3 


Homo sapiens 


mRNA for KIAA1161 protein,
partial cds. 


1137 


99 


1741 


gi730133 
3 


Drosophila 
melanogaster 


CG11909 gene product


485 


41 


1741 


gi64404 


Torpedo 
californica 


4-acetamido-4'- 
isothiocyanostilbene-2, 2'- 
disulphonic acid-binding protein 


415 


38 


1742 


AAM678 
57 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 28163. 


1553 


100 


1742 


AAM554 
71 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded 
protein SEQ ID NO: 27576. 


1553 


100 


1742 


gi156208
49 


Homo sapiens 


mRNA for KIAA1895 protein,
partial cds. 


1553 


100 


1743 


gi285264 
0 


Homo sapiens 


clone 23856 unknown mRNA, 
partial cds. 


942 


99 


1743 


gi133252
81 


Homo sapiens 


hypothetical protein MGC2683, 
clone MGC:4313
IMAGE:2819900, mRNA, 
complete cds. 


700 


100 


1743 


gi126544
85 


Homo sapiens 


clone MGC:2683 
IMAGE:2819900, mRNA, 
complete cds. 


700 


100 


1744 


gi139383
07 


Homo sapiens 


clone MGC:15626 
IMAGE:3343642, mRNA,
complete cds. 


526 


62 


1744 


AAB907 
65 


Homo sapiens 


NOJI/ Human shear stress- 
response protein SEQ ID NO: 
30. 


524 


64 


1744 


AAB621 
59 


Homo sapiens 


NEUR- Human arginine-rich 
protein. 


524 


64 


1745 


gi724320 
7 


Homo sapiens 


mRNA for KIAA1413 protein, 
partial cds. 


7273 


99 


1745 


AAB930 
57 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:11862. 


3014 


99 


1745 


gi702286 
1 


Homo sapiens 


cDNA FLJ10680 fis, clone
NT2RP2006573, weakly similar 
to 2',3'-CYCLIC NUCLEOTIDE 
3'-PHOSPHODIESTERASE 


3014 


99 


1746 


gi724320 
7 


Homo sapiens


mRNA for KIAA1413 protein, 
partial cds. 


7245 


98 


1746 


AAB930 
57 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:11862.


3014 


99 


1746 


gi702286 
1 


Homo sapiens 


cDNA FLJ10680 fis, clone
NT2RP2006573, weakly similar 
to 2',3'-CYCLIC NUCLEOTIDE 
3'-PHOSPHODIESTERASE
(EC 3.1.4.37). 


3014 


99


1747 


AAE1033 
0 


Homo sapiens 


INCY- Human transporter and 
ion channel-7 (TRICH-7) 
protein.


1520 


100 


1747 


AAM394 
22 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2567. 


694 


48 


1747 


AAM793 
97 


Homo sapiens 


HYSE- Human protein SEQ ID 

*T/\ *>AJ*» 


694 


48 


1748 


gi104397
44 


Homo sapiens 


cDNA: FLJ23151 fis, clone
LNG09417. 


2362 


100 


1748 


gi128604
56 


Mus musculus 


putative 


1732 


70 


1748 


gi895406 
3 


Arabidopsis 
thaliana 


Contains similarity to a 
transposable element Tip100
protein for transposase from 
Ipomoea purpurea gb|4063769 
and is a member of the 
transmembrane 4 family 
PF|00335. 


308 


24 


1749 


gi175299
87 


Homo sapiens 


oxysterol-binding protein-like 
protein OSBPL3 (OSBPL3) 
mRNA, complete cds. 


4671 


100 


1749 


gi173893
82 


Homo sapiens 


oxysterol binding protein-like 3, 
clone MGC:21526 
IMAGE:3909164, mRNA, 
complete cds. 


4671 


100 


1749 


gi108809
73 


Homo sapiens 


oxysterol binding protein-related 
protein 3 (ORP3) mRNA, 
complete cds.


4671 


100 


1750 


AAB948 
79 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:16094.


2813 


100 


1750 


gi104363
38 


Homo sapiens 


cDNA FLJ13998 fis, clone
Y79AA1002229, weakly similar
to DNA CROSS-LINK REPAIR
PROTEIN PSO2/SNM1.


2813 


100 


1750 


AAB943 
19 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14797.


2330 


99 


1751 


AAB193 
90 


Homo sapiens 


LEXI- Amino acid sequence of a 
human lipoxygenase protein. 


3836 


100 


1751 


gi133781
70 


Homo sapiens 


partial ALOXE3 gene for 
arachidonate lipoxygenase 3, 
exons 1 to 4B (and joined CDS). 


3836 


100 


1751 


gi104410
04 


Homo sapiens 


epidermal lipoxygenase 
(ALOXE3) mRNA, complete 
cds. 


3830 


99 


1752 


AAM932 
41 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 2671. 


664 


95 


1752 


gi150302
7U 


Homo sapiens 


clone MGC:9889 
IMAGE:3868330, mRNA, 
complete cds. 


664 


95 


1752 


AAO102 
85 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 24177. 


486 


78 


1753 


gi456191 


Homo sapiens 


H.sapiens mRNA for rho GDP- 
dissociation Inhibitor 1. 


818 


99


1753 


gi337395 


Homo sapiens 


Human GDP dissociation 
inhibitor mRNA, complete cds. 


818 


99 


1753 


gi285979 


Homo sapiens 


Human rho GDI mRNA, 
complete cds. 


818 


99


1754 


AAQ435 
49_aal 


Homo sapiens 


HARD Gamma subunit of 
human retinal cGMP 
phosphodiesterase DNA. 


472 


100 


1754 


AAR931 
17 


Homo sapiens 


HARD cGMP- 
phosphodiesterase gamma- 
subunit. 


All 


inn 


1754 


AAR384 
84 


Homo sapiens 


HARD Gamma subunit of 
human retinal cGMP 
phosphodiesterase. 


472 


100 


1755 


AAY194 
46 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


590 


98 


1755 


AAY195 
99 


Homo sapiens 


HUMA- SEQ ID NO 317 from 
WO9922243.


590 


no 


1755 


AAY196 
02 


Homo sapiens 


HUMA- SEQ ID NO 320 from 
WO9922243.


137 


100 


1756 


gi104373
93 


Homo sapiens 


cDNA: FLJ21313 fis, clone
COL02176. 


2197 


99 


1756 


gi142503
21 


Homo sapiens 


hypothetical protein FLJ21313,
clone MGC:16820 
IMAGE:4148772, mRNA, 
complete cds. 


2193 


99 


1756 


gi128585
26 


Mus musculus 


putative 


1936 


86 


1757 


gi741435
1 


Homo sapiens 


Hox1.8 gene for homeobox
protein. 


478 


100


1757 


gi278967 
2 


Homo sapiens 


homeobox protein A10
(HOXA10) gene, complete cds.


478 


100 


1757 


gi155592
35 


Homo sapiens 


clone MGC:12859
IMAGE:4107013, mRNA, 
complete cds. 


478 


100 


1758 


gi189772


Homo sapiens 


Human prostaglandin D2 
synthase gene, exons 2 through 6 
and complete cds. 


872 


90 


1758 


gi135435
68 


Homo sapiens 


prostaglandin D2 synthase 
(21 kD, brain), clone 
MGC:14559 IMAGE:4294999,
mRNA, complete cds. 


872 


90 


1758 


gi129638
79 


Homo sapiens 


prostaglandin D synthase 
mRNA, complete cds. 


872 


90 


1759 


gi298250 
8 


Homo sapiens 


mRNA for TCR beta chain, 
specific for Mage 1/HLA-A2.


1296 


92 


1759 


gi300292 
5 


Homo sapiens 


T cell receptor beta chain 
(TCRBV13S1-TCRBJ2S1) 
mRNA, complete cds. 


1286 


92 


1759 


gi36733 


Homo sapiens 


H.sapiens mRNA for T-cell 
antigen receptor beta-chain. 


1047 


75 


1760 


AAM257 

Zo 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO:1243.


667 


99 


1760 


gi155290
64 


Homo sapiens 


sorting nexin 14 (SNX14) 
mRNA, complete cds. 


667 


99 


1760 


gi134772
73 


Homo sapiens 


clone MGC:13217 
IMAGE:3959086, mRNA, 
complete cds. 


667 


99 


1761 


AAB948 
43 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 16018.


1761 


100 


1761 


gi104362
67 


Homo sapiens 


cDNA FLJ13955 fis, clone 
Y79AA1001177. 


1761 


100


1761 


gi163592
95


Mus musculus 


Similar to hypothetical protein 
FLJ13955 


1681 


85 


1762 


AAE0205 
8 


Homo sapiens 


HUMA- Human four disulfide 
core domain (FDCD)-containing 
protein. 


1200 


84 


1762 


gil28353 
76 


Mus musculus 


putative 


918 


68 


1762 


gil26554 
52 


Homo sapiens 


mRNA for keratin associated 
protein 4.7 (KRTAP4.7 gene). 


892 


68 


1763 


AAG674 
85 


Homo sapiens 


LEXI- Amino acid sequence of a 
human transporter protein. 


2391 


99 


1763 


AAE1033
3 


Homo sapiens 


INCY- Human transporter and 
ion channel-10 (TRICH-10)
protein. 


2368 


96


1763 


gi127182
01 


Homo sapiens 


Human DNA sequence from 
clone RP11-305P22 on 
chromosome 20 Contains ESTs, 
STSs, GSSs and 7 CpG islands. 
Contains three novel genes and a 
novel gene for a helix-loop-helix 
DNA binding protein, complete 
sequence. 


2243 


100 


1764 


gi157789
48 


Homo sapiens 


Similar to thiamine 
pyrophosphokinase, clone 
MGC:14885 IMAGE:3622116,
mRNA, complete cds. 


717 


100 


1764 


gi126672
03 


Homo sapiens 


thiamine pyrophosphokinase 
(TPK1) mRNA, complete cds.


717 


100 


1764 


gi122489
15 


Homo sapiens 


hTPK1 mRNA for thiamin
pyrophosphokinase, complete 
cds. 


717 


100 


1765 


gi104388
31 


Homo sapiens 


cDNA: FLJ22439 fis, clone
HRC09236. 


2525 


99 


1765 


AAB422 
37 


Homo sapiens 


CURA- Human ORFX 
ORF2001 polypeptide sequence 
SEQ ID NO:4002.


2084 


99 


1765 


gi135592
84 


Homo sapiens 


Human DNA sequence from 
clone RP5-1 17516 on 
chromosome 20. Contains the 3'
end of the gene for Ras inhibitor 
JC265 (Ras association 
(RalGDS/AF-6) domain 
containing protein), the 5' end of 
the gene encoding N-terminal 
acetyltransferase complex ard1
subunit, ESTs, STSs, GSSs and 
two CpG islands, complete 
sequence. 


829 


36 


1766


8 


Homo sapiens 


Human DNA sequence from 
clone RP5-1024N4 on 
chromosome 1p32.1-33.
Contains the gene for a novel 
Sodium:solute symporter family 
member similar to SLC5A1
(SGLT1), a pseudogene similar 
to part of butyrophilin family 
members, a novel gene, ESTs, 
STSs, GSSs and a putative CpG
island, complete sequence.


2858


100


1766 


gi529056 


Homo sapiens 


Na+/glucose cotransporter
(SGLT1) gene, exon 15 and 
complete cds. 


1955 


55 


1766 


gi364604 
3 


Homo sapiens 


Human DNA sequence from 
PAC 127L4 on chromosome 22. 
Contains last exon (15) of the
SLC5A1 (SGLT1) gene for 
solute carrier family 5 
(sodium/glucose cotransporter) 
member 1 (High Affinity 
Sodium-Glucose Cotransporter). 
Contains ESTs and STSs, 
complete sequence. 


1955 


55 


1767 


AAB734 
85 


Homo sapiens 


MILL- Human aminopeptidase 
22196. 


3657 




1767 


gi125836
12 


Homo sapiens 


mRNA for neurolysin


3657 


99 


1767 


gi139224
67 


Homo sapiens 


unnamed protein product 


3657 


99 


1768 


gi101221
38 


Rattus 
norvegicus 


SynGAP-a 


6651 


99 


1768 


gi293544 
8 


Rattus 
norvegicus 


synaptic ras GTPase-activating 
protein p135 SynGAP


6634 


99 


1768 


gi662458 
7 


Homo sapiens 


Human DNA sequence from 
clone RP4-570F3 on 
chromosome 6. Contains the 5'
end of the gene for the ortholog
of the rat synaptic ras GTPase-
activating protein p135
SynGAP, gene LOC51596 for
divalent cation tolerant protein
CUTA or brain
acetylcholinesterase putative
membrane anchor, the PHF1
gene for PHD finger protein 1,
the KNSL2 gene for kinesin-like
protein 2, the gene for a novel
protein similar to ribosomal
protein L12 (RPL12) and the
gene for a novel protein similar
to lysophospholipase II
(LYPLA2). Contains ESTs,
STSs, GSSs and four CpG
islands, complete sequence. 


6194 


100 


1769 


AAB958 
63 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:18931.


1 f\Af\ 

104U 




1769 


gi173892
oJ 


Homo sapiens 


hypothetical protein FLJ14346,

IMAGE:4415420, mRNA,
complete cds. 


1040 


100 


1769 


gi104367
91 


Homo sapiens 


cDNA FLJ14346 fis, clone
THYRO1001320. 


1040 


100 


1770 


AAB945 
17 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15235.


2593 


99 


1770 


gi104351
22 


Homo sapiens 


cDNA FLJ13203 fis, clone 
NT2RP3004504, highly similar 
to M.musculus mRNA for CPEB
protein.


2593


99


1770 


16 


Homo sapiens 


cytoplasmic polyadenylation 
element-binding protein short 
form (CPEB1) mRNA, complete 
cds. 


2583 


98 


1771 


AAR4RD 

59 


Homo sapiens 


INCY- Human extracellular
signaling molecule (EXC5) (ID
1493630CD1). 


432 


97 


1771 


90 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO. 
475. 


420 


95 


1771 


AAY117 
fix 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID No: 368. 


257 


95 


1772 


AAB930 
75 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:11902.


2936 


99 


1772


gU4U4z4 

15 


Homo sapiens 


cDNA FLJ14710 fis, clone
NT2RP3000632, weakly similar 
to ZINC FINGER PROTEIN 84. 


2936 


99 


1772 


gi165514
29 


Homo sapiens 


cDNA FLJ31551 fis, clone
NT2RI2001083, moderately 
similar to ZINC FINGER 
PROTEIN 84. 


1813 


63 


1773 


AAY768 
43 


Homo sapiens 


INCY- Human proton ATPase 
subunit (HPAS) protein
sequence. 


356 


100 


1773 


AAY885 
90 


Homo sapiens 


SATO/ Human tumour specific 
antigen amino acid sequence. 


356 


100 


1773 


AAW645 


Homo sapiens 


SAGA Human fibrosarcoma cell 
line HT-1080 clone HP00442 
protein. 


356 


100 


1774 


gi124073
85 


Homo sapiens 


tripartite motif protein TRIM5 
isoform gamma (TRIM5)
mRNA, complete cds; 
alternatively spliced. 


1818 


99 


1774 


gi124073
87 


Homo sapiens 


tripartite motif protein TRIM5 
isoform delta (TRIM5) mRNA,
complete cds; alternatively 
spliced. 


1559 


99 


1774 


pi 1 0ACfTX 

83 


Homo sapiens 


tripartite motif protein TRIM5
isoform beta (TRIM5) mRNA,
complete cds; alternatively 
spliced. 


1557 


100 


1775 


gi142506
01 


Homo sapiens


hypothetical protein FLJ22056,
clone MGC:3045 
IMAGE:33430o2, mRNA,
complete cds. 


2600 


99 


1775 


gi104383
15 


Homo sapiens


cDNA: FLJ22056 fis, clone 
HEP09916. 


1747 


100 


1775 


gi729529 
3 


Drosophila 
melanogaster 


CG8633 gene product


597 


31 


1776 


gi104370
59 


Homo sapiens 


cDNA: FLJ21054 fis, clone
CAS00538. 


736 


100 


1776 


gi115580
99 


Mus musculus 


syncoilin 


653 


89 


1776 


gi128354
05 


Mus musculus 


putative 


552 


84 


1777 


gi163076
08


Homo sapiens


hypothetical protein FLJ12270,
clone MGC:10176
lMAOJi. jyUoVK) 1 *, mivlN/Vj
complete cds.


3045


100


1777 


gin 6209 
05 


Homo sapiens 


miviNrV IOi JSJLfVA lyZO pruieiu, 

partial cds. 


3042 


99 


1777 


AAB939 
45 


Homo sapiens 


HELI- Human protein sequence 


2190 


100 


1778 


gi795981 
9 


Homo sapiens 


PRO1430 


299 


100 


1778 


AAO078 
40 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 217.5Z. 


75 


54 


1778 


AAO088 
70 


Homo sapiens 


HYSE- Human polypeptide SEQ 

ID NU 22. IOi. 


73 


43 


1779 


AAG629 
10 


Homo sapiens 


KLEE/ Amino acid sequence of 
a human xylosyltransferase
(XT) isoform XT-II.


4625 


100 


1779 


gi113222
70 


Homo sapiens 


mRNA for xylosyltransferase II
(XT-II gene).


40ZJ 


inn 


1779 


gi152096
53 


Homo sapiens 


human XT-II 


4625 


100 


1780 


gi992997 
3 


Macaca 
fascicularis 


hypothetical protein 


17/o 


OA 


1780 


AAG787 
40 


Homo sapiens 


BODE- Human transcriptional 
elongation factor IIS 24. 


1162 


99 


1780 


ABB1122 
0 


Homo sapiens 


HYSE- Human TFIISh 
nomolo^ue, SbQ It/ NUJ oyu. 


653 


100 


1781


gi152778
46 


Homo sapiens 


Similar to hypothetical protein 
FLJ21522, clone MUL.lool / 
IMAGE:3853503, mRNA, 
complete cds. 


3122 


99 


1781 


AAB643 
72 


Homo sapiens 


INCY- Amino acid sequence of 
human intracellular signalling 
molecule INTRA4. 


3100 


99 


1781 


gi104376
38 


Homo sapiens 


cDNA: FLJ21522 fis, clone 
COL05884. 


2892 


94 


1782 


AAB946 
25 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 15483. 


OARR 


10ft 


1782 


gi104353
87 


Homo sapiens 


cDNA FLJl33co lis, clone 
PLACE1001104, weakly similar
to MYOSIN HEAVY CHAIN,
NON-MUSCLE.




Iflft 

lUw 


1782 


gi165518
77 


Homo sapiens 


cDNA FLJ31903 fis, clone
N 1ZK1 /UU420U, weakly similar
to MYOSIN HEAVY CHAIN,


2429 


92 


1783 


gi105681
12 


Homo sapiens 


ALR-like protein mRNA, 
complete cds. 


17050 


100 


1783 


7 


Homo sapiens


PAC clone RP5-98107 from 
7q34-q36, complete sequence. 


9606 


100 


1783 


AAB422 
30 


Homo sapiens 


CURA- Human ORFX 
ORF1994 polypeptide sequence
SEQ ID NO:3988. 


9583 


99 


1784 


AAB688 
76 


Homo sapiens 


INCY- Human RECAP 
polypeptide, SEQ ID NO: 6. 


1895 


85 


1784 


gi104381
74 


Homo sapiens 


cDNA: FLJ21952 fis, clone 
HEP04970. 


1895 


85 


1784 


gi169242
96


Homo sapiens


hypothetical protein FLJ21952,
clone MGC:2790
IMAGE:2960984, mRNA,
complete cds.


1862


84


1785


AAU160
26


Homo sapiens



HUMA- Human novel secreted 
protein, Seq ID 979. 



1433 



100 



1785 



gi140178
23



Homo sapiens 



mRNA for KIAA1803 protein, 



1433 



100 



1785 



AAG780 
55 



Homo sapiens 



GEAT Human zinc finger 
domain DNA binding protein S 
1-3. 



1168 



99 



1786


AAG004
05


Homo sapiens


GEST Human secreted protein,
SEQ ID NO: 4486.


397


98


1786


AAM906
02


Homo sapiens


HUMA- Human
immune/haematopoietic antigen
SEQ ID NO:18195.


215


56



1786 



AAM431 
14 



Homo sapiens 



HYSE- Human polypeptide SEQ 
ID NO 6045. 



84 



34 



AAG020 
95 



Homo sapiens 



GEST Human secreted protein, 
SEQ ID NO: 6176. 



302 



100 



gi665103 
7 



Mus musculus 
domesticus 



similar to RNA binding protein 



222 



53 



1787


gi128478
83


Mus musculus


putative


222


53


1788


gi979845


Homo sapiens


mRNA for putative capacitative
calcium channel (trp7 gene).


4470


100


1788


gi532685


Mus musculus


receptor-activated calcium
channel


4392


98


1788


gi229590


Homo sapiens


Human putative calcium influx
channel (htrp3) mRNA,
complete cds.


3529


81



AAG023
37


Homo sapiens


GEST Human secreted protein,
SEQ ID NO: 6418.


377


98


1789


AAM008
75


Homo sapiens


HYSE- Human bone marrow
protein, SEQ ID NO: 351.


72


31


1789


gi166669


Mus musculus


alpha-NAC, muscle-specific
form gp220


72


34


1790


AAB733
81


Homo sapiens


NANF- Human gas vesicle
protein homologue hGvpT-b.


2838


99


1790


gi120055
09


Homo sapiens


HT025 mRNA, complete cds.


2838


99



gi173914
58 



Homo sapiens 



clone MGC:2462 
IMAGE:2964737, mRNA, 
complete cds. 



1699 



99 



1791


AAY994
38


Homo sapiens


GETH Human PRO1555
(UNQ763) amino acid sequence
SEQ ID NO:338.


1300


100


1791


AAB240
37


Homo sapiens


GETH Human PRO1555 protein
sequence SEQ ID NO:49.


1300


100


1791


gi126542
33


Homo sapiens


Similar to hypothetical protein,
clone 1-2, clone MGC:5442
IMAGE:3449979, mRNA,
complete cds.


1300


100


1792


AAB638
60


Homo sapiens


LUDW- Human prostate cancer
associated antigen protein
sequence SEQ ID NO:1222.


297


50


1792


AAM767
71


Homo sapiens


MOLE- Human bone marrow
expressed probe encoded protein
SEQ ID NO: 37077.


272


51


1792 


AAM639 
51 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded 
protein SEQ ID NO: 36056. 


272 


51 


1793 


AAB942 
49 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 14645. 


799 


99 


1793 


gi104343
56 


Homo sapiens 


cDNA FLJ12700 fis, clone
NT2RP1000721. 


799 


99 


1793 


ABB1242 
2 


Homo sapiens 


HYSE- Human bone marrow 
expressed protein SEQ ID NO: 
261. 


528 


99 


1794 


AAB651 
89 


Homo sapiens 


GETH Human PRO1013 
(UNQ496) protein sequence 
SEQ ID NO:158.


1858 


99 


1794 


AAB875 
36 


Homo sapiens 


GETH Human PRO1013. 


1858 


99 


1794 


AAY666 
66 


Homo sapiens 


GETH Membrane-bound protein 
PRO1013. 


1858 


99 


1795 


AAB651 
89 


Homo sapiens 


GETH Human PRO1013 
(UNQ496) protein sequence 
SEQ ID NO:158.


1655 


93 


1795 


AAB875 
36 


Homo sapiens 


GETH Human PRO1013. 


1655 


93 


1795 


AAY666 
66 


Homo sapiens 


GETH Membrane-bound protein 
PRO1013. 


1655 


93 


1796 


gi777026 
3 


Homo sapiens 


PRO3077 


620 


100 


1796 


gi158916
32


Agrobacterium 
tumefaciens 


AGR_L_3035p


67 


31 


1797 


AAY362 
33 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 10. 


302 


96 


1797 


gi298307 


Rattus sp. 


beta 3-adrenergic receptor; beta 
3-AR 


83 


38 


1797 


gi241216 


Rattus sp. 


beta 3-adrenergic receptor 


83 


38 


1798 


ABB1203 
7 


Homo sapiens 


HYSE- Human ribosomal
protein L31 homologue, SEQ ID 
NO:2407. 


341 


100 


1798 


AAG038 
94 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7975. 


341 


100 


1798 


AAB437 
07 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO: 1152. 


341 


100


1799 


AAE1099 
5 


Homo sapiens 


INCY- Human lipid metabolism 
enzyme-4 (LME-4) protein.


2242 


99 


1799 


AAB419 
89 


Homo sapiens 


CURA- Human ORFX 
ORF1753 polypeptide sequence 
SEQ ID NO:3506. 


2224 


98 




AAr>>4U 

07 


Homo sapiens


HELI- Human protein sequence
SEQ ID NO:14123.


1212 


99 


1800 


AAY761 
94


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 71.


296 


98 


1800 


AAY131 
96 


Homo sapiens 


GEST Human secreted protein 
encoded by 5' EST SEQ ID NO: 
210. 


291 


96 


1800 


AAY194 
71 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


148 


96 


1801 


ABB1242
2


Homo sapiens


HYSE- Human bone marrow
expressed protein SEQ ID NO:
261.


2841


99


1801 


6 


Homo sapiens


PAC clone RP4-751H13 from 
7q35-qter, complete sequence.


2830 


100 


1801 


gi767049
6 


Mus musculus


unnamed protein product 


2352 


84


1802 


82 


Homo sapiens


GETH Human PRO792
polypeptide sequence. 


1332 


86


1802 


16 


Homo sapiens


GETH Human PRO792 protein
sequence SEQ ID NO:155.


1332 


86 


1802 


AAB240 


Homo sapiens 


GETH Human PRO792 protein
sequence SEQ ID NO:31.


1332 


86 


1803 


AAU123 
82 


Homo sapiens 


GETH Human PRO792
polypeptide sequence. 


1590 


100 


1803


AAB244 
16 


Homo sapiens 


GETH Human PRO792 protein
sequence SEQ ID NO:155.


1590 


100 


1803


55 


Homo sapiens 


GETH Human PRO792 protein
sequence SEQ ID NO:31.


1590 


100 


1804 


AAB437 


Homo sapiens 


HUMA- Human cancer
associated protein sequence SEQ
ID NO:1158.


414 


98 


1804


AAR114
OA


Homo sapiens


CALI- Tissue-plastin.


414


98


1804


gi339848


Homo sapiens


Human T-plastin mRNA, 5' end.


414


98


1805


AAY194
56


Homo sapiens


HUMA- Amino acid sequence
of a human secreted protein.


307


90


1805


gi138825

AA 


Mycobacterium
tuberculosis
CDC1551


conserved hypothetical 
transmembrane protein 


70 


32 


1805


gl 13307] 

4 


Mycobacterium
tuberculosis
H37Rv


hypothetical protein Rv2673 


70 


32 


1806 


gi165528
50 


Homo sapiens 


cDNA FLJ32676 fis, clone
TESTH000168, weakly similar
to PROTEIN PHOSPHATASES
PP1 REGULATORY SUBUNIT
SDS22. 


1265 


100


1806 


gi128599
36 


Mus musculus


putative 


962 


78


1806 


AAM388 
77 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2022. 


200


45 


1807 


AAY413 
90 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 83 clone 
HRAAB15. 


813


100


1807 


AAM399 
90 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 3135. 


581 


44 


1807 


AAM389 
99 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2144. 


581 


44


1808 


gi128361
97 


Mus musculus


putative 


2154 


75 


1808 


AAM823 
98 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO:9991.


750


90


1808 


AAG040 
59 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 8150. 


491 


100


1809


AAB952
52 


Homo sapiens


HELI- Human protein sequence 
SEQ ID NO:17419.


3112 


99


1809 


gi104347
29 


Homo sapiens 


cDNA FLJ12949 fis, clone
N i JsUrJ, vudo 3 o, weakly similar
to TRICHOHYALIN.


3112 


99 


1809 


gi128040
75 


Homo sapiens 


hypothetical protein FLJ12949, 
clone MCjC:1 12ol 
IMAGE:3942403, mRNA, 
complete cds. 


1786 


100 


1810 


AAY195 
87 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


160 


96 


1811 


gi156208
81 


Homo sapiens 


mRNA for KIAA1911 protein,
partial cds. 


1153 


100 


1811 


AAM916 
41 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 19234. 


326 


50 


1811 


gi128458
02 


Mus musculus


putative 


309 


58 


1812 


gi104399
11 


Homo sapiens 


cDNA: FLJ23282 fis, clone 
HEP07626. 


2677 


100 


1812 


AAY996 
53 


Homo sapiens 


INCY- Human GTPase 
associated protein-4. 




i nn ! 


1812 


gi101671
2 


Rattus 
norvegicus 


Fos-related antigen 


zUoU 




1813 


AAY601 
52 


Homo sapiens 


META- Human endometrium 
tumour EST encoded protein 
212. 


379 


100 


1813 


AAY601 
51 


Homo sapiens 


META- Human endometrium 
tumour EST encoded protein 
211. 


83 


73 


1813 


AAB929 
84 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 11704.


71 


32 


1814 


gi146029
95 


Homo sapiens 


hypothetical protein FLJ23375, 
clone MGC:looo4 
IMAGE:4121449, mRNA, 
complete cds. 


2299 


100 


1814 


gi104400
40 


Homo sapiens 


cDNA: FLJ23375 fis, clone 
HEP16206. 


2294 


99 


1814 


AAB429 
73 


Homo sapiens 


CURA- Human ORFX
ORF2737 polypeptide sequence 


/ZD 


OR 


1815 


AAM787 
22 


Homo sapiens 


HYSE- Human protein SEQ ID 


1808 


99 


1815 


gi150539
87 


Homo sapiens 


c-Mpl binding protein mRNA, 
complete cds. 


1439 


100 


1815 


AAE1019 
9 


Homo sapiens 


HYSE- Human bone marrow
derived contig polypeptide, SEQ
\t\ TsjfV £JL 
\\J INV/. O**. 


1 0*51 

l <CJ l 


85 


1816 


gi104404
74 


Homo sapiens 


mRNA for FLJ00074 protein,
partial cds. 


1001 


100 


1816 


gi702096 
9 


Homo sapiens 


cDNA FLJ20703 fis, clone 
KAIA1965. 


546 


63 


1816 


gi133251
42 


Homo sapiens 


DKFZP586I2223 protein, clone 
MGC:10840 IMAGE:3616057,
mRNA, complete cds. 


535 


58 


1817 


gi100472
49 


Homo sapiens 


mRNA for KIAA1587 protein, 
partial cds. 


5002 


100 


1817 


gi136764
86 


Macaca 
fascicularis 


hypothetical protein 


4641 


92


1817 


gii/oDyl 
40 


Mus musculus 


mage-el 


2474 


67 


1818 


AAY725 
96 


Homo sapiens 


ZYMO Human cytokine alpha 
protein-27 (Zalpha27).


3538 


99 


1818 


gi127102
97 


Homo sapiens 


unnamed protein product 


3538 


99 


1818

AUAU 


88 


Homo sapiens 


cDNA: FLJ21478 fis, clone
COL05012. 


2793 


100 


1819 


gi128387
32 


Mus musculus 


putative 


1060 


91 


1819 


gi657221
5 


Homo sapiens 


Human DNA sequence from 
clone RP1-37E16 on 
chromosome 22 Contains the 3' 
part of the gene for a novel VHS 
domain containing protein 
similar to predicted worm and 
human proteins, the SH3BP1
gene for SH3-domain binding 
protein 1, the gene for a novel 
protein similar to 
nitrophenylphosphatases from 
various organisms, the LGALS1 
gene for soluble galactoside- 
binding lectin 1, a novel gene 
and the gene for a novel protein 
similar to mouse RIP3 (P116
Rho-interacting protein) and rat 
RBI 09, complete sequence. 


647 


46 


1819


gi126531
07 


Homo sapiens 


hypothetical protein dJ37E16.5, 
clone MGC:8472
IMAGE:2821743, mRNA, 
complete cds. 


647 


46 




AAB736 
90 


Homo sapiens 


INCY- Human oxidoreductase 
protein ORP-23. 


2502 


100 


1820


gi104382
22 


Homo sapiens 


cDNA: FLJ21988 fis, clone 
HEP06320. 


2502 


100 


1820


gi143367
19 


Homo sapiens 


16p13.3 sequence section 3 of 8.


2431 


90 


1821


glJ 


Homo sapiens 


hHa4 gene for keratin type I.


2029 


99 


1821 


gi372410 

i 

i 


Homo sapiens 


hHa3-I gene for keratin type I. 


1798 


90 


1821 


gi128526

I/O 


Mus musculus 


putative 


1796 


87 


1822 


gi322823 

7 


Homo sapiens 


UHS KerB gene. 


1079 


88 


1822 


AAM794 
04 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3050 


877 


81 


1822 


gi200962 


Mus musculus 


serine 1 ultrahigh sulfur protein


867 


70 


1823 


AAB863 
64 


Homo sapiens 


MEMO- Human ceramidase K2 
protein. 


1459 


100 


1823 


AAB189 
86 


Homo sapiens 


INCY- Amino acid sequence of 
a human transmembrane protein. 


1459 


100 


1823 


gi146694
36 


Homo sapiens 


alkaline phytoceramidase 
(APHC) mRNA, complete cds.


1459 


100 


1824 


AAW616 
01 


Homo sapiens 


INCY- Human metallothionein 
HMBP-I. 


381 


98 
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ID 


Hit ID 


Species
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S score 


Percent 
identity 


1824 


AAB571 
83 


Homo sapiens 


ROSE/ Human prostate cancer 
antigen protein sequence SEQ 
ID NO:1761.


363 


84 


1824 


AA0138 
69 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 27761. 


362 


81 


1825 


gi159288
96 


Homo sapiens 


Similar to synaptotagmin-like 4, 
clone MGC:17313 
IMAGE:3908307, mRNA,
complete cds. 


3496 


100 


1825 


gi173859
44 


Rattus 
norvegicus 


granuphilin A 


3225 


91 


1825 


gi592673 
6 


Mus musculus 


granuphilin-a 


3187 


90 


1826 


gi126531
47 


Homo sapiens 


signal sequence receptor, beta 
(translocon-associated protein 
beta), clone MGC:8566 
IMAGE:2822983,mRNA, 
complete cds. 


864 


99 


1826 


gi452757 


Homo sapiens 


H.sapiens mRNA for TRAP beta 
subunit. 


847 


99 


1826 


gi173688
0 


Homo sapiens 


Human SSR2 mRNA for beta- 
signal sequence receptor, 
complete cds. 


847 


99 


1827 


AAB427 
22 


Homo sapiens 


CURA- Human ORFX 
ORF2486 polypeptide sequence 
SEQ ID NO:4972.


2865 


100 


1827 


AAB952 
63 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17448. 


2864 


99


1827 


gi104347
59 


Homo sapiens 


cDNA FLJ12969 fis, clone 
NT2RP2005841, weakly similar 
to Homo sapiens mRNA for 
ALEX3. 


2864 


99 


1828 


gi798129 
7 


Homo sapiens 


Human DNA sequence from 
clone RP4-534K7 on 
chromosome 1p31.2-32.3.
Contains the PGM1 gene for 
phosphoglucomutase 1, a novel 
gene, ESTs, STSs, GSSs and a 
putative CpG island, complete 
sequence. 


3323 


100 


1828 


gi160418
46 


Homo sapiens 


clone MGC:9635 
IMAGE:3915942,mRNA, 
complete cds. 


3308 


99 


1828 


gi140178
15 


Homo sapiens 


mRNA for KIAA1799 protein, 
partial cds. 


3186 


100 


1829 


gi943803 
3 


Homo sapiens 


ser/arg-rich pre-mRNA splicing 
factor SR-A1 (SR-A1) gene, 
complete cds. 


6833 


100 


1829 


gi104404
02 


Homo sapiens 


mRNA for FLJ00034 protein, 
partial cds. 


6827 


99 


1829 


gi143853
2 


Rattus 
norvegicus 


rA1


5019 


82 


1830 


AAG812 
94 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO: 106. 


1128 


100 


1830 


AAU035 
89 


Homo sapiens 


INCY- Human DNA 
modification protein, DNAMP- 
4. 


1128 


100 
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ID 


Hit ID


Species


Description 


S score


Percent 
identity


1830 


gi173902
02 


Homo sapiens


similar to RIKEN cDNA
2510005D08 gene, clone 
MvjL:z/120 IMAGE:4793121, 
mRNA, complete cds. 


1128 


100 


1831 


AAY652 
82 


Homo sapiens


GEST Human 5' EST related
polypeptide SEQ ID NO: 1443. 


472 


95 


1831


27 


Mus musculus


putative 


444 


68 


1831 


gi651386
7 


Strongylocentrot 
us purpuratus 


tektin A1


292 


40 


1832 


AAY1AR 
37 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


434 


100 


1832 


gi727J60 
< 


Fowlpox virus 


ORFFPV108 Virion envelope 
protein 


72 


28 


1832 


gi333523 


Pigeonpox virus 


major envelope antigen 


72 


28 


1833


A AD 

66 


Homo sapiens 


USSH Neuroendocrine tumor 
dlk. 


2003 


95 


1833 


gi153419

Oil 


Homo sapiens 


clone MGC: 17291 
IMAGE:4347187,mRNA, 
complete cds. 


2003 


95 


1833


gusssyj 
10 


Homo sapiens 


clone MGC:20310
IMAGE:4130556, mRNA,
complete cds.


2000 


94 


1834 


AAM514 

00 


Homo sapiens 


TAKE Human G protein- 
coupled receptor protein TGR5. 


1712 


100 


1834 


AAY865 
40 


Homo sapiens 


HUMA- Human gene 77- 
encoded protein fragment, SEQ 
ID NO:457.


679 


98 


1834


/\f\ i ouZ 

91 


Homo sapiens 


HUMA- Human secreted protein 
HDPRK33, SEQ ID NO:206.


514 


98 




gllODDZS 

98 


Homo sapiens 


cDNA FLJ32467 fis, clone
SKNMC2000097, moderately 
similar to M.musculus mRNA 
for protein HtBC. 


2883 


99 


1835 


94 


Homo sapiens 


HpaII tiny fragments locus 9C,
clone MGC:14943
IMAGE:4054100, mRNA, 
complete cds. 


2517 


100 


1835 


gi104375
55 


Homo sapiens 


cDNA: FLJ21453 fis, clone

OAT f\ACOC 


2517 


100 


1836 


gi724306 
1 


Homo sapiens 


mRNA for KIAA1340 protein, 
partial cds. 


2328 


99 


1836 


3 


Homo sapiens 


mRNA for KIAA0733 protein,
complete cds. 


331 


28 


1836 


gi146029
92 


Homo sapiens 


clone MGC: 16635 
IMAGE:4121528,mRNA, 
complete cds. 


309 


29 


1837 


AAB947 
27 


Homo sapiens 


HELI- Human protein sequence
SEQ ID NO:15753.




yo 


1837 


gi104358
06 


Homo sapiens 


cDNA FLJ13710 fis, clone
PLACE2000373, weakly similar 
to F-SPONDIN PRECURSOR 


2624 


96 


1837 


gi142507
42 


Homo sapiens 


hypothetical protein FLJ13710,
clone MGC:14171
IMAGE:4120678, mRNA,
complete cds.


2473 


98 


1838 


AAB199 


Homo sapiens


INCY- Human oxidoreductase


1900


99
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Hit ID 


Species


Description 


S score 


Percent 
identity 




31 




OXRD-6. 






1838


gil / \J*tH J 

56 


Homo sapiens


unnamed protein product


1900 


99


1838 


AAW677 

j 1 


Homo sapiens 


REGC Human fsh05 gene 
protein product


1528 


100 


1839 


gi104400
14 


Homo sapiens 


cDNA: FLJ23356 fis, clone 
HEP14919. 


1859 


99 


1839 


gi152082
oo 


Macaca 
fascicularis


hypothetical protein 


1798 


96 


1839 


gi128536
oy 


Mus musculus 


putative 


1535 


80 


1840 


gi142505
12 


Homo sapiens 


clone MGC: 15468 
IMAGE:2966921,mRNA, 
complete cds. 


966 


88 


1840 


gi140432
62 


Homo sapiens 


Similar to RIKEN cDNA 
1500026B10 gene, clone 
MGC:15737 IMAGE:3355622, 
mRNA, complete cds. 


966 


88 


1840 


gi128377
54 


Mus musculus 


putative 


710 


68 


1841 


gi767162 
9 


Homo sapiens 


Human DNA sequence from 
clone RP11-145L22 on
chromosome op2 1 .5J.~ii.jl. 
Contains the gene for 
myelin/oligodendrocyte 
glycoprotein (MOG), the gene 
for a novel KRAB box
containing C2H2 type zinc
finger protein, ESTs, STSs,
GSSs and a CpG island, 
complete sequence. 


2247 


90 


1841


„:ai 1 1 OO 

gl4 J 1 1 oz 


Mus musculus 


Zfp-37


674 


39 


1841 


gi144956


Homo sapiens 


zinc finger protein 331; zinc
finger protein 463, clone
MGC:15739 IMAGE:3355780,
mRNA, complete cds. 


444 


29 


1842 


gi104399


Homo sapiens 


cDNA: FLJ23311 fis, clone 


2175 


99 


1842 


ABB1246 
i 


Homo sapiens 


HYSE- Human bone marrow
expressed protein SEQ ID NO:
300.


259 


100 




glJJ*f J IJL 

2 


Arabidopsis
thaliana


putative protein


218 


44 




4 




vacuolar proton pump 116 kDa
accessory subunit (ATP6N1B)
mRNA, complete cds,
alternatively spliced.


4378 


100 


1843 


AAB601 
00 


Homo sapiens 


INCY- Human transport protein 
TPPT-20. 


4331 


99 


1843 


gi169032
13 


Mus musculus 


H-ATPase accessory subunit a4 


3809 


85 


1844 


AAM938 
59 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3953. 


3149 


100 


1844 


gi172249
57 


Homo sapiens 


cationic amino acid transporter 
(SLC7A3) mRNA, complete 
cds. 


3149 


100 


1844 


AAM933 


Homo sapiens 


HELI- Human polypeptide, SEQ


3142 


99 
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85 




ID NO: 2970. 






1845 


gi120530
07 


Homo sapiens 


mRNA; cDNA
DKFZp434D1812 (from clone
DKFZp434D1812); complete 
cds. 


6038 


99 


1845 


gi424019 

5 


Homo sapiens 


mRNA for KIAA0853 protein,
partial cds.


4974 


100 


1845 


AAB670 
47 


Homo sapiens 


INCY- Human immune response 
molecule (IMUN) protein SEQ 
ID NO:1.


3777 


99 


1846 


AAB941 
08 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14340. 


2903 


100 


1846 


AAU045 
57 


Homo sapiens 


GETH Human Stra6 homologue, 
PRO10282.


2903 


100 


1846 


gi135609
66 


Homo sapiens 


STRA6 isoform 1 mRNA, 
complete cds, alternatively 
spliced. 


2903 


100 


1847 


AAB583 
63 


Homo sapiens 


ROSE/ Lung cancer associated 
polypeptide sequence SEQ ID 
701. 


309 


100 


1847 


AAY485 
07 


Homo sapiens 


META- Human breast tumour-
associated protein 52. 


308 


98 


1847 


AAM239 
52 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1477. 


294 


98 


1848 


AAM937 
37 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3705. 


929 


49 


1848 


gi317060
9 


Rattus 
norvegicus 


monocarboxylate transporter 
MCT3 


631 


35 


1848 


gi767044 
6 


Mus musculus 


unnamed protein product 


631 


47 


1849 


AAB953 
59 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17661.


3130 


99 


1849 


gi104350
88 


Homo sapiens 


cDNA FLJ13181 fis, clone
NT2RP3004016, weakly similar 
to TRANSCRIPTION 
INTERMEDIARY FACTOR 1- 
BETA. 


3130 


99 


1849 


gi729735 
7 


Drosophila 
melanogaster 


CG8419 gene product 


746 


29 


1850 


gi104368
13 


Homo sapiens 


cDNA: FLJ20859 fis, clone
ADKA01617. 


2426 


100 


1850 


gi134771
83 


Homo sapiens 


Similar to hypothetical protein 
FLJ20859, clone MGC:12940 
IMAGE:2822127,mRNA, 
complete cds. 


2357 


98 


1850 


gi128308
10


Homo sapiens 


false p73 target protein gene, 
complete cds. 


2239 


99 


1851 


AAM243 
67 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1892. 


546 


100 


1851 


AAY275 
76 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 10. 


394 


96 


1851 


gi30478 


Homo sapiens 


Human mRNA for dopamine 
beta-hydroxylase type b (EC 
1.14.17.1). 


68 


26 


1852 


gi165515
61 


Homo sapiens 


cDNA FLJ31657 fis, clone
NT2RI2004304, moderately 
similar to Homo sapiens NY- 


2859 


92
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SEQ 
ID 


Hit ID 


Species


Description


S score 


Percent 
identity 








REN-2 antigen mRNA.






1852 


AAB366 
26 


Homo sapiens 


INCY- Human FLEXHT-48 
protein sequence SEQ ID 


2056 


67 


1852 


gi128034
69 


Homo sapiens 


high-glucose-regulated protein 
5, clone lvivjv/. / jy 
IMAGE:3139250, mRNA,
complete cds.


2056 


67 


1853 


gi776873 
6 


Homo sapiens 


genomic DNA, chromosome 
ziq } section o// ivd. 


4306 


99 


1853 


gi142457
29 


Homo sapiens 


ANKRD3 mRNA for dual- 
specificity Ser/Thr/Tyr kinase, 
complete cds.


4003 


94 


1853 


gi988671 
1 


Homo sapiens 


mRNA for protein kinase (dik 
gene). 


3999 


94 


1854 


gi104391
29 


Homo sapiens 


cDNA: FLJ22655 fis, clone 


960 


100 


1854 


AAG014 
59 


Homo sapiens 


GEST Human secreted protein, 


668 


100 


1854 


AAU173 
65 


Homo sapiens 


HUMA- Novel signal 
transduction pathway protein,
Seq ID 930.


523 


100 


1855 


gi100471
85 


Homo sapiens 


mRNA for KIAA1560 protein,
partial cds. 


3397 


100 


1855 


gi175124
95 


Mus musculus 


glycerol-3-phosphate 
acyltransferase, mitochondrial 


3371 


93 


1855 


gi193367


Mus musculus 


glycerol-3-phosphate 
acyltransferase 


3363 


93 


1856 


ABB1223 
6 


Homo sapiens 


HYSE- Human eppin-1 
homologue, SEQ ID NO:2606.


472 


100 


1856 


gi139373
34 


Homo sapiens 


Human DNA sequence from 
clone RP3-461P17 on
chromosome 20ql2-13.2. 
Contains two novel genes, gene 
HE4 for Major Epididymis- 
specific protein E4 precursor 
(Epididymis Secretory protein 
E4), RPL5 (60S Ribosomal 
Protein L5), COX6C 
(Cytochrome C Oxidase subunit 
VIC) and HSPD1 (HSP60, 
Mitochondrial Matrix Protein PI 
precursor, Heat Shock Protein 
60, GROEL, HUCHA60) 
pseudogenes, the SPINT3 gene 
for Kunitz type serine protease
inhibitor 3 (HKIB9), two genes
for novel Kunitz/Bovine
pancreatic trypsin inhibitor and
WAP-type (Whey Acidic
Protein) 'four-disulfide core'
domains containing proteins and 
the gene for Eppin-1, -2 and -3. 
Contains ESTs, STSs, GSSs and 
a CpG island, complete 
sequence. 


415 


98 


1856 


gi135917
53 


Oryctolagus 
cuniculus 


eppin 


257 


66 
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• Description 


S score 


Percent 
identity


1857 


gi128558
41 


Mus musculus 


putative 


758 


57 


1857


gi177366
46 


Homo sapiens 


Human DNA sequence from 
clone RP3-341D10 on 
chromosome X Contains a gene 
for a novel protein, part of the 
gene for a protein similar to 
ADP ribosylation factor 3, part 
of a gene similar to HTF9C and 
a CpG island, complete 
sequence. 


424 


100 


1857 


AAM389 
58 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2103.


421 


43 


1858 


gi104395
20 


Homo sapiens 


cDNA: FLJ22973 fis, clone
KAT11042. 


2289 


100 


1858 


gi116123
88 


Homo sapiens 


zinc finger transcription factor 
Pegasus mRNA, complete cds. 


2279 


99 


1858 


AAU161 
42 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 1095. 


1745 


98 


1859 


gi135440
26 


Homo sapiens 


putative zinc finger protein from 
EUROIMAGE 566*589, clone 
MGC:13109 IMAGE:3959436,
mRNA, complete cds. 


1481 


99 


1859 


AAB939 
32 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 13929. 


1474 


99 


1859 


gi104336
47 


Homo sapiens 


cDNA FLJ12222 fis, clone
MAMMA1001105, moderately
similar to OVO PROTEIN.


1474 


99 


1860


AAG786 
15 


Homo sapiens 


SHAN- Human zinc finger 
transcription factor BioZFTF45. 


1760 


89 


1860 


gi136234
31 


Homo sapiens 


clone MGC:13132 
IMAGE:4124255, mRNA, 
complete cds. 


1753 


100 


1860 


gi131118
97 


Homo sapiens 


Similar to KIAA0414 protein, 
clone MGC:2629 
IMAGE:3503643,mRNA, 
complete cds. 


1753 


100 


1861 


AAB622 
01 


Homo sapiens 


RIGE- Cell cycle protein Radh- 
isoform 1. 


3697 


99 


1861 


gi816380
4 


Mus musculus 


putative repair and 
recombination helicase RAD26L 


3215 


89 


1861 


AAB622 
02 


Homo sapiens 


RIGE- Cell cycle protein Radh- 
isoform 2. 


2142 


99 


1862 


gi259856 
5 


Mus musculus 


rab19


1008 


88 


1862 


AAM789 

77


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1639. 


603 


56 


1862 


gi f £yj I z 

7 


Drosophila

melanogaster 


Rab-RP3 gene product 


523 


55 


1863 


gi173840
67 


Homo sapiens 


Human DNA sequence from 
clone RP11-146P21 on
chromosome 10 Contains the 
3'end of a novel gene, a novel 
gene, the 5'end of the gene for 
KIAA0608 and a CpG island, 
complete sequence. 


2467 


99 


1863 


gi140399


Cricetulus 


hypothetical protein 1-2 


2311 


92
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ID 
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S score 


Percent 
identity 




64 


griseus 








1863 


AAB951 
96 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17284.


2183 


99 


1864 


gi126539
87 


Homo sapiens 


cutaneous T-cell lymphoma 
tumor antigen se70-2, clone 
MGC:5291 IMAGE:3451565,
mRNA, complete cds. 


1237 


99 


1864 


gi113856
62 


Homo sapiens 


CTCL tumor antigen se70-2
mRNA, complete cds. 


1237 


99


1864 


gi104369
25 


Homo sapiens 


cDNA: FLJ20957 fis, clone 
ADSE02053. 


1233 


99 


1865 


gi104399
11 


Homo sapiens 


cDNA: FLJ23282 fis, clone
HEP07626. 


2579 


100 


1865 


AAY996 
53 


Homo sapiens 


INCY- Human GTPase 
associated protein-4. 


2544 


100 


1865 


gi101671
2 


Rattus 
norvegicus 


Fos-related antigen 


2256 


84 


1866 


AAB530 
73 


Homo sapiens 


GETH Human angiogenesis- 
associated protein PRO195, SEQ
ID NO:46.


331 


100 


1866 


AAB884 
28 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone 
PSEC0203. 


331 


100 


1866 


AAU123 
07 


Homo sapiens 


GETH Human PRO195
polypeptide sequence. 


331 


100 


1867 


AAB941 
97 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:14532.


4854 


99 


1867 


gi104342
43 


Homo sapiens 


cDNA FLJ12634 fis, clone 
NT2RM4001858, weakly similar
to T-BOX CONTAINING 
PROTEIN TBX6L. 


4854 


99 


1867 


gi669260 
7 


Mus musculus 


MGA protein 


4810 


84 


1868 


AAB957 
79 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 18726. 


1326 


100 


1868 


AAB428
78


Homo sapiens 


CURA- Human ORFX 
ORF2642 polypeptide sequence 
SEQ ID NO:5284.


1326 


100 


1868


gi104366
25 


Homo sapiens 


cDNA FLJ14220 fis, clone
NT2RP3003828. 




1 AA 


1869 


AAY108 
23 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


183 


100 


1869 


gi117158
4 


Plasmodium 
falciparum 


red algal chloroplast 


72 


29 


1869 


gi630468 


Plasmodium 
falciparum 


hypothetical protein 470 - 
Plasmodium falciparum > 


72 


29 


1870 


gi143367
19


Homo sapiens 


16p13.3 sequence section 3 of 8.


1867 


100 


1870 


gi591245
9 


Homo sapiens 


Human DNA sequence from 
clone LA16-380A1 on 
chromosome 16 Contains two 
novel genes, ESTs, an STS, 
GSSs and five putative CpG 
islands, complete sequence. 


1106 


100 


1870 


gi139052
32 


Mus musculus 


Unknown (protein for 
IMAGE:3601186) 


145 


28 


1871 


AAB433 


Homo sapiens 


CURA- Human ORFX 


2463 


97 
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ID 


Hit ID 


Species


Description 


S score 


Percent 
identity 




82 




ORF3146 polypeptide sequence
SEQ ID NO:6292.






1871 


AAW734 
00 




HUMA- Human secreted protein
encoded by Gene No. 4. 


2454 


100 


1871


AAB583 
40 


Homo sapiens 


ROSE/ Lung cancer associated 
polypeptide sequence SEQ ID 


2252 


90 


1872 


AAB569 
24 


Homo sapiens 


ROSE/ Human prostate cancer 
antigen protein sequence SEQ 
ID NO:1502.


2048 


95 


1872 


93 


Homo sapiens 


META- Human endometrium
tumour EST encoded protein 53. 


2048 


95 


1872 


git fUl 

3 


Homo sapiens


NADP+-dependent isocitrate 
dehydrogenase (PICD) mRNA,
complete cds. 


2048 


95 


1873 


73 


Homo sapiens


INCY- Amino acid sequence of 
human intracellular signalling 
molecule INTRA5. 


1875


100 


1873 


AAB949 
37 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 16430.


1546 


99 


1873 


gi152919
19 


Drosophila 
melanogaster 


LD31969p 


1237 


41 


1874


73 


Homo sapiens 


INCY- Amino acid sequence of
human intracellular signalling
molecule INTRA5.


1836 


92 


1874 


AAB949 
37 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 16430.


1546 


99 


1874 


gi152919
19 


Drosophila
melanogaster 


LD31969p 


1209 


39 


1875 


11 


Homo sapiens 


clone MGC:12921
IMAGE:4129897,mRNA, 
complete cds. 


590 


100 


1875 


gi126982
16 


Macaca
fascicularis


hypothetical protein 


589 


99 


1875 


AAM936 
16 


Homo sapiens 


HELI- Human polypeptide, SEQ 

ID NO: 3444.


584 


99 


1876 


gi606368 
8 


Homo sapiens 


ammecr1 gene, exon 1 and
joined CDS (alternative
transcripts).


968 


56 


1876 


gi513948 
2 


Homo sapiens


mRNA for AMMECR1 protein.


968 


56 


1876 


gi889465 
7 


Mus musculus


AMMECR1


964 


53 


1877 


gi104402
18 


Homo sapiens


cl/jna. rLjZjjVt) lis, clone 
LNG03055. 


2913 


99 


1877 


AAY733 
63 


Homo sapiens


iin V/ 1 " ii i km clone Z7oZ 1 74 
protein sequence. 


2110 


100 


1877 


AAU162 
62 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 1215. 


1286 


70 


1878 


AAG671 
51 


Homo sapiens 


INCY- Amino acid sequence of 
a human enzyme. 


1689 


99 


1878 


gi128562
10 


Mus musculus 


putative 


1457 


85 


1878 


gi312821
8 


Arabidopsis 
thaliana 


putative katanin 


874 


56 


1879 


AAB600 
93 


Homo sapiens 


INCY- Human transport protein 
TPPT-13. 


3295 


95 
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ID 
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S score 
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identity 


1879 


gi172237
24 


Homo sapiens 


sodium/glucose cotransporter 
KST1 mRNA, complete cds. 


3295 


95 


1879 


AAF8402 
3 aal 


Homo sapiens 


LEXI- Novel human transporter 
protein (NHP) encoding DNA. 


3289 


95 


1880 


AAB600 
93 


Homo sapiens 


INCY- Human transport protein
TPPT-13. 


3461 


99 


1880 


gi172237
24 


Homo sapiens 


sodium/glucose cotransporter 
KST1 mRNA, complete cds. 


3461 


99 


1880 


AAF8402 
3 aal 


Homo sapiens 


LEXI- Novel human transporter 
protein (NHP) encoding DNA. 


3455 


99 


1881 


gi 128454 
75 


Mus musculus 


putative 


1648 


70 


1881 


gi135433
07


Homo sapiens 


cargo selection protein 
(mannose 6 phosphate receptor 
binding protein), clone 
MGC:11117 IMAGE:3833411,
mRNA, complete cds. 


725 


39 


1881 


AAY672 
40 


Homo sapiens 


INCY- Human adipophilin-like 
protein (HALP) amino acid 
sequence. 


724 


39 


1882 


AAW469 
04 


Homo sapiens 


AS AH A human mutant alanine 
aminotransferase. 


1821 


68 


1882 


gi176309
6 


Homo sapiens 


Human glutamate pyruvate 
transaminase (GPT) gene, 
complete cds. 


1821 


68 


1882 


gi173904
65 


Homo sapiens 


glutamic-pyruvate transaminase 
(alanine aminotransferase), clone 
MGC:17068 IMAGE:4179699,
mRNA, complete cds. 


1821 


68 


1883 


AAD094 
95_aa1


Homo sapiens 


SUGE- Human SGP003 
phosphatase polypeptide 
encoding DNA. 


1161 


100 


1883 


AAE0483 
7 


Homo sapiens 


SUGE- Human SGP003 
phosphatase polypeptide. 


1159 


99 


1883 


AAB186 
67 


Homo sapiens 


INCY- A human regulator of 
intracellular phosphorylation. 


1021 


91 


1884 


gi308942 
7 


Homo sapiens 


SSC6 rearranged T cell receptor 
beta chain (TCRBV17) gene, 
complete cds. 


1089 


69 


1884 


gi300292 
7 


Homo sapiens 


T cell receptor beta chain 
(TCRBV17S1-TCRBJ1S5)
mRNA, complete cds. 


1089 


69 


1884 


gi298250 
8 


Homo sapiens 


mRNA for TCR beta chain, 
specific for Mage 3/HLA-A2. 


1089 


70 


1885 


gi128582
21 


Mus musculus 


putative 


1850 


91 


1885 


gi123141
02 


Homo sapiens 


Human DNA sequence from 
clone RP3-329L24 on 
chromosome 6q22.1-22.33
Contains a gene for a novel 
protein, part of a gene for a 
hypothetical 23.0 KD protein, 
part of a gene for a protein 
(MCM2/3/5 family), ESTs, 
STSs, GSSs and a CpG island, 
complete sequence. 


1405 


100 


1885 


gi438883 


Arabidopsis 


putative DNA replication 


726 


41 
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identity 




2 


thaliana 


licensing factor 






1886 


gi147146
00 


Homo sapiens 


clone IMAGF'^SdlAl 
mRNA, partial cds. 


inoi 
jUUI 


99 


1886 


AAR943 
86 


Homo sapiens 


NEWR- Human neural pMI 

protein marker RR/B. 


till 


29


1886


gi376936

2 


Homo sapiens 


cwiuucj ui'iicurai curiex-i protein 

flD\V»1\ .-Bvt.* .,,1, 

I ^i iv'i / uii^i^ tfOmpictg Qua. 


073 


29 


1887 


gi100472
39 


Homo sapiens 


mRNA for KIAA1582 protein,
partial cds.


7698 


96 


1887 


gi165518
20 


Homo sapiens 


cDNA FLJ31859 fis, clone
NT2RP7001231. 


3076 


100 


1887 


gi795918 
1 




mRNA for KIAA1460 protein,
partial cds. 


2840 


45 


1888 


gi100472
39 


Homo sapiens


mRNA for KIAA1582 protein,
partial cds. 


7800 


99 


1888 


gi165518
20 


Homo sapiens 


cDNA FLJ31859 fis, clone
NT2RP7001231. 


2835 


93 


1888 


gi795918 
1 




mRNA for KIAA1460 protein,
partial cds. 


2744 


44 


1889 


gi100472
39 




mRNA for KIAA1582 protein,
partial cds. 


7372 


95 


1889 


gi795918 
1 


Homo sapiens 


mRNA for KIAA1460 protein, 


2618 


44 


1889 


gi165518
20 


Homo sapiens 


cDNA FLJ31859 fis, clone
NT2RP7001231.


2354 


81 


1890 


gi163072
85 


Homo sapiens 


clone IMAGE:3877337, 
mRNA, partial cds. 


1627 


98 


1890 


gi152080
51 


Macaca
fascicularis


hypothetical protein 


1417 


55 


1890 


AAY949 
18 


Homo sapiens 


GEMY Human secreted protein 
clone dd504_18 protein
sequence SEQ ID NO:42.


1247 


63 


1891 


gi278041 
4 


Homo sapiens 


hBACH mRNA for brain acyl- 
CoA hydrolase, complete cds. 


1742 


100 


1891 


gi169243
33 


Homo sapiens 


cytosolic acyl coenzyme A 
thioester hydrolase, clone
MGC:3126 IMAGE:3507488,
mRNA, complete cds. 


1742 


100 


1891 


AAW748 
96 


Homo sapiens 


HUMA- Human secreted protein 
cuuuucu uy gene ioy cione 
HPTTU11. 


1715 


98 


1892 


AAB688 
74 


Homo sapiens 


INCY- Human RFPAP 

polypeptide, SEQ ID NO: 4. 


1266 


100 


1892 


AAY948 
90 


Homo sapiens 


PROT- Human protein clone
HP02798. 


3266 


100 


1892 


gi144245
18 


Homo sapiens


Similar to RIKEN cDNA

1190004A01 gene, clone 
MGC:13153 IMAGE:4302257, 
mRNA, complete cds. 


1266 


100 


1893 


gi145950
19 


Homo sapiens 


mRNA for keratin 6 irs 
(KRT6IRS gene). 


843 


77 


1893 


gi609207 
5 


Mus musculus 


type II cytokeratin 


836


74 


1893 


gi128345
35 


Mus musculus 


putative 


836 


74 


1894 


gi140291
53 


Homo sapiens


microtubule-associated protein 
GLFND mRNA, complete cds.


2565 


100 






WO 02/070539 



PCT/US02/05095 



SEQ 
ID 


Hit ID 


Species


Description 


S score 


Percent 
identity 


1894 


gi131119
07 


Homo sapiens 


clone MGC:3213
IMAGE:3502614, mRNA, 
complete cds. 


2565 


100 


1894 


gi134477
59 


Homo sapiens 


fibronectin type 3 and SPRY 
domain-containing protein 1 
(FSD1) mRNA, complete cds. 


2562 


99 


1895 


gi144245
91 


Homo sapiens 


hypothetical protein FLJ22127, 
clone MGC:14926
IMAGE:4123948, mRNA, 
complete cds. 


2565 


100 


1895 


gi104380
86 


Homo sapiens 


cDNA: FLJ21886 fis, clone
HEP03022. 


2555 


99 


1895 


gi104384
00 


Homo sapiens 


cDNA: FLJ22127 fis, clone 
HEP19530. 


1566 


100 


1896 


gi151503
06 


Homo sapiens 


glycogenin-interacting protein 3 
mRNA, complete cds. 


1571 


100 


1896 


gi151503
04 


Homo sapiens 


glycogenin-interacting protein 2 
mRNA, complete cds, 
alternatively spliced. 


1571 


100 


1896 


gi151502
98 


Homo sapiens 


glycogenin-interacting protein 1 
mRNA, complete cds. 


1571 


100 



TABLE 3 



SEQ ID 
NO: 


Database 
entry ID 


Description 


Results* 


950 


BL00972 


Ubiquitin carboxyl-terminal hydrolases 
family 2 proteins. 


BL00972A 11.93 2.500e-20 267-285
BL00972D 22.55 5.179e-17 828-853
BL00972E 20.72 8.650e-13 855-877
BL00972C 16.48 7.120e-11 411-426
BL00972B 9.45 7.923e-10 353-363


950 


PR00833 


POLLEN ALLERGEN POA PI 
SIGNATURE 


PR00833H 2.30 8.000e-10 2-17


950 


PR00308 


TYPE I ANTIFREEZE PROTEIN 
SIGNATURE 


PR00308A 5.90 7.671e-09 5-20 
PR00308A 5.90 9.471e-09 4-19 


951 


BL00018 


EF-hand calcium-binding domain proteins. 


BL00018 7.41 5.696e-09 331-344 


951 


PR00180


CELLULAR RETINALDEHYDE-
BINDING PROTEIN SIGNATURE


PR00180C 10.92 8.821e-09 70-92 


952 


BL00018 


EF-hand calcium-binding domain proteins. 


BL00018 7.41 5.696e-09 331-344 


952 


PR00180 


CELLULAR RETINALDEHYDE-
BINDING PROTEIN SIGNATURE


PR00180C 10.92 8.821e-09 70-92 


953 


PR00401 


SH2 DOMAIN SIGNATURE 


PR00401B 12.94 1.000e-08 340- 
351 


954 


PR00401 


SH2 DOMAIN SIGNATURE 


PR00401B 12.94 1.000e-08 367- 
378 


955 


BL00625 


Regulator of chromosome condensation 
(RCC1) proteins. 


BL00625A 16.21 7.787e-16 308-337
BL00625A 16.21 7.369e-15 190-219
BL00625B 17.69 1.514e-13 302-336
BL00625B 17.69 2.286e-13 184-218
BL00625B 17.69 3.957e-13 132-166
BL00625A 16.21 5.690e-13 138-167
BL00625A 16.21 5.731e-11 360-389
BL00625B 17.69 3.333e-10 354-388
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955 


PR00633 


CHROMOSOME CONDENSATION 
REGULATOR RCC1 SIGNATURE 


PR00633A 9.32 6.143e-09 202-219
PR00633H 15.10 6.268e-09 196-218
PR00633F 10.03 6.354e-09 373-388
PR00633G 13.71 7.556e-09 190-209


957 


BL00120 


Lipases, serine proteins. 


BL00120B 11.37 9.486e-12 166-181


957


PR00111


ALPHA/BETA HYDROLASE FOLD
SIGNATURE


PR00111B 10.61 1.176e-09 170-184


963


BL00183


Ubiquitin-conjugating enzymes proteins.


BL00183 28.97 1.329e-10 45-93


966


PR00049


WILM'S TUMOUR PROTEIN
SIGNATURE


PR00049D 0.00 6.034e-09 262-277


967


PR00049


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 6.034e-09 74-89 


968


BL00790 


Receptor tyrosine kinase class V proteins. 


BL00790D 12.41 8.297e-09 804-829


969


BL00790 


Receptor tyrosine kinase class V proteins. 


BL00790D 12.41 8.297e-09 878- 
903 


971 


PR00380 


KINESIN HEAVY CHAIN SIGNATURE 


PR00380D 9.93 2.080e-22 321-343
PR00380A 14.18 1.486e-21 79-101
PR00380B 12.64 6.571e-18 217-235
PR00380C 13.18 6.927e-13 269-288


971 


BL00411 


Kinesin motor domain proteins. 


BL00411H 15.66 8.200e-30 320-351
BL00411G 21.39 6.100e-28 270-312
BL00411C 15.04 7.000e-22 79-101
BL00411F 14.77 1.273e-19 208-233
BL00411E 10.43 7.429e-12 142-161
BL00411A 11.31 4.484e-11 9-24
BL00411B 13.51 1.563e-10 45-62


971 


PD00301 


PROTEIN REPEAT MUSCLE 
CALCIUM-BL 


PD00301A 10.24 6.400e-09 598-609


971 

♦Results inc 


DM01399 

tide in nrrW* an 


VARICELLA-ZOSTER VIRUS GENE 54 
PROTEIN. 


DM01399B 12.42 7.092e-09 1571- 
1583 



SEQ ID 

NO: 


Database 
entry ID 


Description 


Results* 


971 


BL00678 


Trp-Asp (WD) repeat proteins proteins. 


BL00678 9.67 8.800e-10 1350-
1361 BL00678 9.67 7.158e-09 
1629-1640 


971 


BL00502 


Polygalacturonase proteins. 


BL00502A 13.44 7.341e-09 1424- 
1439 


971 


BL00412 


Neuromodulin (GAP-43) proteins. 


BL00412D 16.54 7.796e-09 568- 
619 


971 


BL00226 


Intermediate filaments proteins. 


BL00226B 23.86 8.012e-09 930- 
978 


971 


PR00320 


G-PROTEIN BETA WD-40 REPEAT 
SIGNATURE 


PR00320B 12.19 2.385e-12 1348- 
1363 PR00320C 13.01 1.720e-10
1348-1363 PR00320A 16.74
4.971e-10 1348-1363 PR00320B
12.19 5.886e-10 1544-1559 
PR00320A 16.74 3.415e-09 1544- 
1559 PR00320C 13.01 5.500e-09 
1498-1513 PR00320B 12.19 
8.650e-09 1627-1642 PR00320C 
13.01 9.100e-09 1627-1642
973


BL01242 


Formamidopyrimidine-DNA glycosylase 
proteins. 


BL01242F 17.92 5.300e-11 32-66


975 


PF00651


BTB (also known as BR-C/Ttk) domain 
proteins. 


PF00651 15.00 2.500e-14 46-59


975 


PR00501 


KELCH REPEAT SIGNATURE 


PR00501A 8.25 7.750e-11 371-
385 


976 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 3.898e-09 99-132 


977 


PR00380


KINESIN HEAVY CHAIN SIGNATURE 


PR00380A 14.18 9.250e-25 93-
115 PR00380D 9.93 4.857e-19
302-324 PR00380B 12.64 4.429e-
18 212-230 PR00380C 13.18
1.692e-16 247-266


977 


BL00411


Kinesin motor domain proteins.


BL00411G 21.39 7.750e-32 248-
290 BL00411F 14.77 1.000e-25
203-228 BL00411C 15.04 1.621e-
24 93-115 BL00411H 15.66
1.871e-24 301-332 BL00411E
10.43 6.625e-20 143-162
BL00411A 11.31 4.484e-11 5-20


977 


PF00846 


Hantavirus nucleocapsid protein. 


PF00846H 3.96 9.182e-10 408- 
445 


977 


BL00224 


Clathrin light chain proteins. 


BL00224B 16.94 7.136e-09 363-
416 


978 


BL00107 


Protein kinases ATP-binding region proteins. 


BL00107A 18.39 3.368e-18 36-67


978 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 2.068e-10 36-55 


980 


BL00594 


Aromatic amino acids permeases proteins. 


BL00594A 16.75 9.376e-09 76-
120 


982 


BL00790 


Receptor tyrosine kinase class V proteins. 


BL00790E 29.58 1.111e-32 614-
662 BL00790E 29.58 3.111e-12
668-716 BL00790E 29.58 7.000e-
10 560-608


982 


BL00279 


Membrane attack complex components / 
perforin proteins. 


BL00279E 37.11 7.632e-12 727-
775 BL00279E 37.11 2.047e-09
765-813 


982 


BL01187 


Calcium-binding EGF-like domain proteins 
pattern proteins. 


BL01187B 12.04 4.600e-11 791-
807 BL01187B 12.04 3.571e-10
829-845 BL01187B 12.04 8.714e-
10 753-769 BL01187A 9.98
4.375e-09 812-824 BL01187A
9.98 5.125e-09 774-786


982 


PR00343 


SELECTIN SUPERFAMILY 
COMPLEMENT-BINDING REPEAT 
SIGNATURE 


PR00343C 16.85 5.364e-09 13-32 


982 


PR00764 


COMPLEMENT C9 SIGNATURE 


PR00764F 16.89 8.027e-10 744- 
765 PR00764F 16.89 6.844e-09 
782-803 


982 


PR00010


TYPE II EGF-LIKE SIGNATURE 


PR00010A 11.79 6.192e-11 814-
826 PR00010C 11.16 5.90Ve-lU
758-769 PR00010A 11.79 7.677e-
10 776-788 PR00010C 11.16
1.214e-09 796-807 PR00010C
11.16 8.286e-09 834-845


986 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 4.150e-12 126- 
149 


987 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019A 11.19 6.087e-10 93- 
107 PR00019B 11.36 7.840e-09 
90-104
990


PD01443 


INHIBITOR CALPAIN CALPASTATIN 
REPEAT THIOL PROT. 


PD01443D 8.36 4.670e-09 815- 
837 


991 


BL01101 


Casein kinase II regulatory subunit proteins. 


BL01101A 16.07 1.000e-40 9-54
BL01101B 10.94 9.000e-31 72-97 


991


PR00472 


CASEIN KINASE II REGULATORY
SUBUNIT FAMILY SIGNATURE


PR00472C 12.38 5.154e-28 80- 
102 PR00472A 8.03 7.600e-23 8- 
25 PR00472B 14.84 1.000e-19
25-40 


994 


BL01166 


RNA polymerases beta chain proteins. 


BL01166G 18.10 2.500e-34 824-
866 BL01166H 19.05 9.410e-30
936-986 BL01166D 17.37
4.396e-19 612-642 BL01166E
13.47 8.244e-17 682-706 
BL01166C 12.21 9.357e-12 431- 
456 


995 


BL00518 


Zinc finger, C3HC4 type (RING finger),
proteins. 


BL00518 12.23 7.000e-09 25-34 


996 


BL00038 


Myc-type, 'helix-loop-helix' dimerization
domain proteins. 


BL00038A 13.61 6.625e-11 284-
300 


996


DM00984 


w MYOD MYOBLAST DETERMINATION 
SHORT. 


DM00984B 15.18 3.901e-09 262-
317 


996


BL01009 


Extracellular proteins SCP/Tpx-1/Ag5/PR-
1/Sc7 proteins.


BL01009A 13.75 7.750e-09 260- 
278 


997


BL00905 


GTP1/OBG family proteins. 


BL00905D 15.00 4.214e-10 125-
140 


997 
997 


PR00449 
BL01019 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449C 17.27 3.903e-13 63-86 
PR00449A 13.20 7.750e-10 22-44 


997 
998 


BL01115
BL00107 


ADP-ribosylation factors Family proteins. 
GTP-binding nuclear protein ran proteins.

Protein kinases ATP-binding region proteins. 


BL01019A 13.20 8.624e-10 56-96 
BL01115B 10.81 1.505e-09 102- 
146 

BL00107B 13.31 7.300e-15 64-80


998 
1001 


BL00239 
BL00383 


Receptor tyrosine kinase class II proteins.
Tyrosine specific protein phosphatases 
proteins. 


BL00239E 17.14 2.693e-10 36-86
BL00383E 10.35 5.263e-09 328- 
339 


1003 


BL00383 


Tyrosine specific protein phosphatases 
proteins. 


BL00383E 10.35 5.263e-09 246- 
257 


1004 


BL00383 


Tyrosine specific protein phosphatases
proteins. 


BL00383E 10.35 5.263e-09 328-
339 


1010 
1012 


BL00226 
PR00322 


Intermediate filaments proteins. 
G10 PROTEIN SIGNATURE


BL00226B 23.86 5.919e-09 560- 
608 

PR00322E 6.62 1.720e-10 30-40 


1012 
1013 

1014 


BL00997 
BL00269 

BL01160 


G10 protein. 

Mammalian defensins proteins. 


BL00997C 6.36 3.308e-09 29-39 
BL00269C 16.52 6.786e-26 110- 
139 BL00269A 8.53 2.607e-20 
45-65 BL00269B 19.17 5.500e-17
72-101 


1015 


BL00028 


Kinesin light chain repeat proteins. 
Zinc finger, C2H2 type, domain proteins. 


BL01160B 19.54 8.297e-10 6-60
BL00028 16.07 5.846e-11 476-
493 BL00028 16.07 6.192e-11
989-1006 


1015 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 6.087e-10 473-
487 PR00048A 10.52 1.000e-09 
986-1000 


1015 


PD00066


PROTEIN ZINC-FINGER METAL-BINDI.


PD00066 13.92 6.571e-12 464- 
477 PD00066 13.92 7.000e-12
977-990 PD00066 13.92 4.600e-
09 949-962 PD00066 13.92
7.300e-09 397-410


1019 


BL00289 


Pentaxin family proteins.


BL00289E 18.00 4.375e-13 22-37




PR00895




PR00895G 14.55 4.913e-10 19-31 


1022 


BL00348 


p53 tumor antigen proteins. 


BL00348F 23.19 4.571e-09 140-
183 


1023 


BL00455 


Putative AMP-binding domain proteins. 


BL00455 13.31 6.684e-13 248- 
264 


1023 


PR00154 


AMP-BINDING SIGNATURE 


PR00154A 8.88 7.375e-10 241- 
253 


1026 


BL00421


Transmembrane 4 family proteins. 


BL00421E 20.97 1.851e-09 17-47


1026 


PR00259 


TRANSMEMBRANE FOUR FAMILY 
SIGNATURE 


PR00259D 13.50 7.097e-09 20-47 


1028 


PD01066 


PROTEIN ZINC FINGER ZINC-FINGER 
METAL-BINDING NU. 


PD01066 19.43 2.149e-29 6-45 


1028 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


FDUUUOO XiJyJ. i.oyze-o zoh- 
247 PD00066 13.92 3.400e-14 
150-163 PD00066 13.92 5.800e- 
14 206-219 PD00066 13.92 
8.714e-12 178-191 


1028


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 2.350e-13 102- 
119 BL00028 16.07 5.500e-11
190-207 BL00028 16.07 6.192e-
11 218-235 BL00028 16.07
1.000e-09 134-151 


1028 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 4.750e-14 215- 
229 PR00048A 10.52 5.500e-13

r\f\ 1 t *% TiTl f\f\f\A OA 1 A CO A 1 1 /Ca 

99-113 PR0004oA 10.52 4. Jloe- 
11 131-145 PR00048B6.02 

10.52 1.391e-10 159-173 
PR00048B 6.02 6.625e-10 203- 
213 PR00048A 10.52 4.600e-09 
187-201 PR00048B 6.02 8.579e-
09 147-157


1029 


PR00014


FIBRONECTIN TYPE III REPEAT
SIGNATURE 


PR00014D 12.04 2.059e-10 215-
230 


1029 


BL00790


Receptor tyrosine kinase class V proteins.


BL00790F 15.90 2.519e-09 157-
184 


1032 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 6.786e-26 133- 
162 BL00269A 8.53 2.607e-20 
68-88 BL00269B 19.17 5.500e-17
95-124 


1033 


PD01876 


ANTIGEN MELANOMA-ASSOCIATED

MULTIGENE FAMILY TUM. 


PD01876C 21.73 1.231e-20 75-
128 


1034 


PD02870 


RECEPTOR INTERLEUKIN-1
PRECURSOR. 


Pnn9R70Ti 1 8 X\ R 835e-1 1 326- 
359 


1034 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE


PR00019A 11.19 4.150e-11 112-
126 PR00019B 11.36 8.000e-11
109-123 PR00019B 11.36 8.500e- 
11 184-198 PR00019A 11.19 
6.478e-10 187-201 PR00019A 
11.19 7.333e-09 63-77 


1034 


DM00179 


w KINASE ALPHA ADHESION T-CELL. 


DM00179 13.97 9.609e-09 334- 
344 


1035 


PR00988 


URIDINE KINASE SIGNATURE 


PR00988A 6.39 6.276e-12 358- 
376 


1035 


BL00567 


Phosphoribulokinase proteins. 


BL00567A 10.66 6.459e-11 360-
379


1038 


BL00120 


Lipases, serine proteins. 


BL00120C 12.62 9.053e-12 95-
106 


1038 


PR00825 


VESPID VENOM ALLERGEN 
PHOSPHOLIPASEA1 SIGNATURE 


PR00825B 14.81 1.371e-10 83-
104 


1038 


PR00821 


TRIACYLGLYCEROL LIPASE FAMILY 

SIGNATURE


PR00821E 9.64 4.896e-12 19-38
PR00821P 16.10 1.806e-09 118- 
134 


1044 


BL00027 


'Homeobox' domain proteins. 


BL00027 26.43 5.500e-30 169- 
212 


1044 


BL00032 


'Homeobox' antennapedia-type protein.


BL00032C 11.28 5.179e-16 197-
215 BL00032B 10.83 3.060e-15 
158-197 


1044 


PR00024 


HOMEOBOX SIGNATURE 


PR00024C 7.49 8.071e-13 201- 
211 PR00024B 11.27 7.000e-12 
191-202 


1044 


PR00026 


ENGRAILED HOMEODOMAIN 
SIGNATURE 


PR00026A 7.47 9.710e-12 153- 
171 


1044 


PR00031 


LAMBDA AND OTHER REPRESSOR 
HELIX-TURN-HELIX SIGNATURE 


PR00031B 16.29 4.724e-10 192- 
209 


1044 


PR00025 


HOMEOTIC ANTENNAPEDIA PROTEIN 
SIGNATURE 


PR00025B 11.94 6.434e-10 155-
171 


1048 


BL01128 


Shikimate kinase proteins. 


BL01128A 18.84 8.200e-14 7-41


1048 


BL00211 


ABC transporters family proteins. 


BL00211A 12.23 4.600e-09 9-21


1049 


PR00014 


FIBRONECTIN TYPE III REPEAT
SIGNATURE 


PR00014C 15.44 1.783e-09 211- 
230 PR00014A 8.22 3.045e-09 
373-383 PR00014C 15.44 6.087e- 
09 309-328 


1049 


BL00790 


Receptor tyrosine kinase class V proteins. 


BL00790I 20.01 1.750e-12 642-
673 BL00790I 20.01 6.125e-12
320-351 BL00790I 20.01 6.679e-
09 222-253


1049 


PR00096 


GLUTAMINE AMIDOTRANSFERASE 
SUPERFAMILY SIGNATURE 


PR00096B 9.72 9.827e-09 689- 
701 


1050 


DM00372 


CARCINOEMBRYONIC ANTIGEN 
PRECURSOR AMINO-TERMINAL 
DOMAIN. 


DM00372C 23.69 4.919e-12 67- 
103 


1053 


BL01282 


BIR repeat proteins. 


BL01282B 30.49 1.000e-11 194-
233 


1053 


PR00544 


PROGESTERONE RECEPTOR 
SIGNATURE 


PR00544F 7.76 3.329e-09 18-35 


1057 


PR00464 


E-CLASS P450 GROUP II SIGNATURE 


PR00464A 20.47 9.591e-16 149- 
170 PR00464C 18.84 1.000e-15 
324-353 PR00464D 17.40 6.250e- 
15 353-371 PR00464B 20.41 
1.844e-32 205-224 


1057 


PR00385 


P450 SUPERFAMILY SIGNATURE 


PR00385A 14.97 1.346e-12 335- 
353 PR00385B 10.22 4.130e-11
353-367 


1057 


PR00463 


E-CLASS P450 GROUP I SIGNATURE 


PR00463E 17.37 4.814e-11 344-
371 


1058 


PD02382 


RECEPTOR CHAIN PRECURSOR 
TRANSME. 


PD02382A 17.43 9.321e-09 99- 
115 


1060 


BL00795 


Involucrin proteins. 


BL00795C 17.06 6.442e-10 905-
950 


1060 


BL00422 


Granins proteins. 


BL00422C 16.18 4.255e-10 910- 
938 BL00422C 16.18 4.353e-09
913-941


1060 


PF00992 


Troponin. 


PF00992A 16.67 2.184e-09 900- 
935 PF00992A 16.67 5.382e-09
889-924 


1060 


BL00412 


Neuromodulin (GAP-43) proteins. 


BL00412D 16.54 7.429e-09 883- 
934 


1060 


PF01140 


Matrix protein (MA), p15.


PF01140D 15.54 8.326e-09 903- 
938 


1063 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 6.644e-09 603- 
636 


1063 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 9.643e-10 96-111
PR00049D 0.00 2.525e-09 95-110
PR00049D 0.00 3.898e-09 114- 
129 PR00049D 0.00 7.407e-09 
97-112 


1063




MOLLUSCAN RHODOPSIN C-

TERMINAL TAIL SIGNATURE 


PR00239E 1.58 8.670e-09 100- 
112 


1064 


BL01282 


BIR repeat proteins. 


BL01282B 30.49 1.000e-11 137-
176 


1064 


PR00544 


PROGESTERONE RECEPTOR 

SIGNATURE


PR00544F 7.76 3.329e-09 18-35


1065 


BL01282 


BIR repeat proteins. 


BL01282B 30.49 1.000e-11 187-


1065 


PR00544 


PROGESTERONE RECEPTOR 

SIGNATURE


PR00544F 7.76 3.329e-09 18-35


1066 


BL00218 


Amino acid permeases proteins. 


BL00218D 21.49 7.324e-11 226-
271 BL00218E 23.30 3.475e-09
307-347 


1067 


BL00994 


Bacterial export FHIPEP family proteins. 


BL00994A 15.15 1.086e-09 71-
118 


1068 


PD01066 


PROTEIN ZINC FINGER ZINC-FINGER 
METAL-BINDING NU. 


PD01066 19.43 8.385e-33 6-45 


1068 


BL01030 


RNA polymerases M/ 15 Kd subunits 
proteins. 


BL01030 23.44 7.480e-10 283- 
321 


1068 


BL00466 


TFIIS zinc ribbon domain proteins. 


BL00466 25.88 5.622e-09 283- 
320 


1068 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 2.385e-15 271- 
284 PD00066 13.92 3.077e-15 
243-256 PD00066 13.92 3.077e- 
15 320-333 PD00066 13.92 
3.077e-15 348-361 PD00066
13.92 3.077e-15 376-389
PD00066 13.92 3.077e-15 432-
445 PD00066 13.92 3.077e-15
617-630 PD00066 13.92 3.077e-
15 701-714 PD00066 13.92
7.923e-15 215-228 PD00066
13.92 8.200e-14 589-602
PD00066 13.92 8.800e-14 729-
742 PD00066 13.92 5.714e-12
542-555 PD00066 13.92 9.571e-
12 561-574 PD00066 13.92
3.739e-11 404-417 PD00066
13.92 1.692e-10 299-312
PD00066 13.92 2.038e-10 488- 
501 PD00066 13.92 8.615e-10 
645-658 PD00066 13.92 6.700e- 
09 757-770


1068 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 5.950e-13 388-
405 BL00028 16.07 6.400e-13
444-461 BL00028 16.07 1.692e-
11 227-244 BL00028 16.07
3.423e-11 332-349 BL00028
16.07 6.538e-11 685-702

Dr AAA*>0 1 C AT 1 l^l- i i « 

730 BL00028 16.07 7.577e-ll 
573-590 BL00028 16.07 4.300e-
10 601-618 BL00028 16.07
5.500e-10 171-188 BL00028
16.07 5.800e-10 255-272 
BL00028 16.07 7.900e-10 657- 
674 BL00028 16.07 9.700e-10 
526-543 BL00028 16.07 2.029e- 
09 283-300 BL00028 16.07 
3.829e-09 741-758 BL00028 
16.07 6.914e-09 360-377 
BL00028 16.07 7.686e-09 416- 
433 


1068 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 4.750e-14 570- 
584 PR00048A 10.52 2.929e-13 
329-343 PR00048A 10.52 5.500e-
13 224-238 PR00048A 10.52
1.000e-12 598-612 PR00048A
10.52 5.235e-12 252-266
PR00048A 10.52 5.765e-12 441-
455 PR00048A 10.52 7.353e-12
385-399 PR00048A 10.52 7.353e-
12 413-427 PR00048A 10.52
7.353e-12 710-724 PR00048A
10.52 3.842e-11 357-371
PR00048A 10.52 5.263e-11 626-
640 PR00048A 10.52 5.737e-11
280-294 PR00048A 10.52 7.632e-
11 682-696 PR00048A 10.52
8.579e-11 654-668 PR00048B
6.02 2J25e-10 457-467
PR00048A 10.52 2.565e-10 738-
752 PR00048A 10.52 6.087e-10
523-537 PR00048B 6.02 7.188e-
10 698-708 PR00048B 6.02
7.750e-10 726-736 PR00048B
6.02 1.000e-09 240-250
PR00048A 10.52 3.520e-09 308-
322 PR00048A 10.52 4.600e-09
551-565 PR00048A 10.52 8.560e-
09 196-210 PR00048B 6.02
1.000e-08 586-596 


1069 


PD01427 


TRANSFERASE METHYLTRANSFERASE 
BI. 


PD01427B 22.45 1.545e-ll 117- 
158 


1070 


PF00168


C2 domain proteins. 


PF00168C 27.49 1.750e-09 202-
228 


1070 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 2.227e-09 219- 
233 


1075 


PR00962 


LETHAL(2) GIANT LARVAE PROTEIN 
SIGNATURE 


PR00962D 10.40 3.054e-10 178- 
202 


1075 


PR00320 


G-PROTEIN BETA WD-40 REPEAT
SIGNATURE


PR00320A 16.74 2.976e-09 181-
196


1078 


DM01970


0 kw ZK632.12 YDR313C ENDOSOMAL
III.


323 


1078 


BL00678


Trp-Asp (WD) repeat proteins proteins. 




1078 


PR00320 


G-PROTEIN BETA WD-40 REPEAT 

SIGNATURE


PR00320A 16.74 1.931e-11 214-
990 Di>nrw>nR n io i qoq» in 

214-229 PR00320C 13.01 3.880e- 
4.900e-09 257-272 


1079 


PF00774 


Dihydropyridine sensitive L-type calcium 
channel (Beta subuni. 


PF00774D 10.59 8.396e-09 339- 
365 


1079 


BL01013


Oxysterol-binding protein family proteins. 


TIT fit A1 *2T"\ OiC Ol © OOfia AQ fOQ 

632 


1080 


BL00615 


C-type lectin domain proteins. 


BL00615A 16.68 3.160e-11 129-
147 


1081 


PD01066 


PROTEIN ZINC FINGER ZINC-FINGER 
METAL-BINDING NU. 


PD01066 19.43 2.705e-11 47-86


1082 


BL00359 


Ribosomal protein L11 proteins.


BL00359B 23.07 7.462e-24 160- 
201 BL00359C 22.18 6.586e-22 
215-249 BL00359A 20.66 
4.000e-21 124-160 


1082 


BL01108 


Ribosomal protein L24 proteins. 


BL01108A 20.33 1.000e-08 144-
177 


1084 


PD02462 


PROTEIN BOLA TRANSCRIPTION 
REGULATION AC. 


PD02462A 22.48 1.220e-09 104- 
139 


1084 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 7.000e-17 755- 
772 BL00028 16.07 6.625e-15 
699-716 BL00028 16.07 8.412e- 
14 223-240 BL00028 16.07 
8.941e-14 167-184 BL00028 
16.07 6.850e-13 391-408 
BL00028 16.07 1.783e-12 559- 
576 BL00028 16.07 2.957e-12 
307-324 BL00028 16.07 7.652e- 
12 503-520 BL00028 16.07 
7.652e-12 811-828 BL00028
16.07 8.043e-12 335-352 
BL00028 16.07 1.346e-11 447-
464 BL00028 16.07 2.385e-11
867-884 BL00028 16.07 4.462e-
11 671-688 BL00028 16.07
5.846e-11 587-604 BL00028
16.07 6.192e-11 839-856
BL00028 16.07 6.192e-11 895-
912 BL00028 16.07 8.962e-11
531-548 BL00028 16.07 1.600e- 
10 279-296 BL00028 16.07 
1.600e-10 363-380 BL00028 
lo.u/ o.iwe-iu iii-izs 
BL00028 16.07 6.700e-10 643- 
660 BL00028 16.07 8.500e-10 
251-268 BL00028 16.07 1.771e-
09 783-800 BL00028 16.07
5.886e-09 475-492


1084 


PR00048


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 4.600e-18 696-
710 PR00048A 10.52 5.091e-15
164-178 PR00048A 10.52 6.727e-
15 836-850 PR00048A 10.52
1.000e-14 220-234 PR00048A
10.52 2.500e-14 444-458
PR00048A 10.52 5.500e-14 556-
570 PR00048A 10.52 9.250e-14
388-402 PR00048A 10.52 1.643e-
13 640-654 PR00048A 10.52
3.571e-13 304-318 PR00048A
10.52 3.571e-13 528-542
PR00048A 10.52 8.071e-13 332-
346 PR00048A 10.52 8.071e-13
808-822 PR00048A 10.52 8.071e-
13 864-878 PR00048A 10.52
8.714e-13 500-514 PR00048A
10.52 7.353e-12 892-906
PR00048B 6.02 1.000e-11 292-
302 PR00048B 6.02 1.000e-11
516-526 PR00048B 6.02 1.000e-
11 824-834 PR00048A 10.52
3.842e-11 276-290 PR00048A
10.52 6.684e-11 584-598
PR00048A 10.52 9.053e-11 668-
682 PR00048A 10.52 4.130e-10
360-374 PR00048A 10.52 6.870e-
10 752-766 PR00048B 6.02
1.474e-09 768-778 PR00048B
6.02 3.368e-09 236-246
PR00048B 6.02 3.368e-09 460-
470 PR00048B 6.02 4.789e-09
152-162 PR00048B 6.02 4.789e-
09 376-386 PR00048B 6.02
4.789e-09 600-610 PR00048A
10.52 4.960e-09 108-122
PR00048A 10.52 4.960e-09 248-
262 PR00048B 6.02 6.211e-09
264-274 PR00048B 6.02 6.211e-
09 488-498 PR00048B 6.02
6.211e-09 796-806 PR00048B
6.02 7.632e-09 404-414 
PR00048B 6.02 7.632e-09 712- 
722 PR00048A 10.52 8.920e-09 
780-794 


1084 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI.


PD00066 13.92 1.600e-14 295- 
308 PD00066 13.92 1.600e-14
323-336 PD00066 13.92 1.600e-
14 519-532 PD00066 13.92
3.600e-14 547-560 PD00066 
13.92 1.600e-14 827-840 
PD00066 13.92 1.600e-14 855- 

ouo r ju'vuuoo ij.yz. 

351-364 PD00066 13.92 5.200e- 
14 575-588 PD00066 13.92 
8.200e-14 883-896 PD00066 
13.92 9.400e-14 239-252
PD00066 13.92 2.500e-13 155- 
168 PD00066 13.92 2.500e-13 
379-392 PD00066 13.92 2.286e-
12 267-280 PD00066 13.92
2.286e-12 491-504 PD00066
13.92 2.286e-12 799-812
PD00066 13.92 2.714e-12 463- 
476 PD00066 13.92 2.714e-12 
715-728 PD00066 13.92 2.714e- 
12 771-784 PD00066 13.92 
3.571e-12 687-700 PD00066
13.92 7.000e-12 407-420 
PD00066 13.92 1.000e-10 127- 
140 PD00066 13.92 1.000e-08
603-616 


1085 


PR00679 


PROHIBITIN SIGNATURE 


PR00679F 8.03 6.478e-28 178- 
202 PR00679C 14.44 7.077e-22
107-126 PR00679E 12.82 5.171e-
19 153-173 PR00679D 11.91
9.05 3e- I o 130-147 rKlHJO/yU 
6.13 7.882e-17 201-218 

IVDArt^TGO 1 1 CfX 9 AAA** Ifi l 

PK0UO/y£> Ij.OJ z/w-^e-iu o4- 
104 


1086 


PR00245 


OLFACTORY RECEPTOR SIGNATURE 


PKUU245b 12.4U o.Zooe-lZ 4j-0U 


1086 


PR00237 


RHODOPSIN-LIKE GPCR SUPERFAMILY 
SIGNATURE 


PR00237G 19.63 5.814e-09 26-53 


1087 


PD00126 


PROTEIN REPEAT DOMAIN TPR 
NUCLEA. 


PD00126A 22.53 6.885e-10 99- 
120 


1088 


BL01145


Ribosomal protein L34e proteins. 


BL01145A 13.73 1.000e-12 3-45 


1093 


BL00154 


E1-E2 ATPases phosphorylation site proteins. 


BL00154D 12.57 7.387e-09 95- 
106 


1093 


PR00121 


SODIUM/POTASSIUM-TRANSPORTING 
ATPASE SIGNATURE 


PR00121E 13.97 9.444e-09 92- 
111 


1095 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI.


PD00066 13.92 7.923e-15 439- 
452 PD00066 13.92 2.800e-14 
411-424 PD00066 13.92 2.800e- 
14 467-480 PD00066 13.92 
5.800e-14 495-508 PD00066
13.92 5.800e-14 523-536 
PD00066 13.92 8.200e-14 355- 
368 PD00066 13.92 5.500e-13 
579-592 PD00066 13.92 3.143e- 
12 551-564 PD00066 13.92 
4.857e-12 383-396 


1095 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 4.150e-13 367-
384 BL00028 16.07 8.200e-13
563-580 BL00028 16.07 3.348e-
12 479-496 BL00028 16.07
7.652e-12 423-440 BL00028
16.07 8.826e-12 619-636
BL00028 16.07 4.115e-11 451-
468 BL00028 16.07 5.500e-11
395-412 BL00028 16.07 7.231e-
11 591-608 BL00028 16.07
1.600e-10 339-356 BL00028
16.07 2.200e-10 535-552 


1095 


PR00048


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 4.375e-16 560- 
574 PR00048A 10.52 4.214e-13
476-490 PR00048A 10.52 6.143e- 
13 364-378 PR00048B 6.02 
6.400e-13 492-502 PR00048B 
6.02 1.000e-11 352-362
PR00048B 6.02 1.000e-11 408-
418 PR00048B 6.02 1.692e-11
548-558 PR00048A 10.52 1.947e-
11 588-602 PR00048A 10.52
3.842e-11 420-434 PR00048B
6.02 4.462e-11 464-474
PR00048A 10.52 6.684e-11 392-
406 PR00048A 10.52 6.684e-11
448-462 PR00048B 6.02 7.231e-
11 436-446 PR00048A 10.52
5.696e-10 532-546 PR00048B
6.02 5.263e-09 576-586
PR00048A 10.52 6.400e-09 504-
518 PR00048A 10.52 6.760e-09
336-350 PR00048A 10.52 7.120e-
09 616-630


1095 


PD02462 


PROTEIN BOLA TRANSCRIPTION 
REGULATION AC. 


PD02462A 22.48 9.232e-09 472-
507


1097 


BL00649 


G-protein coupled receptors family 2 proteins. 


BL00649C 17.82 9.542e-12 400- 
426 


1097 


PR00249 


SECRETIN-LIKE GPCR SUPERFAMILY 
SIGNATURE 


PR00249C 17.08 8.839e-11 402-
426 PR00249A 15.88 7.851e-09
330-355 


1097 


PR00248 


METABOTROPIC GLUTAMATE GPCR 
SIGNATURE 


PR00248E 17.85 9.366e-09 442- 
465 


1100 


BL00479 


Phorbol esters / diacylglycerol binding 
domain proteins. 


BL00479B 12.57 8.875e-09 886- 
902 


1101 


PR00205 


CADHERIN SIGNATURE 


PR00205B 11.39 5.655e-16 255- 
273 PR00205A 14.73 1.000e-12 
180-196 PR00205B 11.39 4.927e- 
10 475-493 PR00205C 13.65 
9.438e-10 515-530 


1101 


BL00232 


Cadherins extracellular repeat proteins 
domain proteins. 


BL00232B 32.79 1.000e-40 148- 
196 BL00232A 27.72 5.125e-25 
54-87 BL00232B 32.79 4.286e-19 
257-305 BL00232C 10.65 7.429e- 
16 255-273 BL00232B 32.79 
1.500e-10 372-420 BL00232C
10.65 6.538e-10 475-493
BL00232C 10.65 7.632e-09 146-
164 


1103 


BL00122 


Carboxylesterases type-B serine proteins.


BL00122A 12.04 3.152e-15 86- 
107 BL00122D 12.53 7.097e-14 
197-213 BL00122B 16.84 1.346e- 
13 148-1D9 BL00122C 7.91 
9.550e-10 168-179


1105 


BL00018 


EF-hand calcium-binding domain proteins. 


BL00018 7.41 2.800e-10 217-230
BL00018 7.41 8.650e-10 133-146


1105 


PR00450 


RECOVERIN FAMILY SIGNATURE 


PR00450E 12.14 3.438e-15 174- 
193 PR00450B 11.76 5.574e-13 
82-102 PR00450D 16.58 6.714e-
13 152-172 PR00450C 12.22
6.864e-12 128-150 PR00450G 
15.33 6.591e-09 224-245 


1108 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI.


PD00066 13.92 6.143e-12 927-
940 PD00066 13.92 7.000e-09
343-356


1108


BL00028 


Zinc finger, C2H2 type, domain proteins.


BL00028 16.07 3.250e-13 911-
928 BL00028 16.07 9.100e-13
12 939-956 BL00028 16.07
5.500e-11 270-287 BL00028
16.07 4.000e-10 298-315
BL00028 16.07 6.700e-10 968- 
985 BL00028 16.07 9.700e-10 
191-208 BL00028 16.07 9.700e- 
10 355-372 BL00028 16.07 

16.07 7.171e-09 242-259 


1108 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


DT>AAA/H?"Q A AO £ nfiftf»-19 Q?/l- 

934 PR00048A 10.52 4.522e-10 
324-338 PR00048A 10.52 4.913e- 
10 908-922 PR00048A 10.52 
7.652e-10 936-950 PR00048A 
10.52 4.600e-09 965-979 
PR00048A 10.52 7.480e-09 352- 
366 


1108 


DM01970


0 kw ZK632.12 YDR313C ENDOSOMAL 
III.


t*\x *t\ i c\nr\ A O < A Q 1 AAa AO 0 1 A 

DMOly/UA o.o\) y. iuue-uy yiu- 
917 


1109 


DM01857 


5 kw NUCLEOSIDE TRANSPORT 
DEPENDENT NA. 


DM01857B 14.94 6.471e-19 284-
312 DM01 857E lo.9U /.ilie-Io 

7.045e-15 548-575 DM01857C 

15.62 4.500e-14 312-344 

DM01857A 20.25 1.667e-13 207-

372-410 


1112 


DM01840 


lew SPAC24B1 1.09 R07E5.13. 


103 DM01840A 10.95 9.571e-13 


1114 


BL01115 


GTP-binding nuclear protein ran proteins. 


BL01115A 10.22 3.438e-14 53-97


1114 


PR00449 


TRANSFORMING PROTEIN P21 RAS
SIGNATURE 


PR00449A 13.20 9.308e-16 53-75
PR00449C 17.27 8.920e-15 94-
117 PR00449B 14.34 5.680e-10
76-93 


1114 


PR00879 


FISH ACETYLCHOLINESTERASE

SIGNATURE 


PR00879A 6.28 1.450e-09 37-43


1114 


BL01125 


ROK family proteins. 


BL01125D 13.61 7.429e-09 214-
227 


1115 


PF00622 


Domain in SPla and the RYanodine Receptor. 


PF00622B 21.00 2.500e-13 265- 
287 


1115 


BL00518 


Zinc finger, C3HC4 type (RING finger), 
proteins.


BL00518 12.23 9.571e-10 103-
112 


1116 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019A 11.19 5.050e-11 66-80
PR00019B 11.36 6.850e-10 63-77


1116 


DM00315 


072 RIBONUCLEASE INHIBITOR. 


n\yfft031 ^fi 1^ 85 f% 362e-10 84- 
120 DM00315G 15.85 3.340e-09 
246-282 


1117 


BL00509 


Ras GTPase-activating proteins. 


BL00509B 10.28 5.263e-10 429- 
440 


1117 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 9.357e-10 794- 
809 PR00049D 0.00 1.915e-09
793-808 PR00049D 0.00 3.593e- 
09 792-807 PR00049D 0.00 
5.729e-09 791-806 


1117 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 7.559e-09 780- 
813


1117 


PR00806 


VINCULIN SIGNATURE 


PR00806A 6.63 8.397e-09 794- 
805 


1120 


PR00720 


MAMMALIAN LMW 

PHOSPHOTYROSINE PROTEIN

PHOSPHATASE SIGNATURE 


PR00720C 12.41 1.099e-27 88- 
109 PR00720B 10.61 4.789e-20 
71-87 PR00720A 16.54 2.000e-17 
28-41 PR00720E 10.01 1.342e-16 
rnvvuvu ihSA J .7786- 
15 110-127 


1120 


PR00719 


LMW PHOSPHOTYROSINE PROTEIN 
PHOSPHATASE SIGNATURE 


PR00719A 14.49 3.000e-23 9-27 
PR00719C 14.10 5.000e-18 85- 
101 PR00719B 14.32 1.346e-15 
52-69 PR00719D 17.52 9.654e-15
110-124 


1121 


PD00131 


ATP-BINDING TRANSPORT
TRANSMEMBR. 


PD00131B 34.97 7.987e-09 108- 
162


1123 


BL00615 


C-type lectin domain proteins.


BL00615A 16.68 9.526e-13 573- 
591 


1123 




TYPE II ANTIFREEZE PROTEIN
SIGNATURE 


PR00356C 13.33 3.793e-10 591-
609 PR00356D 13.09 5.038e-09 
619-636 


1123 


PR00439 


11-S SEED STORAGE PROTEIN FAMILY
SIGNATURE 


PR00439C 15.32 9.217e-09 332-
353 


1127 


BL00134 


Serine proteases, trypsin family, histidine
proteins. 


BL00134A 11.96 5.781e-15 493- 
510 BL00134B 15.99 4.194e-14 
675-699 


1127 


BL00021 


Kringle domain proteins. 


BL00021B 13.33 8.984e-12 493- 
511 


1127 


BL00495 


Apple domain proteins. 



BL00495N 11.04 9.735e-11 667-
702 


1127 


PR00453 


VON WILLEBRAND FACTOR TYPE A 

DOMAIN SIGNATURE


PR00453A 12.79 5.345e-14 251- 
269 PR00453B 14.65 4.682e-10 
290-305 


1127 


PR00722 


FAMILY (S1) SIGNATURE


PR00722A 12.27 9.129e-13 494- 
510 PR00722C 10.87 4.273e-11
674-687 PR00722B 12.51 4.000e- 
09 554-569 


1127 


BL01253 


Type I fibronectin domain proteins.


BL01253G 11.34 5.348e-09 674- 
688 


1128 


BL00236 


Neurotransmitter-gated ion-channels proteins. 


BL00236D 25.66 4.000e-30 64- 
106 


1128 


PR00252 


NEUROTRANSMITTER-GATED ION 
CHANNEL FAMILY SIGNATURE 


PR00252D 12.29 7.097e-10 71-84


1129 


BL00604 


Synaptophysin / synaptoporin proteins. 


BL00604F 5.96 7.718e-10 367- 
412 


1129 


PR00524 


CHOLECYSTOKININ TYPE A RECEPTOR 
SIGNATURE 


PR00524F 5.36 7.415e-09 208- 
222 


1129 


BL01113 


C1q domain proteins.


BL01113A 17.99 6.455e-14 158- 
185 BL01113A 17.99 5.622e-13 
28-55 BL01113A 17.99 1.000e-
12 393-420 BL01113A 17.99
4.462e-12 230-257 BL01113A
17.99 6.538e-12 34-61
BL01113A 17.99 1.000e-11 284-
311 BL01113A 17.99 1.205e-11
31-58 BL01113A 17.99 1.614e-
11 170-197 BL01113A 17.99
5.091e-11 167-194 BL01113A
17.99 6.523e-11 456-483
BL01113A 17.99 7.341e-11 411-
438 BL01113A 17.99 1.766e-10 
402-429 BL01113A 17.99 
2.915e-10 356-383 BL01113A 
17.99 6.745e-10 239-266 
BL01113A 17.99 7.319e-10 293- 
320 BL01113A 17.99 8.085e-10 
164-191 BL01113A 17.99 
8.660e-10 305-332 BL01113A 
17.99 2.038e-09 272-299 
BL01113A 17.99 2.212e-09 353-
380 BL01113A 17.99 2.385e-09 
86-113 BL01113A 17.99 2.731e- 
09 149-176 BL01113A 17.99 
2.904e-09 453-480 BL01113A
17.99 3.423e-09 131-158 
BL01113A 17.99 3.423e-09 308-
335 BL01113A 17.99 3.769e-09 
396-423 BL01113A 17.99 
3.769e-09 450-477 BL01113A 
17.99 3.942e-09 25-52 
BL01113A 17.99 3.942e-09 173- 
200 BL01113A 17.99 4.115e-09 
278-305 BL01113A 17.99 
4.981e-09 43-70 BL01113A 
17.99 5.327e-09 19-46 
BL01113A 17.99 5.500e-09 320-
347 BL01113A 17.99 6.192e-09
143-170 BL01113A 17.99
7.231e-09 224-251 BL01113A
17.99 8.269e-09 245-272 


1129 


BL00420 


Speract receptor repeat proteins domain 
proteins. 


BL00420A 20.42 3.571e-13 31-60 
BL00420A 20.42 9.082e-13 113-
142 BL00420A 20.42 8.691e-11
311-340 BL00420A 20.42 
4.098e-10 125-154 BL00420A 
20.42 4.541e-10 158-187 
BL00420A 20.42 5.279e-10 34-63 
BL00420A 20.42 5.426e-10 137- 
166 BL00420A 20.42 6.754e-10 
49-78 BL00420A 20.42 6.902e- 
10 266-295 BL00420A 20.42 
7.492e-10 43-72 BL00420A 
20.42 8.082e-10 25-54 
BL00420A 20.42 9.852e-10 167- 
196 BL00420A 20.42 2.800e-09
170-199 BL00420A 20.42 
2.938e-09 414-443 BL00420A 
20.42 3.492e-09 52-81
BL00420A 20.42 5.015e-09 305- 
334 BL00420A 20.42 5.569e-09 
37-66 BL00420A 20.42 7.923e- 
09 459-488 BL00420A 20.42 
8.477e-09 335-364 BL00420A 
20.42 8.754e-09 83-112 
BL00420A 20.42 9.446e-09 287- 
316 BL00420A 20.42 9.862e-09 
290-319
1132


PR00042 


FOS TRANSFORMING PROTEIN 
SIGNATURE 


PR00042E 9.69 7.652e-09 234- 
258 


1135 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 9.400e-14 365- 
378 PD00066 13.92 6.143e-12 
335-348 PD00066 13.92 2.174e-
11 395-408


1135 


BL00970 


Nuclear transition protein 2 proteins.


BL00970B 10.09 3.069e-10 55-81 


1135 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 3.746e-09 223- 
238 PR00049D 0.00 3.746e-09
224-239 PR00049D 0.00 3.898e-
09 220-235


1135 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 6.885e-11 349-
366 BL00028 16.07 5.886e-09
379-396


1135 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048B 6.02 6.211e-09 362-
372


1135 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 8.169e-09 221- 
254 


1136 


BL00962 


Ribosomal protein S2 proteins. 


BL00962D 22.51 5.500e-35 131- 
175 BL00962C 15.90 9.591e-17 
106-124 BL00962B 36.15 9.060e- 
15 40-94 


1136 


PR00395 


RIBOSOMAL PROTEIN S2 SIGNATURE 


PR00395C 16.17 1.000e-17 106-
124 PR00395D 13.04 7.000e-17
131-149 PR00395F 10.56 6.400e-
16 169-184 PR00395E 14.46
4.103e-11 148-160


1137 


BL00152 


ATP synthase alpha and beta subunits 
proteins. 


BL00152A 15.38 5.109e-14 128- 
154 


1139 


BL00152 


ATP synthase alpha and beta subunits 
proteins. 


BL00152B 21.40 4.273e-37 124- 
162 BL00152A 15.38 8.364e-23 
67-93 


1139 


PR00449


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449A 13.20 2.862e-09 139- 
161 


1140 


BL00152 


ATP synthase alpha and beta subunits 
proteins. 


BL00152B 21.40 2.000e-32 185-
223 BL00152A 15.38 8.364e-23
128-154


1140 


PR00449 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449A 13.20 7.672e-09 200- 
222 


1141 


PR00493 


BREAST CANCER TYPE I 
SUSCEPTIBILITY PROTEIN SIGNATURE 


PR00493G 7.57 1.184e-10 652- 
673 


1141 


PD00078 


REPEAT PROTEIN ANK NUCLEAR 
ANKYR. 


PD00078B 13.14 3.700e-10 494- 
507 


1141 


BL00518 


Zinc finger, C3HC4 type (RING finger), 
proteins. 


BL00518 12.23 3.667e-09 102- 
111 


1141 


PF00791 


Domain present in ZO-1 and Unc5-like netrin 
receptors. 


PF00791B 28.49 9.047e-15 501-
556 PF00791B 28.49 9.386e-14
468-523 PF00791C 20.98 9.814e-
10 515-554 PF00791C 20.98
7.618e-09 482-521


1141 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 3.500e-32 534-
550 PF00023A 16.03 7.857e-11
501-517 PF00023B 14.20 9.591e-
09 497-507


1143 


BL00301 


GTP-binding elongation factors proteins. 


BL00301A 12.41 1.750e-12 72-84


1143 


PR00315 


GTP-BINDING ELONGATION FACTOR 
SIGNATURE 


PR00315A 11.81 4.000e-14 72-86 
PR00315B 11.66 7.600e-10 118-
127


1145 


BL00745 


Prokaryotic-type class I peptide chain release
factors signat.


BL00745C 13.66 7.398e-18 59- 
106 


1146 


BL00745 


Prokaryotic-type class I peptide chain release 
factors signat. 


BL00745C 13.66 4.706e-12 59- 
106 


1149 


BL00660 


Band 4.1 family domain proteins. 


m OO^fiflR 17 33 4 ROOe-97 136- 

176 BL00660A31.507.91te-20 
*9.m^ "RT O0660C 23 36 2 241 e- 
19215-259 BL00660E 23.41 

0 647 p-1 3 301-324 


1149 


PR00935 


BAND 4.1 PROTEIN FAMILY 
SIGNATURE 


PR00935C 11.98 4.300e-17 154-
175 PR00935D 10.20 1.281e-14
91 <;-9T? PB00Q3^R 10 58 6 108e- 
12 141-155 PR00935A 10.16 


1149 


PR00661 


ERM FAMILY SIGNATURE 


PR00661C 9.53 3.616e-10 150- 


1153 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 1.882e-12 155- 
174 


1153 


BL00107 


Protein kinases ATP-binding region proteins. 


BL00107A 18.39 1.818e-15 155-
186 BL00107B 13.31 8.714e-11
221-237


1153 


BL00239 


Receptor tyrosine kinase class II proteins. 


BL00239B 25.15 1.774e-09 90-
138


1153 


BL00240 


Receptor tyrosine kinase class III proteins. 


BL00240E 11.56 6.657e-09 141- 

1 fy 


1153 


BL00479 


Phorbol esters / diacylglycerol binding 
domain proteins. 


BL00479C 12.01 9.000e-09 200-
213


1155 


PR00837 


ALLERGEN V5/TPX-1 FAMILY 
SIGNATURE 


PR00R37n 17 91 4 0fi4e-ll 155- 
172 PR00837A 14.77 4.960e-10 
78-97 PR00837B 11.64 1.310e-09 
133-147 


1155 


BL01009 


Extracellular proteins SCP/Tpx-1/Ag5/PR-
1/Sc7 proteins.


BL01009D 14.19 8.759e-12 156- 
177 BL01009C 10.54 1.730e-09 
133-147 


1155 


PR00199 


ANNEXIN TYPE III SIGNATURE 


PR00199F 16.19 9.483e-09 113- 
140 


1156 


BL01115 


GTP-binding nuclear protein ran proteins. 


BL01115A 10.22 8.909e-13 4-48


1156 


PR00449 


TRANSFORMING PROTEIN P21 RAS
SIGNATURE 


PR00449A 13.20 2.059e-19 4-26
PR00449C 17.27 1.000e-18 44-67
PR00449B 14.34 6.727e-11 27-44


1159 


BL00175 


Phosphoglycerate mutase family 
phosphohistidine proteins. 


BL00175D 27.67 4.000e-40 367-
419 BL00175C 23.75 6.870e-28
316-348 BL00175A 15.42
8.200e-19 252-272 BL00175B
12.60 8.714e-17 299-312


1159 


BL00300 


SRP54-type proteins GTP-binding domain
proteins. 


BL00300B 20.56 7.554e-11 38-84


1160 


PD00066


PROTEIN ZINC-FINGER METAL-BINDI.


PD00066 13.92 1.000e-14 547-
560 PD00066 13.92 2.200e-14
353-366 PD00066 13.92 3.400e-
14 241-254 PD00066 13.92
6.400e-14 325-338 PD00066
13.92 1.500e-13 297-310
PD00066 13.92 6.500e-13 465-
478 PD00066 13.92 7.500e-13
437-450 PD00066 13.92 8.500e-
13 409-422 PD00066 13.92
2.714e-12 269-282 PD00066
13.92 3.571e-12 381-394
PD00066 13.92 7.577e-10 519-
532


1160 


BL00028


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 3.647e-14 531-
548 BL00028 16.07 9.471e-14
309-326 BL00028 16.07 1.900e-
13 559-576 BL00028 16.07
7.750e-13 477-494 BL00028
16.07 2.174e-12 337-354
BL00028 16.07 6.478e-12 225-
242 BL00028 16.07 8.043e-12
421-438 BL00028 16.07 9.217e-
12 365-382 BL00028 16.07
2.038e-11 253-270 BL00028
16.07 7.231e-11 281-298
BL00028 16.07 6.100e-10 449-
466 BL00028 16.07 8.800e-10
503-520 BL00028 16.07 5.371e-
09 393-410


1160 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 7.429e-13 418-
432 PR00048A 10.52 2.588e-12
362-376 PR00048A 10.52 3.647e-
12 556-570 PR00048A 10.52
8.412e-12 474-488 PR00048A
10.52 9.471e-12 222-236
PR00048A 10.52 2.421e-11 306-
320 PR00048B 6.02 4.462e-11
350-360 PR00048A 10.52 5.737e-
11 250-264 PR00048A 10.52
7.632e-11 528-542 PR00048A
10.52 8.579e-11 278-292
PR00048A 10.52 9.053e-11 446-
460 PR00048B 6.02 9.308e-11
544-554 PR00048B 6.02 9.308e-
11 572-582 PR00048B 6.02
1.000e-10 406-416 PR00048A
10.52 2.565e-10 334-348
PR00048B 6.02 4.938e-10 378-
388 PR00048A 10.52 5.304e-10
390-404 PR00048B 6.02 8.313e-
10 238-248 PR00048B 6.02
1.947e-09 434-444 PR00048B
6.02 2.421e-09 462-472
PR00048B 6.02 3.368e-09 322-
332 PR00048B 6.02 3.842e-09
266-276 PR00048B 6.02 6.211e-
09 294-304


1161 


PD02331 


CYCLIN CELL CYCLE DIVISION PROTE. 


PD02331C 13.84 1 913e-ll Q-^fi 


1161 


BL00048 


Protamine P1 proteins.


BL00048 6.39 3.700e-09 165-192 
BL00048 6.39 4.938e-09 281-308 


1161 


DM01206 


CORONAVIRUS NUCLEOCAPSID 
PROTEIN. 


DM01206B 10.69 9.328e-11 169-
189 DM01206B 10.69 1.247e-10
248-268 DM01206B 10.69
7.781e-10 200-220 DM01206B
10.69 6.582e-09 246-266


1163 


BL00239 


Receptor tyrosine kinase class II proteins. 


BL00239B 25.15 3.915e-15 100-
148


1163


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 6.362e-13 165-
184


1163 


BL00479


Phorbol esters / diacylglycerol binding
domain proteins.


BL00479C 12.01 1.000e-10 208- 
221 


1163


BL00107


Protein kinases ATP-binding region proteins.


BL00107A 18.39 3.250e-26 165- 
196 BL00107B 13.31 3.647e-09 
230-246 


1164


BL01013


Oxysterol-binding protein family proteins.


BL01013D 26.81 9.135e-22 501-
545 BL01013A 25.14 4.600e-14
220-256 BL01013C 9.97 4.906e-
12 330-340 BL01013B 11.33
3.017e-11 287-298


1167 


BL00289 


Pentaxin family proteins. 


BL00289A 30.36 6.850e-26 25-56
BL00289E 18.00 6.684e-14 78-93 


1167 


PR00895 


PENTAXIN SIGNATURE 


PR00895A 14.53 1.563e-15 48-63 
PR00895G 14.55 5.846e-12 75-87 


1167 


PR00468 


PLANT LIPOXYGENASE SIGNATURE 


PR00468I 13.42 9.870e-09 59-74 


1168 


PR00217


SIGNATURE 


PR00217C 10.91 7.527e-09 547-
563


1169 


PR00756 


MEMBRANE ALANYL DIPEPTIDASE
(M1) FAMILY SIGNATURE


PR00756D 10.58 1.529e-21 367-
383 PR00756B 14.06 5.737e-16
253-269 PR00756A 12.90 1.237e-
13 205-221 PR00756E 11.91
4.094e-13 386-399 PR00756C
11.60 6.108e-11 331-342


1169 


BL00142


Neutral zinc metallopeptidases, zinc-binding
region proteins.


BL00142 8.38 5.500e-10 367-378


1170 


DM01688


0 PHT V.TO RFPFPTOR 


DM016881 14.97 6.279e-09 75- 
123 


1172 


PR00308 


TYPE I ANTIFREEZE PROTEIN 
SIGNATURE 


PR00308C 3.83 2.523e-10 40-50 
PR00308C 3.83 8.892e-10 41-51 
PR00308C 3.83 8.892e-10 42-52 
PR00308B 4.28 6.671e-09 40-52 


1172 


PR00833


POLLEN ALLERGEN POA PI
SIGNATURE


PR00833H 2.30 7.115e-09 30-45


1179 


BL00300 


SRP54-type proteins GTP-binding domain
proteins.


BL00300C 25.57 6.000e-09 215- 
269 


1180 


BL00514 


Fibrinogen beta and gamma chains C- 
terminal domain proteins. 


BL00514C 17.41 9.463e-19 233-
270 BL00514E 14.28 7.750e-12
293-310 BL00514D 15.35
9.824e-11 274-287 BL00514G
15.98 4.273e-10 356-386
BL00514H 14.95 6.217e-09 391-
416


1181 


BL01158 


Macrophage migration inhibitory factor 
family proteins. 


BL01158A 21.81 4.130e-30 2-47 
BL01158B 17.07 4.316e-29 47-74 


1182


BL00456


Sodium:solute symporter family proteins.


BL00456A 22.59 6.250e-40 46-
101 BL00456C 24.55 6.586e-40
184-239 BL00456B 18.94 8.125e-
25 122-152 BL00456D 6.92
5.500e-10 476-486


1185 


PR00830 


ENDOPEPTIDASE LA (LON) SERINE 
PROTEASE (S16) SIGNATURE 


PR00830A 8.41 4.780e-14 241- 
261 


1185 


PR00918 


CALICIVIRUS NON-STRUCTURAL 
POLYPROTEIN FAMILY SIGNATURE 


PR00918A 13.76 1.976e-10 231-
252 PR00918A 13.76 2.084e-10
495-516


1185 


PR00300 


ATP-DEPENDENT CLP PROTEASE ATP-
BINDING SUBUNIT SIGNATURE


PR00300A 9.56 5.857e-12 237-
256 PR00300A 9.56 5.909e-09
501-520


1185 


BL00370 


PEP-utilizing enzymes phosphorylation site 
proteins proteins. 


BL00370A 5.71 6.294e-09 79-87 


1185


BL00113


Adenylate kinase proteins. 


BL00113A 12.74 7.231e-09 502- 
519 


1185


BL00674


AAA-protein family proteins.


BL00674D 23.41 2.286e-30 324-
371 BL00674B 4.46 1.205e-17
234-256 BL00674C 22.60 2.059e-
17 531-574 BL00674B 4.46
4.886e-17 498-520 BL00674E
15.24 2.800e-15 402-422
BL00674C 22.60 7.600e-09 270-
313


1185 


BL00870 


Chaperonins clpA/B proteins. 


BL00870A 11.78 9.534e-09 210-
257


1185 


BL00675 


Sigma-54 interaction domain proteins ATP- 
binding region A proteins. 


BL00675A 24.86 9.775e-09 237-
281


1186 


PR00165 


ANION EXCHANGER SIGNATURE 


PR00165I 10.02 8.412e-14 829-
849 PR00165A 9.84 6.423e-13
495-518 PR00165B 15.26 9.090e-
11 520-541 PR00165F 10.39
6.663e-10 639-658


1186 


BL00291 


Prion protein. 


BL00291A 4.49 9.675e-10 436-
471


1186 


BL00219 


Anion exchangers family proteins. 


BL00219B 14.47 2.707e-24 296-
340 BL00219C 17.29 5.426e-23
341-380 BL00219K 12.73
9.100e-23 831-873 BL00219M
9.98 9.299e-23 916-962
BL00219H 10.06 5.705e-21 618-
666 BL00219I 6.16 4.968e-20
741-795 BL00219A 17.13
7.833e-19 122-154 BL00219E
11.63 2.988e-16 485-525
BL00219F 10.52 8.953e-14 525-
549 BL00219G 12.86 8.163e-13
578-617 BL00219L 18.71 8.423e-
13 873-912 RT 00910M 1 ft fA 

6.942e-12 978-1022 BL00219D 
15.15 5.286e-11 380-416
BL00219O 14.02 3.377e-09 1023- 
1063 


1186 


PD01168 


SYNTHETASE LIGASE PROTEIN 
ALANYL. 


PD01168L 9.47 7.833e-09 452- 
467 


1186


DM01724


K.W ni^civuJirN rUJuLbJN LlMi HOL-LI.


DM01724 8.14 4.296e-10 446-466
DM01724 8.14 6.447e-09 442-462
DM01724 8.14 7.987e-09 438-458


1187


BL00027


'Homeobox' domain proteins.


BL00027 26.43 7.943e-14 65-108


1187


PR00025


HOMEOTIC ANTENNAPEDIA PROTEIN
SIGNATURE


PR00025B 11.94 4.000e-11 51-67


1187


BL00032


'Homeobox' antennapedia-type protein.


BL00032B 10.83 7.660e-10 54-93


1187


PR00028


POU DOMAIN SIGNATURE


PR00028D 17.92 2.286e-09 49-70


1189


BL00115


Eukaryotic RNA polymerase II heptapeptide
repeat proteins.


BL00115Z 3.12 2.047e-10 469-
518


1192


BL01215


Mrp family proteins.


BL01215A 9.75 2.436e-09 466-
493


1192


BL00567


Phosphoribulokinase proteins.


268


1192


PR00364 


DISEASE RESISTANCE PROTEIN
SIGNATURE


PR00364A 8.19 7.341e-09 470-
486


1192 


BL00113 


Adenylate kinase proteins. 


BL00113A 12.74 8.062e-09 472- 
489 


1193 


PR00303 


PREPROTEIN TRANSLOCASE SECY 
SUBUNIT SIGNATURE 


PR00303G 10.45 8.759e-09 88- 
111 


1197 


PF00429 


ENV polyprotein (coat polyprotein). 




1198 


BL00415 


Synapsins proteins. 


BL00415N 4.29 7.115e-10 224- 
268 


1198 


PR00239 


MOLLUSCAN RHODOPSIN C- 
TERMINAL TAIL SIGNATURE 


PR00239E 1.58 1.307e-09 253- 
265 


1198 


PR00211 


GLUTELIN SIGNATURE 


PR00211B 0.86 7.537e-12 245-
266 PR00211B 0.86 2.644e-10
251-272 PR00211B 0.86 4.083e-
09 233-254 PR00211B 0.86
7.583e-09 239-260


1198 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 5.688e-12 227- 
260 DM00215 19.43 6.250e-12
225-258 DM00215 19.43 5.235e- 
11 232-265 DM00215 19.43 
5.941e-11 242-275 DM00215
19.43 4.375e-10 236-269 
DM00215 19.43 4.857e-10 222- 
255 DM00215 19.43 5.179e-10 

230- 263 DM002 1 5 1 9,43 «.554e- 
10 237-270 DM00215 19.43 
2.068e-09 215-248 DM00215 
19.43 3.898e-09 235-268 

/ r\rvo t e its Al A t?(\Qt* f\CS OAA 
DM0U215 IV. 4 5 4.jU<>e-Uy Z4U- 

273 DM00215 19.43 5.576e-09 

231-264 DM00215 19.43 6.339e-
09 220-253 DM00215 19.43
9.847e-09 218-251


1200 


BL00183 


Ubiquitin-conjugating enzymes proteins. 


BL00183 28.97 4.326e-22 81-129 


1202 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 4.214e-16 108-
148 BL00237C 13.19 3.323e-11
245-272 BL00237B 5.28 2.227e-
09 182-194


1202 


PR00237 


RHODOPSIN-LIKE GPCR SUPERFAMILY
SIGNATURE 


PR00237F 13.57 4.600e-13 250-
275 PR00237E 13.03 1.000e-12
174-198 PR00237G 19.63 7.469e-
12 288-315 PR00237B 13.50
7.207e-11 75-97 PR00237C 15.69
7.750e-09 122-145


1203 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 4.214e-16 108-
148 BL00237C 13.19 3.323e-11
280-307 BL00237B 5.28 2.227e-
09 217-229


1203 


PR00237 


RHODOPSIN-LIKE GPCR SUPERFAMILY 
SIGNATURE 


PR00237F 13.57 4.600e-13 285-
310 PR00237E 13.03 1.000e-12
209-233 PR00237G 19.63 7.469e-
12 323-350 PR00237B 13.50
7.207e-11 75-97 PR00237C 15.69
7.750e-09 122-145


1207 


PR00259 


TRANSMEMBRANE FOUR FAMILY 
SIGNATURE 


PR00259B 14.81 3.769e-21 50-77 
PR00259C 16.40 4.000e-20 77- 
106 PR00259A 9.27 3.600e-16
12-36


1207


BL00421


Transmembrane 4 family proteins. 


BL00421B 17.62 7.261e-36 56-95
BL00421A 11.79 8.313e-16 8-27 


1207 


PR00164 


ABC-2 TYPE TRANSPORT SYSTEM
MEMBRANE PROTEIN SIGNATURE


PR00164D 13.90 1.486e-09 9-34 


190R 




Kazal serine protease inhibitors family
proteins.


BL00282 16.88 7.207e-14 562- 

ette 
JOJ 


19flfl 




Sugar transport proteins.


BL00216B 27.64 3.250e-10 267-
317




PF00922


Vesiculovirus phosphoprotein. 


PF00922A 19.17 7.724e-09 88- 
122 


1214 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE


PR00049D 0.00 7.143e-10 17-32 


1214 


PD01351 


PROTEIN REPEAT NEUROFILAMENT 
TRIPL. 


PD01351B 13.72 9.518e-10 18-44 
PD01351B 13.72 3.758e-09 24-50 


1214 


DM00215


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 5.576e-09 5-38 


1214 


PR00910 


LUTEOVIRUS ORF6 PROTEIN 
SIGNATURE 


PR00910A 2.51 7.857e-09 17-30


1215 


BL00612 


Osteonectin domain proteins. 


BL00612E 13.12 3.947e-11 379-
424


1215 


BL00484 


Thyroglobulin type-1 repeat proteins proteins. 


BL00484C 17.01 3.854e-11 131-
146 BL00484B 9.04 1.491e-10
238-252 BL00484C 17.01 8.560e-
10 258-273 BL00484B 9.04
3.850e-09 111-125


1216 


BL00223 


Annexins repeat proteins domain proteins. 


BL00223A 15.59 1.000e-33 147-
181 BL00223A 15.59 1.435e-16
75-109 BL00223C 24.79 3.928e-
15 134-189


1216 


PR00196 


ANNEXIN FAMILY SIGNATURE 


PR00196C 10.36 3.571e-25 151-
173 PR00196A 11.16 7.300e-24
84-107 PR00196B 10.68 4.808e-
16 124-141 PR00196A 11.16
6.236e-14 156-179 PR00196E
9.19 1.000e-12 155-176
PR00196G 11.72 5.829e-11 199-
213 PR00196C 10.36 7.913e-11
79-101


1216 


PR00201 


ANNEXIN TYPE V SIGNATURE 


PR00201D 10.49 8.729e-14 151-
173 PR00201A 6.05 1.976e-13
84-107 PR00201G 11.02 3.847e-
12 155-182 PR00201A 6.05
8.241e-12 156-179 PR00201H
12.04 4.889e-10 199-213


1216 


PR00198 


ANNEXIN TYPE II SIGNATURE 


PR00198D 7.65 7.787e-21 151-
173 PR00198B 8.71 3.880e-17
84-107 PR00198C 14.32 2.688e-
11 124-141 PR00198G 8.09
7.033e-10 155-176


1216 


PR00200 


ANNEXIN TYPE IV SIGNATURE


PR00200E 10.00 5.030e-19 151-
173 PR00200G 9.43 5.546e-14
155-182 PR00200B 7.39 4.653e-
11 156-179 PR00200B 7.39
4.857e-10 84-107 PR00200H
13.68 9.663e-10 199-213


1216 


PR00202 


ANNEXIN TYPE VI SIGNATURE 


PR00202D 5.58 6.793e-14 151-
173 PR00202G 8.01 5.545e-13
155-182 PR00202B 11.44 2.782e-
10 155-179 PR00202B 11.44
5.206e-09 83-107


1216 


PR00197 


ANNEXIN TYPE I SIGNATURE 


PR00197B 7.56 4.960e-29 84-107
PR00197D 7.50 3.000e-26 151-
173 PR00197A 8.68 7.577e-20
35-51 PR00197C 7.50 1.000e-19
124-141 PR00197F 9.03 7.128e-
10 155-176 PR00197D 7.50
6.250e-09 79-101


1216 


PR00199 


ANNEXIN TYPE III SIGNATURE 


PR00199D 5.65 9.297e-17 151-
173 PR00199B 6.86 2.915e-13
84-107 PR00199B 6.86 1.265e-11
156-179 PR00199G 9.09 4.351e-
11 156-182 PR00199D 5.65
3.641e-09 79-101 PR00199C
13.84 9.571e-09 124-141


1217 


BL00223 


Annexins repeat proteins domain proteins. 


BL00223B 28.47 1.000e-40 188-
238 BL00223A 15.59 1.000e-33
119-153 BL00223A 15.59
1.435e-16 47-81 BL00223C 24.79
3.928e-15 106-161


1217 


PR00201 


ANNEXIN TYPE V SIGNATURE 


PR00201D 10.49 8.729e-14 123-
145 PR00201A 6.05 1.976e-13
56-79 PR00201G 11.02 3.847e-12
127-154 PR00201A 6.05 8.241e-
12 128-151 PR00201E 12.37
3.317e-11 206-233 PR00201H
12.04 4.889e-10 171-185


1217 


PR00198 


ANNEXIN TYPE II SIGNATURE 


PR00198D 7.65 7.787e-21 123-
145 PR00198B 8.71 3.880e-17
56-79 PR00198E 14.67 5.286e-15
206-233 PR00198C 14.32 2.688e-
11 96-113 PR00198G 8.09
7.033e-10 127-148


1217 


PR00196 


ANNEXIN FAMILY SIGNATURE 


PR00196D 21.86 1.000e-27 206-
233 PR00196C 10.36 3.571e-25
123-145 PR00196A 11.16 7.300e-
24 56-79 PR00196B 10.68
4.808e-16 96-113 PR00196A
11.16 6.236e-14 128-151
PR00196E 9.19 1.000e-12 127-
148 PR00196G 11.72 5.829e-11
171-185 PR00196C 10.36 7.913e-
11 51-73 PR00196C 10.36
8.750e-10 282-304


1217 


PR00202 


ANNEXIN TYPE VI SIGNATURE 


PR00202D 5.58 6.793e-14 123-
145 PR00202G 8.01 5.545e-13
127-154 PR00202E 13.00 8.740e-
11 206-233 PR00202B 11.44
2.782e-10 127-151 PR00202B
11.44 5.206e-09 55-79


1217 


PR00197 


ANNEXIN TYPE I SIGNATURE 


PR00197E 11.89 1.794e-32 206-
233 PR00197B 7.56 4.960e-29
56-79 PR00197D 7.50 3.000e-26
123-145 PR00197A 8.68 7.577e-
20 7-23 PR00197C 7.50 1.000e-
19 96-113 PR00197F 9.03
7.128e-10 127-148 PR00197D
7.50 6.250e-09 51-73


1217


PR00200 


ANNEXIN TYPE IV SIGNATURE 


PR00200E 10.00 5.030e-19 123-
145 PR00200G 9.43 5.546e-14
127-154 PR00200B 7.39 4.653e-
11 128-151 PR00200F 13.72
7.094e-11 206-233 PR00200B
7.39 4.857e-10 56-79 PR00200H
13.68 9.663e-10 171-185
PR00200E 10.00 8.842e-09 282-
304


1217


PR00199


ANNEXIN TYPE III SIGNATURE


PR00199D 5.65 9.297e-17 123-
145 PR00199F 16.19 4.391e-15
206-233 PR00199B 6.86 2.915e-
13 56-79 PR00199B 6.86 1.265e-
11 128-151 PR00199G 9.09
4.351e-11 128-154 PR00199D
5.65 3.641e-09 51-73 PR00199C
13.84 9.571e-09 96-113


1218 


BL00223 


Annexins repeat proteins domain proteins. 


BL00223A 15.59 1.000e-33 119- 
153 BL00223A 15.59 1.435e-16 
47-81 BL00223C 24.79 3.928e-15 
106-161 


1218 


PR00196 


ANNEXIN FAMILY SIGNATURE 


PR00196C 10.36 3.571e-25 123-
145 PR00196A 11.16 7.300e-24
56-79 PR00196B 10.68 4.808e-16
96-113 PR00196A 11.16 6.236e-
14 128-151 PR00196E 9.19
1.000e-12 127-148 PR00196G
11.72 5.829e-11 171-185
PR00196C 10.36 7.913e-11 51-73


1218 


PR00201 


ANNEXIN TYPE V SIGNATURE 


PR00201D 10.49 8.729e-14 123-
145 PR00201A 6.05 1.976e-13
56-79 PR00201G 11.02 3.847e-12
127-154 PR00201A 6.05 8.241e-
12 128-151 PR00201H 12.04
4.889e-10 171-185


1218 


PR00198 


ANNEXIN TYPE II SIGNATURE 


PR00198D 7.65 7.787e-21 123-
145 PR00198B 8.71 3.880e-17
56-79 PR00198C 14.32 2.688e-11
96-113 PR00198G 8.09 7.033e-10
127-148


1218 


PR00200 


ANNEXIN TYPE IV SIGNATURE 


PR00200E 10.00 5.030e-19 123-
145 PR00200G 9.43 5.546e-14
127-154 PR00200B 7.39 4.653e-
11 128-151 PR00200B 7.39
4.857e-10 56-79 PR00200H 13.68
9.663e-10 171-185


1218 


PR00202 


ANNEXIN TYPE VI SIGNATURE 


PR00202D 5.58 6.793e-14 123-
145 PR00202G 8.01 5.545e-13
127-154 PR00202B 11.44 2.782e-
10 127-151 PR00202B 11.44
5.206e-09 55-79


1218 


PR00197 


ANNEXIN TYPE I SIGNATURE 


PR00197B 7.56 4.960e-29 56-79
PR00197D 7.50 3.000e-26 123-
145 PR00197A 8.68 7.577e-20 7-
23 PR00197C 7.50 1.000e-19 96-
113 PR00197F 9.03 7.128e-10
127-148 PR00197D 7.50 6.250e-
09 51-73


1218


PR00199 


ANNEXIN TYPE III SIGNATURE 


PR00199D 5.65 9.297e-17 123-
145 PR00199B 6.86 2.915e-13
56-79 PR00199B 6.86 1.265e-11
128-151 PR00199G 9.09 4.351e-
11 128-154 PR00199D 5.65
3.641e-09 51-73 PR00199C 13.84
9.571e-09 96-113


1221 


BL00086


Cytochrome P450 cysteine heme-iron ligand 
proteins. 


BL00086 20.87 8.615e-27 423- 
455 


1221 


PR00465 


E-CLASS P450 GROUP IV SIGNATURE 


PR00465F 13.37 8.468e-12 393- 
412 


1221 


PR00359 


B-CLASS P450 SIGNATURE 


PR00359I 11.13 7.261e-11 433-
445


1221 


PR00463 


E-CLASS P450 GROUP I SIGNATURE 


PR00463I 15.02 9.571e-21 433-
457 PR00463G 18.24 6.760e-19
388-413 PR00463E 17.37 6.595e-
17 304-331 PR00463F 17.63
7.568e-12 347-366 PR00463B
17.50 7.692e-12 79-101
PR00463D 14.02 8.875e-11 284-
302 PR00463C 12.85 6.932e-10
171-190


1221 


PR00464 


E-CLASS P450 GROUP II SIGNATURE 


PR00464G 12.41 2.588e-12 398-
414 PR00464E 18.28 3.077e-10
342-363 PR00464I 14.64 3.106e-
10 433-457 PR00464H 13.32
4.635e-09 420-434 PR00464D
17.40 5.787e-09 313-331
PR00464C 18.84 5.808e-09 284-
313


1221 


PR00385 


P450 SUPERFAMILY SIGNATURE 


PR00385E 12.66 9.100e-14 433- 
445 PR00385A 14.97 5.696e-13
295-313 JrKUUjojo 1U.Z2 o.4UUe- 
09 313-327 


1221 


PR00408 


MITOCHONDRIAL P450 SIGNATURE 


PR00408D 15.44 6.831e-09 295-
313


1222 


PR00385 


P450 SUPERFAMILY SIGNATURE


PR00385A 14.97 5.696e-13 295-
313


1222 


PR00463 


E-CLASS P450 GROUP I SIGNATURE


PR00463B 17.50 7.692e-12 79-
101 PR00463D 14.02 8.875e-11
284-302 PR00463C 12.85 6.932e-
10 171-190


1222 


PR00464 


E-CLASS P450 GROUP II SIGNATURE 


PR00464C 18.84 5.808e-09 284- 
313 


1222 


PR00408 


MITOCHONDRIAL P450 SIGNATURE 


PR00408D 15.44 6.831e-09 295- 
313 


1223 


BL00477 


Alpha-2-macroglobulin family thiolester 
region proteins. 


BL00477A 13.50 9.182e-19 70-99


1225 


BL00500


Thymosin beta-4 family proteins.


BL00500 9.77 2.565e-28 2-42


1227 


PR00320 


G-PROTEIN BETA WD-40 REPEAT 
SIGNATURE 


PR00320A 16.74 4.971e-10 231-
246 PR00320C 13.01 8.200e-10
231-246 PR00320B 12.19 9.486e-
10 231-246 PR00320B 12.19
3.475e-09 188-203 PR00320B
12.19 4.600e-09 315-330
PR00320C 13.01 4.900e-09 315-
330


1227 


PR00319 


BETA G-PROTEIN (TRANSDUCIN)
SIGNATURE


PR00319B 11.47 9.143e-09 315-
330


1227 


BL00678 


Trp-Asp (WD) repeat proteins proteins. 


BL00678 9.67 1.600e-10 233-244 
BL00678 9.67 1.000e-08 317-328 


1236 


PF00580 


UvrD/REP helicase. 


PF00580D 13.15 8.920e-13 670-
684 PF00580E 13.89 2.800e-11
867-886 PF00580F 8.62 9.438e-
10 913-926


1237 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019B 11.36 1.000e-09 64-78
PR00019A 11.19 8.000e-09 90- 
104 


1238 


PD00126 


PROTEIN REPEAT DOMAIN TPR 
NUCLEA. 


PD00126A 22.53 5.500e-10 229-
250


1243 


DM01206 


CORONAVIRUS NUCLEOCAPSID 
PROTEIN. 


DM01206B 10.69 4.759e-09 464- 
484 


1243 


BL00315 


Dehydrins proteins. 


BL00315A 9.35 1.000e-08 389-
417


1245 


BL00972 


Ubiquitin carboxyl-terminal hydrolases 
family 2 proteins. 


BL00972A 11.93 8.054e-15 191-
209


1246 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 9.500e-12 347- 
363 PF00023A 16.03 8.500e-10 
283-299 PF00023A 16.03 8.875e- 
10 184-200 


1246 


PF00791 


Domain present in ZO-1 and Unc5-like netrin 
receptors. 


PF00791B 28.49 1.989e-13 217-
272 PF00791B 28.49 6.044e-13
117-172 PF00791B 28.49 4.316e-
12 184-239 PF00791B 28.49
9.432e-12 250-305 PF00791B
28.49 6.243e-10 84-139
PF00791C 20.98 4.971e-09 98-
137


1246 


BL00906 


Uroporphyrinogen decarboxylase proteins. 


BL00906D 24.33 7.750e-09 212- 
256 


1248 


BL00415 


Synapsins proteins. 


BL00415Q 2.23 8.297e-09 13-49


1250 


BL01113 


C1q domain proteins.


BL01113B 18.26 2.500e-13 841- 
877 


1252 


BL01248 


Laminin-type EGF-like (LE) domain proteins. 


BL01248 11.02 7.171e-12 258- 
271 BL01248 11.02 7.943e-12 
325-338 


1252 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011D 14.03 7.000e-17 376-
395 PR00011A 14.06 1.000e-14
376-395 PR00011B 13.08 5.167e-
14 376-395 PR00011C 24.25
8.468e-14 395-424 PR00011D
14.03 9.739e-09 249-268


1253 


BL00164 


Enolase proteins. 


BL00164A 13.58 2.800e-28 41-64 


1253 


PR00148 


ENOLASE SIGNATURE 


PR00148A 10.11 1.783e-18 44-59




BL01153 


NOL1/NOP2/sun family proteins.


BL01153D 19.69 8.322e-14 102-
116 BL01153C 13.67 6.507e-10
51-65


1256


BL00892 


HIT family proteins. 


BL00892B 16.86 1.000e-20 130- 
154 BL00892A 18.17 6.657e-20 
64-95 


1256 


PR00332 


HISTIDINE TRIAD FAMILY SIGNATURE 


PR00332B 13.62 3.000e-16 76-95 
PR00332C 7.37 4.600e-14 143- 
154 PR00332A 10.15 7.375e-12 
55-72 


1257 


PF00791


Domain present in ZO-1 and Unc5-like netrin
receptors.


PF00791B 28.49 4.146e-10 73-
128


1258 


BL00615 


C-type lectin domain proteins. 


BL00615B 12.25 5.200e-12 166- 
180 


1259 


BL00071 


Glyceraldehyde 3-phosphate dehydrogenase 
proteins. 


BL00071B 21.70 1.000e-40 80-
126 BL00071C 11.81 1.000e-40
146-181 BL00071D 19.39
3.118e-25 184-239 BL00071E
11.48 4.600e-24 308-329
BL00071A 5.81 2.607e-14 5-17


1259 


PR00078 


GLYCERALDEHYDE-3-PHOSPHATE 
DEHYDROGENASE SIGNATURE 


PR00078B 7.45 3.250e-24 146-
165 PR00078D 11.49 2.800e-21
231-249 PR00078E 10.50 Alle- 
le 271-287 PR00078A 10.38 
1.000e-15 111-125 PR00078C 
15.99 6.211e-11 173-190


1262 


PR00926 


MITOCHONDRIAL CARRIER PROTEIN 
SIGNATURE 


PR00926F 17.75 2.688e-10 15-38 
PR00926D 10.53 6.625e-10 21-40


1262 


PR00927 


ADENINE NUCLEOTIDE 
TRANSLOCATOR I SIGNATURE 


PR00927E 14.93 6.143e-10 44-66 
PR00927B 14.66 9.870e-10 265- 
287 PR00927B 14.66 5.685e-09 
46-68 


1262 


BL00215 


Mitochondrial energy transfer proteins. 


BL00215A 15.82 6.250e-17 13-38 
BL00215A 15.82 1.600e-15 230- 
255 BL00215A 15.82 5.974e-13 
108-133 BL00215B 10.44 7.600e- 
09 275-288 


1263 


PR00654 


ANGIOTENSINOGEN SIGNATURE 


PR00654A 15.64 1.540e-26 23-44 
PR00654D 10.48 3.538e-26 153- 
175 PR00654F 15.16 8.071e-26 
255-275 PR00654E 9.81 2.241e- 
25 194-215 PR00654C9.50 
5.500e-21 115-135 


1263 


BL00284 


Serpins proteins. 


BL00284C 28.56 9.514e-21 254- 
296 BL00284E 19.15 9.710e-16 
439-464 BL00284A 15.64 
8.147e-14 113-137 BL00284D
16.34 1.837e-12 361-388 

n T nATO/ID 1 1 AA *7 CAAa 1 1 TOO 

250 


1264 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 8.071e-17 34-57 


1265 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


tit r\ rtonri a OA On H /CAAa 1 C. 1A 

BLOOzyOA TXt.oy 7.oUUe-lo 34-D/ 


1266 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


t»T AAlAn A OA OA O OAAn 1 £L 1 1 Ci4 

BL00290A 20.89 2.oOUe-lo 31-54 


1267 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 9.400e-16 34-57


1268 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins.


BL00290B 13.17 4.00Ue-2l 2oZ- 
300 BL00290A 20.89 4.600e-16
34-57 BL00290A 20.89 2.421e-
13 225-248


1269 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 4.600e-16 34-57 


1271 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 8.071e-17 34-57 


1272 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 4.600e-16 34-57 


1273 


BL00290 


Immunoglobulins and major
histocompatibility complex proteins.


BL00290A 20.89 4.600e-16 34-57




1274 


PF00651 


BTB (also known as BR-C/Ttk) domain 
proteins. 


PF00651 15.00 2.500e-14 46-59 


1274 


DM00099 


4 kw A55R REDUCTASE TERMINAL 
DIHYDROPTERIDINE. 


DM00099B 14.73 6.063e-09 300- 
310 


1274 


PR00501 


KELCH REPEAT SIGNATURE 


PR00501A 8.25 7.750e-11 472-
486 PR00501A 8.25 7.955e-09
328-342


1281 


BL00972 


Ubiquitin carboxyl-terminal hydrolases 
family 2 proteins. 


BL00972A 11.93 3.919e-15 101-
119 BL00972B 9.45 7.577e-10
180-190


1285 


PF00789 


Domain present in ubiquitin-regulatory 
proteins. 


PF00789B 19.70 5.941e-09 213- 
234 


1286 


PF00789 


Domain present in ubiquitin-regulatory 
proteins. 


PF00789B 19.70 5.941e-09 259- 
280 


1287 


PR00625 


DNAJ PROTEIN FAMILY SIGNATURE 


PR00625A 12.84 3.000e-19 19-39 
PR00625B 13.48 2.756e-17 47-68 


1287 


BL00636 


Nt-dnaJ domain proteins. 


BL00636A 8.07 7.600e-19 23-40 
BL00636B 15.11 6.870e-15 47-68 


1288 


BL00518 


Zinc finger, C3HC4 type (RING finger), 
proteins. 


BL00518 12.23 5.714e-10 24-33


1289 


PR00500 


POLYCYSTIC KIDNEY DISEASE 
PROTEIN SIGNATURE 


PR00500I 9.22 1.107e-31 2810-
2833 PR00500G 3.68 1.087e-30
2525-2548 PR00500H 17.80
1.107e-29 2662-2684 PR00500E
6.99 1.106e-27 2350-2370
PR00500F 9.44 1.108e-26 2483-
2503


1289 


PF00801 


PKD domain proteins. 


PF00801B 23.63 9.217e-26 1055-
1083 PF00801A 13.49 6.276e-11
222-235 PF00801B 23.63 3.087e-
10 719-747 PF00801B 23.63
6.609e-10 1652-1680


1291 


BL00415 


Synapsins proteins. 


BL00415N 4.29 5.401e-09 136-
180


1292 


PD00930 


PROTEIN GTPASE DOMAIN 
ACTIVATION. 


PD00930B 33.72 2.800e-23 229-
270 PD00930A 25.62 5.021e-12
125-151


1292 


PF00620 


GTPase-activator protein for Rho-like
GTPases.


PF00620B 14.20 7.000e-12 178- 
195 


1293 


BL00023 


Type II fibronectin collagen-binding domain 
proteins. 


BL00023 24.31 8.043e-34 281- 
318 BL00023 24.31 5.320e-32 
223-260 BL00023 24.31 5.800e- 
29 340-377 


1293 


BL00142 


Neutral zinc metallopeptidases, zinc-binding 
region proteins. 


BL00142 8.38 1.000e-12 398-409 


1293 


PR00138 


MATRIXIN SIGNATURE 


PR00138D 16.56 5.500e-30 398-
424 PR00138C 16.41 1.000e-29
164-193 PR00138B 15.82 5.875e-
16 141-157 PR00138A 15.14
1.000e-15 94-108 PR00138E 6.01
8.472e-11 431-445


1293 


BL00024 


Hemopexin domain proteins. 


BL00024C 22.98 1.000e-40 163-
212 BL00024D 17.28 4.316e-36
392-424 BL00024B 21.53 7.545e-
33 115-149 BL00024F 11.30
2.895e-18 486-507 BL00024A
11.49 3.667e-12 94-105
BL00024G 13.31 4.857e-12 525-
538 BL00024E 7.58 2.263e-10
431-445


1293 


PR00480 


ASTACIN FAMILY SIGNATURE 


PR00480B 15.41 1.771e-09 393- 
412 


1293 


PR00013 


FIBRONECTIN TYPE II REPEAT
SIGNATURE 


PR00013C 12.29 7.136e-19 372- 
388 PR00013C 12.29 6.464e-18
313-329 PR00013C 12.29 5.500e-
16 255-271 PR00013B 14.75
4.375e-12 355-368 PR00013A
12.26 1.973e-11 344-354
PR00013A 12.26 6.595e-11 227-
237 PR00013B 14.75 8.568e-10
296-309 PR00013A 12.26 2.167e- 
09 285-295 PR00013B 14.75 
2.588e-09 238-251 


1293 


BL00546 


Matrixins cysteine switch. 


BL00546B 20.11 3.368e-40 164- 
208 BL00546C 16.41 6.400e-31 
392-424 BL00546A 19.62 
6.523e-19 74-104 BL00546E
10.23 7.947e-13 486-507 
BL00546F 12.40 5.339e-09 525- 
538 


1294 


BL00023 


Type II fibronectin collagen-binding domain 
proteins. 


BL00023 24.31 8.043e-34 281- 
318 BL00023 24.31 5.320e-32 
223-260 BL00023 24.31 5.800e-
29 340-377


1294 


BL00546 


Matrixins cysteine switch. 


BL00546B 20.11 3.368e-40 164-
208 BL00546C 16.41 6.400e-31
392-424 BL00546A 19.62
6.523e-19 74-104


1294 


PR00138 


MATRIXIN SIGNATURE 


PR00138D 16.56 5.500e-30 398-
424 PR00138C 16.41 1.000e-29
164-193 PR00138B 15.82 5.875e-
16 141-157 PR00138A 15.14
1.000e-15 94-108


1294 


BL00142 


Neutral zinc metallopeptidases, zinc-binding 
region proteins. 


BL00142 8.38 1.000e-12 398-409


1294 


BL00024 


Hemopexin domain proteins. 


BL00024C 22.98 1.000e-40 163- 
212 BL00024D 17.28 4.316e-36 
392-424 BL00024B 21.53 7.545e- 
33 115-149 BL00024A 11.49 
3.667e-12 94-105 


1294 


PR00480 


ASTACIN FAMILY SIGNATURE 


PR00480B 15.41 1.771e-09 393- 
412 


1294 


PR00013 


FIBRONECTIN TYPE II REPEAT 
SIGNATURE 


PR00013C 12.29 7.136e-19 372- 
388 PR00013C 12.29 6.464e-18 
313-329 PR00013C 12.29 5.500e-
16 255-271 PR00013B 14.75
4.375e-12 355-368 PR00013A
12.26 1.973e-11 344-354
PR00013A 12.26 6.595e-11 227-
237 PR00013B 14.75 8.568e-10
296-309 PR00013A 12.26 2.167e-
09 285-295 PR00013B 14.75
2.588e-09 238-251


1298 


DM01354 


kw TRANSCRIPTASE REVERSE II ORF2. 


DM01354R 8.50 2.969e-22 2115-
2145 DM01354S 11.61 1.692e-14
2145-2166


1298 


DM01688 


2 POLY-IG RECEPTOR.


DM01688D 13.44 8.244e-09 
1714-1737 


1298 


DM00179 


w KINASE ALPHA ADHESION T-CELL. 


DM00179 13.97 5.737e-10 1807- 
1817 DM00179 13.97 7.158e-10 
1077-1087 DM00179 13.97 
9.053e-10 759-769 DM00179
13.97 9.053e-10 1328-1338
DM00179 13.97 4.130e-09 574-
584 DM00179 13.97 4.130e-09
1431-1441 DM00179 13.97 
6.870e-09 1713-1723 DM00179 
13.97 7.652e-09 850-860 
DM00179 13.97 8.435e-09 2089- 
2099 


1298 


BL00240 


Receptor tyrosine kinase class III proteins. 


BL00240B 24.70 8.909e-13 623- 
647 BL00240B 24.70 1.209e-10 
1126-1150 BL00240B 24.70 
4.558e-10 124-148 BL00240B 
24.70 6.442e-10 529-553
BL00240B 24.70 4.255e-09 1222- 
1246 BL00240B 24.70 8.468e-09 
995-1019 


1298 


PD02327 


GLYCOPROTEIN ANTIGEN PRECURSOR 
IMMUNOGLO. 


PD02327B 19.84 8.091e-09 1126-
1148 PD02327B 19.84 9.318e-09
1222-1244 


1298 


PD02870 


RECEPTOR INTERLEUKIN-1 
PRECURSOR. 


PD02870B 18.83 1.200e-10 1610- 
1643 PD02870B 18.83 7.400e-10 
2081-2114 PD02870B 18.83 
7.800e-10 1069-1102 PD02870B
18.83 5.213e-09 1423-1456 
PD02870B 18.83 6.649e-09 67- 
100 PD02870B 18.83 7.989e-09 
1518-1551 PD02870D 15.74 
8.564e-09 566-601 PD02870B 
18.83 9.521e-09 286-319
PD02870B 18.83 9.904e-09 1258- 
1291 




BL00888 


Cyclic nucleotide-binding domain proteins. 


BL00888B 14.79 4.706e-18 372- 
396 BL00888A 18.03 1.000e-08 
354-371 


1301 


PF00615 


Regulator of G protein signalling domain 
proteins. 


PF00615B 16.25 9.625e-16 73-90
PF00615C 10.06 9.206e-12 150- 
164 


1302


BL00766


Tetrahydrofolate 

dehydrogenase/cyclohydrolase proteins. 


BL00766E 13.78 9.625e-39 191- 
228 BL00766C 25.86 4.375e-31 
77-125 BL00766D 17.05 5.966e- 
25 152-182 


1302 


PR00085 


TETRAHYDROFOLATE 

DEHYDROGENASE/CYCLOHYDROLASE 
FAMILY SIGNATURE 


rxwvuojEi ID. /y /.UUUC-20 1 51- 
181 PR00085G 10.74 1.865e-22 
208-227 PR00085C 15.23 6.182e- 
21 47-69 PR00085D 15.02
2.688e-20 92-113 PR00085F 9.77
6.595e-15 191-208 


1303 


BL00180 


Glutamine synthetase proteins. 


BL00180E 17.60 1.000e-40 154- 
206 BL00180D 13.26 2.174e-24 
119-141 BL00180F 10.05 6.211e-
17 218-231 BL00180G 10.20
8.435e-17 307-322 BL00180C
12.14 4.600e-14 102-112
BL00180B 18.03 4.971e-14 68-87

1ST AA1 OA A 11 OA < A/£<» 1 A. A< 


1304 


BL00180 


Glutamine synthetase proteins. 


BL00180F 10.05 6.750e-15 49-62 


1306 


BL01131 


Ribosomal RNA adenine dimethylases 
proteins.


BL01131A 26.62 1.000e-08 77-
123 


1308 


DM00191 


w SPAC8A4.04C RESISTANCE 
SPAC8A4.05C DAUNORUBICIN.


DM00191A 8.16 5.440e-09 36-49


1309 


DM00191 


w SPAC8A4.04C RESISTANCE 
SPAC8A4.05C DAUNORUBICIN.


DM00191A 8.16 5.440e-09 61-74 


1310 


PF00606 


Herpesviral Glycoprotein B. 


PF00606I 20.74 7.894e-09 264-
316 


1310 


BL01219 


Ammonium transporters proteins. 


BL01219D 11.63 2.957e-10 217-
241 BL01219F 15.24 8.809e-09 
335-360 


1310 


PR00342 


RHESUS BLOOD GROUP PROTEIN 
SIGNATURE 


PR00342G 8.18 1.458e-19 220- 
239 PR00342B 11.09 8.657e-13 
61-79 PR00342D 8.46 2.857e-12 
128-145 PR00342I 4.99 6.016e-
12 285-299 PR00342H 7.61
6.927e-11 250-273 PR00342C
10.10 4.770e-10 90-108
PR00342E 14.49 5.950e-10 151-
175 PR00342F 7.02 1.556e-09
185-201 PR00342J 8.97 7.940e- 
09 308-327 PR00342L 7.61 
9.600e-09 398-424 


1311 


PR00209 


ALPHA/BETA GLIADIN FAMILY 
SIGNATURE 


PR00209B 4.88 9.080e-11 80-99
PR00209B 4.88 6.967e-10 86-105 


1311 


DM00406 


GLIADIN. 


DM00406 7.73 1.400e-09 86-99


1311 


PR00501 


KELCH REPEAT SIGNATURE 


PR00501B 18.88 8.342e-09 440- 
455 


1312 


PR00528 


GLUCOCORTICOID RECEPTOR 
SIGNATURE 


PR00528F 9.13 9.063e-09 31-51


1313 


PF00622 


Domain in SPla and the RYanodine Receptor. 


PF00622C 12.62 6.625e-13 759- 
773 


1313 


BL00518 


Zinc finger, C3HC4 type (RING finger), 
proteins. 


BL00518 12.23 6.571e-10 30-39 
BL00518 12.23 1.667e-09 356-
365 


1314 


BL00420 


Speract receptor repeat proteins domain 
proteins. 


BL00420B 22.67 2.824e-25 37-92 
BL00420C 11.90 9.250e-12 122- 
133 


1314 


PR00258 


SPERACT RECEPTOR SIGNATURE 


PR00258D 14.41 6.333e-11 98-
113 PR00258B 9.63 7.474e-11
52-64 PR00258E 13.33 1.750e-09
121-134 PR00258C 9.05 5.167e-
\jy u / - / o 


1315 


PR00080 


ALCOHOL DEHYDROGENASE 
SUPERFAMILY SIGNATURE 


PR00080A 9.32 8.548e-10 122- 
134 


1315 


BL00766 


Tetrahydrofolate 

dehydrogenase/cyclohydrolase proteins.


BL00766C 25.86 7.632e-09 20-68 


1315 


PR00081 


GLUCOSE/RIBITOL DEHYDROGENASE 
FAMILY SIGNATURE 


PR00081A 10.53 2.452e-13 41-59
PR00081C 15.13 9.229e-09 167- 
184 


1317 


BL00263 


Natriuretic peptides proteins. 


BL00263 11.87 5.909e-22 129-
147


1317


PR00711 


ATRIAL NATRIURETIC PEPTIDE 
SIGNATURE 


fKUU/llu 11.75 1.113e-30 128- 
151 PR00711B 10.71 7.545e-24
32-51 PR00711D 7.91 1.000e-22
i z-yu rKU07 1 1 F 3 . 17 2.463e-22 
109-128 PR00711C 5.66 7.818e-
22 51-70 PR00711E 6.39 1.000e-

01 OO 1AO DDAA11 1 A in f\i\ 

9.769e-20 11-30 


1317 


PR00713 


C-TYPE NATRIURETIC PEPTIDE 
SIGNATURE 


PR00713C 14.14 1.370e-13 130-
146 


1317 


PR00710 


NATRIURETIC PEPTIDE FAMILY 
SIGNATURE 


PR00710A 10.90 3.250e-14 127- 
137 PR00710B 11.08 1.391e-12 
136-146 


1317 


PR00712 


BRAIN NATRIURETIC PEPTIDE 
SIGNATURE 


PR00712D 10.52 4.109e-12 128- 
139 PR00712E 10.62 7.231e-10 
138-152 


1318 


BL00609 


Glycosyl hydrolases family 32 proteins. 


BL00609C 13.27 9.270e-11 249-
261 


1318 


BL01187 


Calcium-binding EGF-like domain proteins 


BL01187B 12.04 6.538e-16 757-
773 BL01187B 12.04 7.750e-14
610-626 BL01187B 12.04 8.200e-
14 651-667 BL01187B 12.04
2.029e-10 523-539 BL01187A
9.98 7.429e-10 591-603 


1318 


PR00907 


THROMBOMODULIN SIGNATURE


PR00907B 11.29 6.301e-11 753-
770 PR00907B 11.29 2.636e-10
647-664 PR00907B 11.29 3.524e-
09 519-536 PR00907G 11.63
4.241e-09 651-678


1318 


BL01177 


Anaphylatoxin domain proteins. 


BL01177C 17.39 8.286e-09 517- 


1318 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010C 11.16 1.429e-09 762- 
773 PR00010C 11.16 8.500e-09
528-539 


1318 


BL00022 


EGF-like domain proteins. 


BL00022B 7.54 1.000e-08 619- 
626 


1319 


PR00237 


RHODOPSIN-LIKE GPCR SUPERFAMILY
SIGNATURE 


DD AAOQ^E 1 *1 AO A AAA 1 A A /■ mf\ 

rtU)v2.37b 13.03 4.000e-10 26-50 


1320 


BL00125 


Serine/threonine specific protein 
phosphatases proteins. 


"RT ftftl / )<Tl 07 1 1 A i f i\~ t e A'* ta 

dluuizjU jj.ii 9.719e-35 23-78 


1320 


PR00114 


SERINE/THREONINE PHOSPHATASE
FAMILY SIGNATURE 


PP Oft 1 1 AX! 1 T <T 1 vl nr\£~ i /: *> a ^-a 

rKuui i^r i /.^i 4.706e-16 39-60 
PR00114G 17.20 5.421e-12 61-78


1321 


BL00453 


FKBP-type peptidyl-prolyl cis-trans
isomerase proteins. 


"RT OAA^I'D 01 B£ £ oc A a^ aoi 

DL,\Jv<iD3t> zi.so o.538e-26 281- 
315 BL00453A 15.57 8.364e-12 

1 1 323-336 


1321 


PR00280 


CHANNEL FORMING COLICIN 
SIGNATURE 


PR00280A 11.09 8.227e-09 284-
300 


1322


PR00497 


NEUTROPHIL CYTOSOL FACTOR P40
SIGNATURE 


PR00497A 6.92 8.261e-09 310- 
328 


1322 


BL50002 


Src homology 3 (SH3) domain proteins 
profile. 


BL50002B 15.18 9.500e-09 45-59


1323 


PR00449 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449A 13.20 8.269e-16 34-56 


1323 


BL01115 


GTP-binding nuclear protein ran proteins. 


BL01115A 10.22 2.474e-09 34-78


1323 


PR00300 


ATP-DEPENDENT CLP PROTEASE ATP- 
BINDING SUBUNIT SIGNATURE


PR00300A 9.56 6.260e-09 36-55


1323


BL00567 


Phosphoribulokinase proteins. 


BL00567A lO.oo y.l00e-0y 35-54 


1324 


PR00109


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 9.847e-10 314- 
333 


1324 


PF00242 


DNA polymerase (viral) N-terminal domain
proteins. 


PF00242G 13.52 6.276e-09 748- 
788 


1324 


BL00107 


Protein kinases ATP-binding region proteins. 


BL00107B 13.31 1.000e-11 381-
397 BL00107A 18.39 8.091e-09 
314-345 


1325 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 9.847e-10 314-
333 


1325 


PF00242 


DNA polymerase (viral) N-terminal domain 
proteins. 


PF00242G 13.52 6.276e-09 721- 
761 


1325 


BL00107 


Protein kinases ATP-binding region proteins. 


BL00107B 13.31 1.000e-11 381-
397 BL00107A 18.39 8.091e-09 
314-345 


1326 


BL00472 


Small cytokines (intercrine/chemokine) C-C 
subfamily signatur. 


BL00472C 20.76 8.225e-09 50-87 


1327 


PR00705 


PAPAIN CYSTEINE PROTEASE (C1)
FAMILY SIGNATURE 


PR00705A 10.55 8.667e-13 114- 
130 PR00705B 10.22 2.385e-10
293-304 


1327 


BL00139 


Eukaryotic thiol (cysteine) proteases cysteine 
proteins. 


BL00139D 9.24 8.125e-17 312- 
329 BL00139C 9.23 2.800e-10 
292-302 BL00139B 10.19 7.600e- 
10 157-166 BL00139A 10.29 
2.723e-09 114-124 


1328 


PR00705 


PAPAIN CYSTEINE PROTEASE (C1)
FAMILY SIGNATURE 


PR00705A 10.55 8.667e-13 155- 
171 PR00705B 10.22 2.385e-10 
334-345 


1328 


BL00139 


Eukaryotic thiol (cysteine) proteases cysteine 
proteins. 


BL00139D 9.24 8.125e-17 353- 
370 BL00139C 9.23 2.800e-10
333-343 BL00139B 10.19 7.600e-
10 198-207 BL00139A 10.29
2.723e-09 155-165 


1330 


PD01270 


RECEPTOR FC IMMUNOGLOBULIN 
AFFIN. 


PD01270A 17.22 7.443e-10 129- 
169 PD01270A 17.22 7.387e-09 
36-76 


1332 


BL00412 


Neuromodulin (GAP-43) proteins. 


BL00412D 16.54 6.772e-10 250- 
301 


1332 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 2.068e-09 751- 
784 


1333 


BL00232 


Cadherins extracellular repeat proteins 
domain proteins. 


BL00232B 32.79 8.594e-35 151- 
199 BL00232B 32.79 5.579e-22 
260-308 BL00232A 27.72 
1.000e-20 57-90 BL00232C 10.65 
3.613e-14 258-276 BL00232B 
32.79 4.872e-11 377-425
BL00232C 10.65 3.211e-09 480-
498 


1333 


DM01724 


kw ALLERGEN POLLEN CIM1 HOL-LI. 


DM01724 8.14 9.113e-10 698-718
DM01724 8.14 6.803e-09 694-714 


1333 


PR00205 


CADHERIN SIGNATURE 


PR00205B 11.39 4.545e-15 258-
276 PR00205A 14.73 5.600e-09
183-199 PR00205B 11.39 8.017e-
09 480-498


1335 


BL00214 


Cytosolic fatty-acid binding proteins. 


BL00214B 26.51 9.000e-29 47-92 
BL00214A 21.17 1.000e-24 6-32 


1335 


PR00178 


FATTY ACID-BINDING PROTEIN
SIGNATURE


PR00178C 20.54 3.864e-25 65-93
PR00178A 15.07 7.188e-23 7-28
PR00178D 13.52 6.170e-12 111-
130


1336 


PR00452 


SH3 DOMAIN SIGNATURE 


PR00452B 11.65 8.250e-09 509-
525 


1JJO 


BL00972


Ubiquitin carboxyl-terminal hydrolases 


BL00972A 11.93 8.759e-17 112-
130 BL00972D 22.55 & life- 12 
354-379 BL00972B 9.45 7.088e-
09 193-203 


1340 


BL00243 


Integrins beta chain cysteine-rich domain 
proteins. 


BL00243I 31.77 3.155e-09 1-44


1340 


PR00633 


CHROMOSOME CONDENSATION 
REGULATOR RCC1 SIGNATURE 


PR00633E 12.18 4.682e-10 182- 
199 PR00633G 13.71 1.667e-09
185-204 PR00633H 15.10 3.963e- 
09 244-266 




BL00625 


Regulator of chromosome condensation 
(RCC1) proteins. 


BL00625B 17.69 5.219e-15 179- 
213 BL00625B 17.69 9.194e-14 
343-377 BL00625A 16.21 
4.405e-12 185-214 BL00625A 
16.21 5.500e-12 129-158 
BL00625A 16.21 7.203e-12 349- 
378 BL00625B 17.69 5.778e-10 
123-157 BL00625B 17.69 5.034e- 
09 285-319 


1 1A1 


BL00476


Fatty acid desaturases family 1 proteins. 


BL00476F 12.75 6.551e-09 45-90 


1345 


PR00320 


G-PROTEIN BETA WD-40 REPEAT 
SIGNATURE


PR00320A 16.74 9.690e-11 292-
307 PR00320B 12.19 4.343e-10 
292-307 PR00320C 13.01 7.840e- 
10 292-307 


1345 


BL00678 


Trp-Asp (WD) repeat proteins proteins. 


BL00678 9.67 1.000e-09 294-305 


i jt+j 


BL00225


Crystallins beta and gamma 'Greek key' motif
proteins. 


BL00225B 18.06 7.273e-22 6-41 
BL00225B 18.06 5.673e-14 97-
132 BL00225A 13.82 7.218e-09 
61-82 


1350 


PD01823


PROTEIN INTERGENIC REGION ABC1
PRECURSOR MITOCHONDRION T. 


PD01823D 16.66 3.093e-15 21-42 
PD01823E 9.30 5.909e-15 75-88 


1352 




Ferritin iron-binding regions proteins. 


BL00540A 15.06 1.000e-40 9-50
BL00540B 18.82 1.000e-40 100-
155 BL00540C 13.00 7.500e-15
165-177 


1353 


PR00294 


STREPTOMYCES SUBTILISIN 
INHIBITOR SIGNATURE 


PR00294A 10.44 6.444e-10 159- 
186 


1353 


DM00191


w SPAC8A4.04C RESISTANCE
SPAC8A4.05C DAUNORUBICIN. 


DM00191D 13.94 7.167e-10 206-
245 


1356 


BL00428


Cell cycle proteins ftsW / rodA / spoVE
proteins. 


BL00428A 14.30 3.613e-09 91- 
110 


1359 


BL00142 


Neutral zinc metallopeptidases, zinc-binding
region proteins. 


BL00142 8.38 7.188e-10 389-400 


1359 


PD01719 


PRECURSOR GLYCOPROTEIN SIGNAL 
RE. 


PD0171QA l9RQ7QQio Keen 

578 PD01719B 9.30 1.750e-09
877-885 PD01719A 12.89
3.000e-09 1006-1034 


1359 


PR00480 


ASTACIN FAMILY SIGNATURE 


PR00480B 15.41 3.186e-09 384- 
403 


1360 


DM00191 


w SPAC8A4.04C RESISTANCE 
SPAC8A4.05C DAUNORUBICIN 


DM00191D 13.94 6.330e-11 232-
271 DM00191D 13.94 7.728e-11
48-87 DM00191D 13.94 5.000e-
10 112-151 DM00191D 13.94
5.667e-10 59-98 DM00191D
13.94 5.667e-10 123-162 
DM00191D 13.94 6.583e-10 56- 
95 DM00191D 13.94 8.417e-10
280-319 DM00191D 13.94 
8.917e-10 192-231 DM00191D 
13.94 1.391e-09 224-263 
DM00191D 13.94 2.409e-09 208- 
247 DM00191D 13.94 4.835e-09 
120-159 DM00191D 13.94 
5.304e-09 149-188 DM00191D 
13.94 5.461e-09 211-250
DM00191D 13.94 6.322e-09 80- 
119 DM00191D 13.94 7.652e-09 
243-282 DM00191D 13.94 
8.513e-09 216-255 DM00191D 
13.94 9.452e-09 177-216 


1360 


PF00624 


Flocculin repeat proteins. 


PF00624J 6.21 3.496e-11 237-292
PF00624J 6.21 6.597e-11 53-108
PF00624J 6.21 4.121e-10 253-308
PF00624J 6.21 5.718e-10 141-196
PF00624F 11.04 1.508e-09 50-86
PF00624J 6.21 3.163e-09 101-156 
PF00624J 6.21 3.233e-09 165-220 
PF00624I 9.10 5.181e-09 140-170 
PF00624F 11.04 6.008e-09 130- 
166 PF00624J 6.21 6.093e-09
125-180 PF00624J 6.21 6.163e-
09 221-276 PF00624G 10.91 
6.806e-09 45-100 PF00624G 
10.91 7.169e-09 181-236
PF00624G 10.91 7.387e-09 221- 
276 PF00624J 6.21 8.674e-09
197-252 PF00624J 6.21 8.884e- 
09 117-172 PF00624J 6.21
8.884e-09 213-268 PF00624J 
6.21 9.512e-09 55-110 


1360 


BL00115 


Eukaryotic RNA polymerase II heptapeptide 
repeat proteins. 


BL00115Z 3.12 6.163e-10 22-71
BL00115Z 3.12 7.638e-09 36-85



1363 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 1.321e-09 110- 
126 


1363 


PF00791 


Domain present in ZO-1 and Unc5-like netrin 
receptors. 


PF00791B 28.49 7.527e-13 110- 
165 PF00791B 2o.49 o.liye-uy 
77-132 PF00791C 20.98 7.529e- 
09 91-130 


1366 


PF00168 


C2 domain proteins. 


PF00168C 27.49 9.250e-17 320- 
346 


1366 


PR00399 


SYNAPTOTAGMIN SIGNATURE 


PR00399A 9.52 1.844e-14 148- 
164 PR00399C 12.82 8.071e-14
218-234 PR00399B 14.27 2.853e- 
13 163-177 PR00399D 14.48 
1.871e-11 238-249


1366 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 8.759e-12 337- 
351 PR00360A 14.59 3.903e-10 
308-321 PR00360B 13.61 4.789e- 
10 203-217


1367


PF00168 


C2 domain proteins. 


PF00168C 27.49 9.250e-17 320- 
346 


1367 


PR00399 


SYNAPTOTAGMIN SIGNATURE 


PR00399A 9.52 1.844e-14 148-
164 PR00399C 12.82 8.071e-14 
218-234 PR00399B 14.27 2.853e- 
13 163-177 PR00399D 14.48 
1.871e-11 238-249


1367 


PR00360


C2 DOMAIN SIGNATURE 


PR00360B 13.61 8.759e-12 337- 
351 PR00360A 14.59 3.903e-10 
308-321 PR00360B 13.61 4.789e- 
10 203-217 


1368 


BL00086 


Cytochrome P450 cysteine heme-iron ligand 
proteins. 


BL00086 20.87 1.857e-20 444- 
476 


1368 


PR00464 


E-CLASS P450 GROUP II SIGNATURE 


PR00464I 14.64 4.375e-17 454-
478 PR00464A 20.47 9.591e-16
130-151 PR00464C 18.84 1.000e-
15 305-334 PR00464D 17.40
6.250e-15 334-352 PR00464H
13.32 8.941e-15 441-455
PR00464F 15.23 9.654e-13 403- 
419 PR00464B 20.41 1.844e-12 
186-205 PR00464E 18.28 7.907e- 
12 363-384 PR00464G 12.41 
8.412e-12 419-435 


1368 


PR00465 


E-CLASS P450 GROUP IV SIGNATURE 


PR00465H 17.76 6.586e-10 454- 
473 


1368 


PR00385 


P450 SUPERFAMILY SIGNATURE 


PR00385A 14.97 1.346e-12 316- 
334 PR00385B 10.22 4.130e-11
334-348 PR00385D 13.11 7.857e- 
10 445-455 PR00385E 12.66 
9.438e-10 454-466


1368 


PR00463 


E-CLASS P450 GROUP I SIGNATURE 


PR00463G 18.24 3.605e-14 409- 
434 PR00463E 17.37 4.814e-11
325-352 PR00463I 15.02 5.574e-
09 454-478 PR00463H 12.41 
7.158e-09 444-455 


1370 


BL00218 


Amino acid permeases proteins. 


BL00218D 21.49 9.757e-11 263-
308 


1371 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 3.288e-09 35-50 


1372 


PR00380


KINESIN HEAVY CHAIN SIGNATURE 


PR00380A 14.18 4.086e-22 84- 
106 PR00380C 13.18 5.286e-17
240-259 PR00380D 9.93 7.698e-
17 290-312 PR00380B 12.64 
7.805e-14 207-225 


15 11 


BL00411


Kinesin motor domain proteins. 


BL00411G 21.39 7.750e-25 241- 
283 BL00411C 15.04 2.500e-22 
84-106 BL00411H 15.66 8.235e- 
16 289-320 BL00411E 10.43
9.129e-16 135-154 BL00411F
14.77 9.795e-16 198-223 
BL00411D 12.13 5.909e-09 114- 
125 


1373 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 1.915e-09 590- 
605 


1373 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019A 11.19 5.800e-12 158-
172 PR00019B 11.36 1.000e-10
130-144 PR00019A 11.19 8.826e-
10 133-147 PR00019B 11.36
7.120e-09 106-120


1373 


PR00500 


POLYCYSTIC KIDNEY DISEASE 
PROTEIN SIGNATURE 


PR00500B 7.74 7.821e-09 250- 
271 


1374 


BL00411 


Kinesin motor domain proteins. 


Of AA/4 1 ITt K Hii T OH q OO TO 

J3L00411Ii 15.00 /.olle-ZZ /y- 
110 BL00411G 21.39 8.683e-22 
3 1 -7.5 


1374 


PR00380 


KINESIN HEAVY CHAIN SIGNATURE 


PR00380C 13.18 2.385e-16 30-49 
PR00380D 9.93 3.739e-16 80-102


1376 


BL00790 


Receptor tyrosine kinase class V proteins. 


BL00790E 29.58 6.667e-12 767- 
815 


1376 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010C 11.16 5.636e-10 423-
434 PR00010C 11.16 8.071e-09
148-159 


1376 


BL00022 


EGF-like domain proteins. 


BL00022B 7.54 8.200e-09 427- 
434 


1376 


PR00907 


THROMBOMODULIN SIGNATURE 


PR00907B 11.29 7.312e-10 224-
241 PR00907G 11.63 5.297e-09
62-89 PR00907B 11.29 8.354e-09
98-115 PR00907B 11.29 9.451e- 
09 334-351 


1376 


BL01187 


Calcium-binding EGF-like domain proteins 
pattern proteins. 


BL01187B 12.04 5.235e-15 62-78
BL01187B 12.04 5.765e-15 418-
434 BL01187B 12.04 3.000e-12
143-159 BL01187B 12.04 7.333e-
12 297-313 BL01187B 12.04
7.000e-ll 338-354 BL01187B 
12.04 4.857e-10 378-394 
BL01187B 12.04 5.886e-10 102- 
118 BL01187A 9.98 6.571e-10
321-333 BL01187A 9.98 5.125e-
09 126-138 BL01187A 9.98
9.625e-09 362-374


1377 


BL00048 


Protamine P1 proteins.


BL00048 6.39 4.038e-09 396-423 


1381 


BL00107 


Protein kinases ATP-binding region proteins. 


BL00107A JOy o.5UUe-Z/ 342- 
3 /3 


1381 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 5.412e-12 342- 
361 


1381 


PD00306 


PROTEIN GLYCOPROTEIN PRECURSOR 
RE. 


PD00306A 10.26 6.143e-09 25-39 


1382 


PR00910


LUTEOVIRUS ORF6 PROTEIN 
SIGNATURE 


PR00910A 2.51 6.036e-09 48-61 


1388 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019A 11.19 7.261e-10 69-83 
PR00019B 11.36 4.600e-09 66-80 


1392 


BL00018 


EF-hand calcium-binding domain proteins. 


BL00018 7.41 6.870e-09 42-55


1396 


BL00790 


Receptor tyrosine kinase class V proteins. 


BL00790B 21.59 1.000e-40 61- 
113 BL00790C 16.65 1.000e-40 
165-219 BL00790K 9.30 1.000e-
40 657-711 BL00790Q 15.61 

7.68 5.929e-39 797-830 
BL00790G 22.06 5.114e-36 376- 
420 BL00790R 16.20 7.469e-36 
951-995 BL00790E 29.58 7.250e-
35 273-321 BL00790J 14.21 
8.200e-33 605-645 BL00790N 
13.25 1.214e-31 763-790
BL00790I 20.01 1.931e-29 501-
532 BL00790D 12.41 2.500e-27
243-268 BL00790H 13.42

8.74 8.683e-25 741-763 
BL00790P 12.33 3.755e-24 830- 
855 BL00790F 1 5 90 5 Sflfie-Od 

339-366 BL00790L 11.16 5.909e-
21 721-741 BL00790A 19.74 
1.964e-19 31-53 


1396 


BL00240 


Receptor tyrosine kinase class III proteins. 


BL00240F 17.74 9.500e-16 789-
837 BL00240E 11.56 1.439e-15 

73rt-77d Rl 0ft9dnO 9R d5 

8.793e-15 836-889


1396 


BL00107 


Protein kinases ATP-binding region proteins. 


BL00107A 18.39 3.647e-20 750- 
781 BL00107B 13.31 5.091e-13 
818-834 


1396 


PR00109 


TYROSINE KINASE CATALYTIC 

DOMAIN SIGNATURE


PR00109D 17.04 9.100e-22 819- 
842 PR00109E 14.41 7.429e-19
863-886 PR00109B 12.27 5.125e- 
18 750-769 PR00109A 15.00 
2.895e-13 713-727 PR00109C 


1396 


BL00239 


Receptor tyrosine kinase class II proteins. 


BL00239E 17.14 5.426e-27 790- 
840 BL00239B 25.15 3.000e-23
684-732 BL00239F 28.15 8.132e- 
21 844-889 BL00239D 16.81 

1 R 75 3 IdRp 1 n 7T 7-760 


1396 


BL50001 


Src homology 2 (SH2) domain proteins 

profile.


BL50001B 17.40 2.714e-11 747-
7fiR rt 5nnnin 1 1 An 7 inn*, in 

818-829 BL50001C 10.17 1.000e-
09 799-810


1396 


PR00014 


FIBRONECTIN TYPE III REPEAT 
SIGNATURE 


PR00014C 15.44 8.071e-13 490- 
509 PR00014B 14.77 3.400e-10 
467-478 PR00014D 12.04 6.824e- 
10 508-523 PR00014A 8.22 
3.455e-09 342-352 


1401 


BL00027 


'Homeobox' domain proteins. 


BL00027 26.43 2.000e-11 84-127


1403 


BL01160


Kinesin light chain repeat proteins. 


BL01160B 19.54 5.958e-09 387-
441


1404 


BL01113 


Clq domain proteins. 


BL01113B 18.26 2.500e-13 841-
877 


1406 


BL01206 


Amiloride-sensitive sodium channels 
proteins.


BL01206D 30.58 3.025e-28 363- 

530-576 BL01206F 16.40 7.643e- 
15 485-506 BL01206E 20.72
5.650e-14 427-454 BL01206C
12.30 3.455e-12 333-352 
BL01206B 13.56 1.205e-10 313- 
327 


1408 


BL01220 


Phosphatidylethanolamine-binding protein 
family proteins.


BL01220B 16.65 1.000e-40 59- 
100 BL01220C 14.75 5.846e-34 
100-128 BL01220A 22.62 
3.400e-31 21-52


1409 


BL00815 


Alpha-isopropylmalate and homocitrate 
synthases proteins. 


BL00815C 21.36 3.118e-09 786- 
815 


1412 


PR00049 


WILMS TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 4.051e-09 1-16 


1412 


PR00806 


VINCULIN SIGNATURE 


PR00806B 4.28 9.640e-09 3-17 


1418 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 5.200e-09 453-
466


1418 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 7.882e-14 524- 
541 BL00028 16.07 8.269e-ll 
555-572 BL00028 16.07 2.543e- 
09 437-454 BL00028 16.07 
4.600e-09 408-425 BL00028 
16.07 6.657e-09 465-482 


1418 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 3.160e-09 521- 
535 PR00048A 10.52 4.960e-09 
434-448 PR00048A 10.52 6.760e- 
09 552-566 PR00048A 10.52 
7.840e-09 462-476 


1419 


BL00022 


EGF-like domain proteins. 


BL00022A 7.48 5.000e-09 177- 
184 BL00022A 7.48 5.000e-09 
241-248 BL00022A 7.48 8.000e- 
09 49-56 


1419 


PR00011 


TYPE III EGF-LIKE SIGNATURE


PR00011D 14.03 5.696e-09 182- 
201 PR00011D 14.03 6.478e-09 
86-105 PR00011D 14.03 9.087e- 
09 118-137


1419 


DM01842 


1 CELLULOSE-BINDING DOMAIN, 
BACTERIAL TYPE. 


DM01842 11.31 9.922e-09 94-141 


1421 


PR00371 


FLAVOPROTEIN PYRIDINE 
NUCLEOTIDE CYTOCHROME 
REDUCTASE SIGNATURE 


PR00371D 14.55 4.536e-11 385-
405 


1421 


PR00406


CYTOCHROME B5 REDUCTASE 
SIGNATURE 


PR00406D 10.02 6.538e-10 385- 
405 


1421 


PR00409 


PHTHALATE DIOXYGENASE 
REDUCTASE FAMILY SIGNATURE 


PR00409F 12.70 2.484e-09 385- 
405 


1421 


PR00466 


CYTOCHROME B-245 HEAVY CHAIN 
SIGNATURE 


PR00466E 6.82 6.958e-17 386- 
404 PR00466C 10.17 8.244e-09 
195-216 


1422 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 4.462e-11 1087-
1104 


1422 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI.


PD00066 13.92 6.478e-11 1075-
1088 


1422 


BL00319 


Amyloidogenic glycoprotein extracellular 
domain proteins. 


BL00319C 17.12 4.375e-10 1154- 
1188 


1422 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 4.522e-10 1084-
1098 PR00048B 6.02 1.474e-09
1072-1082 PR00048A 10.52 
6.760e-09 1056-1070 


1423 


PR00260 


BACTERIAL CHEMOTAXIS SENSORY 
TRANSDUCER SIGNATURE 


PR00260C 10.26 9.294e-09 146- 
167 


1424 


BL00845 


CAP-Gly domain proteins. 


BL00845 16.43 6.442e-21 405- 
430 BL00845 16.43 9.820e-19 
203-228 


1426 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI.


PD00066 13.92 3.769e-15 369- 
382 PD00066 13.92 4.462e-15
285-298 PD00066 13.92 2.800e- 
14 257-270 PD00066 13.92 
5.200e-14 313-326 PD00066 
13.92 8.962e-10 341-354 


1426 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 5.050e-13 269- 
286 BL00028 16.07 5.050e-13
297-314 BL00028 16.07 2.500e-
10 325-342 BL00028 16.07
5.200e-10 353-370 BL00028
16.07 7.000e-10 241-258
BL00028 16.07 9.700e-10 381- 
398 


1426 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 8.500e-17 266-
280 PR00048A 10.52 5.500e-14
294-308 PR00048A 10.52 4.706e-
12 350-364 PR00048B 6.02
6.000e-12 310-320 PR00048B
6.02 6.538e-11 394-404
PR00048A 10.52 2.565e-10 238- 
252 PR00048B 6.02 2.688e-10 
254-264 PR00048B 6.02 4.375e- 
10 338-348 PR00048A 10.52 
5.304e-10 378-392 PR00048A
10.52 9.609e-10 322-336 
PR00048B 6.02 5.263e-09 282- 
292 PR00048B 6.02 6.211e-09
366-376 


1429 


PR00671 


INHIBIN BETA B CHAIN SIGNATURE


PR00671C 4.18 5.345e-09 9-29 


1431 


DM01803 


1 HERPESVIRUS GLYCOPROTEIN H. 


DM01803A 10.51 6.855e-09 215- 
236 


1431 


BL00226 


Intermediate filaments proteins. 


BL00226D 19.10 7.400e-09 390- 
437


1432 


DM01803 


1 HERPESVIRUS GLYCOPROTEIN H. 


DM01803A 10.51 6.855e-09 251- 
272 


1432 


BL00226 


Intermediate filaments proteins. 


BL00226D 19.10 7.400e-09 426- 
473 


1434 


PR00545 


RETINOIC ACID RECEPTOR 
SIGNATURE 


PR00545A 5.35 9.430e-09 383- 
398 


1436 


BL01238 


GDA1/CD39 family of nucleoside 
phosphatases proteins. 


BL01238A 11.72 7.840e-16 76-91 


1437 


PD00930 


PROTEIN GTPASE DOMAIN 
ACTIVATION. 


PD00930B 33.72 2.800e-26 1256- 
1297 PD00930A 25.62 3.864e-13 
1152-1178 


1437 


PF00620 


GTPase-activator protein for Rho-like 
GTPases. 


PF00620B 14.20 7.000e-12 1205- 
1222 


1437 


PR00683 


SPECTRIN PLECKSTRIN HOMOLOGY 
DOMAIN SIGNATURE 


PR00683B 16.62 2.603e-10 946- 
968 PR00683D 15.87 2.773e-09 
1005-1024 


1437 


PR00543 


OESTROGEN RECEPTOR SIGNATURE 


PR00543H 10.86 7.573e-09 556- 
576 


1437 


PF00595 


PDZ domain proteins (Also known as DHR 
or GLGF). 


PF00595 13.40 7.600e-09 90-101 


1437 


BL00275 


Shiga/ricin ribosomal inactivating toxins
proteins signatu. 


BL00275A 12.16 7.677e-09 1226- 
1240 


1441 


BL00223 


Annexins repeat proteins domain proteins. 


BL00223B 28.47 1.000e-40 140- 
190 BL00223C 24.79 1.000e-40 
217-272 BL00223A 15.59 
5.500e-32 21-55 BL00223A 
15.59 4.783e-14 230-264 
BL00223C 24.79 2.515e-10 8-63 
BL00223A 15.59 6.250e-10 71- 
105 


1441 


PR00199 


ANNEXIN TYPE III SIGNATURE 


PR00199G 9.09 8.364e-21 239- 
265 PR00199F 16.19 5.636e-16 
158-185 PR00199D 5.65 5.375e-
14 25-47 PR00199B 6.86 1.574e-
13 30-53 PR00199D 5.65 7.987e-
13 234-256 PR00199H 12.62
5.339e-12 282-296 PR00199D 
5.65 9.276e-10 75-97 


1441 


PR00200 


ANNEXIN TYPE IV SIGNATURE 


PR00200F 13.72 1.118e-35 158- 
185 PR00200G 9.43 1.000e-34
238-265 PR00200B 7.39 1.643e-
29 30-53 PR00200H 13.68
1.766e-18 282-296 PR00200E
10.00 6.160e-16 75-97 PR00200E
10.00 2.111e-14 25-47 PR00200A
4.93 2.125e-14 5-16 PR00200C
8.76 1.500e-12 54-63 PR00200E
10.00 2.859e-11 234-256
PR00200G 9.43 5.294e-11 29-56
PR00200D 10.01 9.722e-10 70-87 


1441 


PR00197 


ANNEXIN TYPE I SIGNATURE 


PR00197F 9.03 5.250e-16 238- 
259 PR00197D 7.50 1.250e-15
25-47 PR00197E 11.89 8.463e-14
158-185 PR00197D 7.50 1.542e-
12 234-256 PR00197D 7.50
5.451e-10 75-97 PR00197B 7.56
2.206e-09 30-53


1441 


PR00198


ANNEXIN TYPE II SIGNATURE


PR00198G 8.09 7.943e-16 238- 
259 PR00198D 7.65 2.271e-13
234-256 PR00198D 7.65 9.894e- 
13 25-47 PR00198E 14.67 
6.381e-11 158-185 PR00198H
12.05 1.462e-10 282-296 
PR00198B 8.71 9.357e-10 30-53 
PR00198D 7.65 4.845e-09 75-97 


1441 


PR00201 


ANNEXIN TYPE V SIGNATURE 


PR00201G 11.02 9.419e-26 238-
265 PR00201A 6.05 4.770e-16
30-53 PR00201E 12.37 4.103e-15
158-185 PR00201H 12.04 4.375e-
14 282-296 PR00201D 10.49
4.150e-10 75-97 PR00201G 11.02
8.402e-10 29-56 PR00201D 10.49
6.179e-09 25-47 


1441 


PR00301 


70 KD HEAT SHOCK PROTEIN 
SIGNATURE 


PR00301D 15.51 7.395e-09 38-59 


1441 


PR00196 


ANNEXIN FAMILY SIGNATURE 


PR00196D 21.86 3.032e-24 158- 
185 PR00196E 9.19 8.333e-23
238-259 PR00196A 11.16 9.100e-
21 30-53 PR00196F 13.89
2.714e-15 266-282 PR00196C
10.36 5.167e-15 25-47 PR00196G
11.72 3.000e-14 282-296
PR00196C 10.36 7.344e-13 234- 

75-97 PR00196G 11.72 9.217e-10 
207-221 PR00196F 13.89 4.188e- 
09 107-123 PR00196A 11.16
7.840e-09 80-103 


1441 


PR00202 


ANNEXIN TYPE VI SIGNATURE 


PR00202G 8.01 4.833e-28 238-
265 PR00202E 13.00 4.643e-16
158-185 PR00202D 5.58 9.604e-
13 75-97 PR00202B 11.44
2.763e-11 29-53 PR00202H 9.20
4.740e-11 282-296 PR00202D
5.58 1.908e-09 25-47 PR00202G
8.01 9.237e-09 29-56 


1444 


DM01513 


CAMP-DEPENDENT PROTEIN KINASE 
REGULATORY CHAIN. 


DM01513A 13.61 8.568e-14 15- 
56 


1445 


BL00603 


Thymidine kinase cellular-type proteins. 


BL00603C 30.02 1.000e-40 152-
207 BL00603A 20.71 4.500e-33 
63-96 BL00603D 10.53 5.091e- 
18 217-232 BL00603B 11.39 
3.455e-15 132-147 


1446


PD01922 


PROTEIN PHOSPHODIESTERASE 
HYDROL. 


PD01922B 21.83 7.328e-14 162- 
198 


1447 


BL00061 


Short-chain dehydrogenases/reductases 
family proteins. 


BL00061B 25.79 1.931e-13 99-
137 


1448 


BL01160 


Kinesin light chain repeat proteins. 


BL01160B 19.54 5.958e-09 64- 
118 


1449 


PF00856 


SET domain proteins. 


PF00856A 26.14 8.579e-11 5-42


1449 


PF00628 


PHD-finger. 


PF00628 15.84 5.500e-10 11-26 


1452 


BL00030 


Eukaryotic RNA-binding region RNP-1
proteins. 


BL00030B 7.03 3.400e-10 116- 
126 


1454 


PF00075 


RNase H. 


PF00075D 10.71 7.000e-11 517-
528 PF00075C 11.58 9.786e-11
484-496 PF00075B 12.56 4.073e- 
10 449-460 PF00075A 14.44 
2.143e-09 402-419 


1454 


PR00211 


GLUTELIN SIGNATURE 


PR00211B 0.86 4.417e-09 138-
159 


1456 


BL00262 


Insulin family proteins. 


BL00262B 16.89 8.286e-17 68-88
BL00262A 12.48 4.600e-15 32-50


1456 


PR00277 


INSULIN B CHAIN SIGNATURE 


PR00277A 14.82 2.421e-13 29-43 
PR00277B 12.79 2.350e-11 43-56


1456 


PR00276 


INSULIN A CHAIN SIGNATURE 


PR00276A 11.84 4.750e-13 69-79
PR00276B 8.02 7.828e-10 78-88 


1457 


PR00213 


MYELIN P0 PROTEIN SIGNATURE 


PR00213E 5.51 9.775e-12 264- 
289 


1459 


BL00856 


Guanylate kinase proteins. 


BL00856C 29.21 2.658e-26 539- 
587 BL00856B 9.61 2.946e-18
511-532 


1459 


PR00452 


SH3 DOMAIN SIGNATURE 


PR00452B 11.65 2.750e-09 369-
385 


1459 


PD00289 


PROTEIN SH3 DOMAIN REPEAT 
PRESYNA. 


PD00289 9.97 6.586e-09 298-312 


1459 


PF00595 


PDZ domain proteins (Also known as DHR 
or GLGF).


PF00595 13.40 8.800e-09 295-306 


1461 


PR00475 


HEXOKINASE FAMILY SIGNATURE 


PR00475B 14.92 6.143e-26 186- 
212 PR00475E 16.08 2.742e-22 
327-350 PR00475F 9.68 4.000e-
20 407-430 PR00475A 14.06 
3.118e-19 118-135 PR00475C 
11.92 6.684e-19 239-256 
PR00475G 9.08 1.692e-16 479-
496 PR00475D 13.30 2.653e-13 
262-277 PR00475G 9.08 2.650e-
10 32-49


1461 


BL00378 


Hexokinases proteins. 


BL00378C 16.14 1.000e-40 243- 
287 BL00378E 22.92 5.821e-40
313-359 BL00378B 14.23 3.647e- 
32 98-1 BL00378F 8.27
2.688e-17 481-496 BL00378D 


10.94 1.4/401J Ayi m J\)j 

BL00378A 19.01 8.694e-11 59-87
BL00378F 8.27 3.714e-10 34-49


1464 


PR00722 


CHYMOTRYPSIN SERINE PROTEASE 
FAMILY (S1) SIGNATURE


PR00722A 12.27 8.448e-14 56-72 


1464 


BL00021 


Kringle domain proteins. 


BL00021B 13.33 1.763e-13 55-73 


1464 


PR00839 


V8 SERINE PROTEASE FAMILY 
SIGNATURE 


PR00839B 11.20 4.945e-09 55-73 


1464 


BL01253 


Type I fibronectin domain proteins. 


BL01253E 16.01 6.381e-09 125- 
162 


1464


BL00134 


Serine proteases, trypsin family, histidine 
proteins. 


BL00134A 11.96 3.813e-15 55-72 
BL00134B 15.99 7.200e-10 186- 
210 BL00134C 13.45 9.200e-09
219-233 


1466 


BL00291 


Prion protein. 


BL00291A 4.49 9.379e-09 105-
140 


1467 


PF00534 


Glycosyl transferases group 1. 


PF00534B 14.47 9.581e-12 398- 
422 


1468 


PF01105 


emp24/gp25L/p24 family. 


PF01105B 25.12 2.868e-25 126- 
178


1469 


PF01105 


emp24/gp25L/p24 family. 


PF01105B 25.12 2.868e-25 151- 
203 


1470 


PR00305 


14-3-3 PROTEIN ZETA SIGNATURE 


PR00305A 9.33 9.500e-36 37-67 
PR00305E 13.01 4.316e-32 177- 
204 PR00305D 16.34 3.647e-30 
150-177 PR00305F 15.95 1.964e- 
26 204-234 PR00305C 8.68 
3. 182e-26 115-138 PR00305B 
9.99 4.857e-24 84-109 PR00305F 
15.95 8.975e-15 215-245 


1470 


BL00796 


14-3-3 proteins. 


BL00796C 17.44 1.000e-40 99- 


149 BL00796D 17.39 1.000e-40 
150-196 BL00796B 10.67 7.000e- 
39 37-70 BL00796E 14.15
3.045e-33 198-234 BL00796A 
1 0.52 4.o Soe-zo j- i51-.uu/70Jd 
14.15 2.742e-11 209-245


1474 


PF00642 


Zinc finger C-x8-C-x5-C-x3-H type (and 
similar). 


PrQQo4Z 11. jV /./yoe-iu o/o-oo/ 
PF00642 11.59 7.055e-09 276-287 


1475 


PF00588 


SpoU rRNA Methylase family. 


rrOUjooB l/.lo o.ZUuc-lU Zen- 
's ni 


1476 


BL01160 


Kinesin light chain repeat proteins. 


BL01160B 19.54 5.653e-09 791-
845


1477 


PR00326 


GTP1/OBG GTP-BINDING PROTEIN
FAMILY SIGNATURE


PR00326A 8.75 4.255e-14 364-
385


1477


BL00306


Caseins alpha/beta proteins.


BL00306B 8.28 1.900e-09 557-
568 


1477 


PR00318


ALPHA G-PROTEIN (TRANSDUCIN) 
SIGNATURE 


PR00318E 7.23 5.320e-09 220- 
230 


1479 


BL01305 


moaA / nifB / pqqE family proteins. 


BL01305D 14.97 7.279e-09 7-22 


1480 


PR00918


CALICIVIRUS NON-STRUCTURAL 
POLYPROTEIN FAMILY SIGNATURE 


PR00918A 13.76 5.807e-09 458- 
479 


1480 


PR00674 


LIGHT HARVESTING PROTEIN B CHAIN 
SIGNATURE 


PR00674A 20.10 9.870e-09 133- 
154 


1481 


PR00171 


SUGAR TRANSPORTER SIGNATURE 


PR00171E 14.87 1.000e-08 73-86 
1482


DM01418 


352 FIBRILLAR COLLAGEN 
CARBOXYL-TERMINAL. 


DM01418A 20.83 5.650e-23 101- 
149 DM01418B 22.51 8.500e-11
166-208 DM01418C 20.48 
8.655e-10 236-278 


1482 


BL00291 


Prion protein. 


BL00291A 4.49 9.349e-10 23-58


1482 


BL01113


C1q domain proteins.


BL01113A 17.99 6.114e-11 38-65
BL01113A 17.99 2.915e-10 35-62
BL01113A 17.99 6.538e-09 17-44
BL01113A 17.99 6.712e-09 29-56
BL01113A 17.99 8.442e-09 32-59


1483 


DM01418 


352 FIBRILLAR COLLAGEN 
CARBOXYL-TERMINAL. 


DM01418A 20.83 5.650e-23 117- 
165 DM01418B 22.51 8.500e-11
182-224 DM01418C 20.48
8.655e-10 252-294 


1483 


BL00291 


Prion protein. 


BL00291A 4.49 9.349e-10 23-58


1483 


BL01113


C1q domain proteins.


BL01113A 17.99 6.114e-11 38-65
BL01113A 17.99 2.915e-10 35-62
BL01113A 17.99 6.538e-09 17-44
BL01113A 17.99 6.712e-09 29-56
BL01113A 17.99 8.442e-09 32-59


1484 


BL01052 


Calponin family repeat proteins. 


BL01052B 15.31 3.308e-11 30-56


1484 


PR00888 


SMOOTH MUSCLE PROTEIN/CALPONIN 
FAMILY SIGNATURE 


PR00888C 12.27 2.141e-09 30-46


1486 


BL00795 


Involucrin proteins. 


BL00795C 17.06 7.600e-09 239- 
284 


1486 


BL00415 


Synapsins proteins. 


BL00415N 4.29 9.409e-09 818- 
862 


1490


BL01046 


ATP-dependent serine proteases, lon family,
serine active sit. 


BL01046D 19.61 4.938e-35 452- 
493 BL01046C 17.03 9.581e-31 
377-421 BL01046B 19.24 4.977e- 
29 331-377 


1490 


PR00830 


ENDOPEPTIDASE LA (LON) SERINE 
PROTEASE (S16) SIGNATURE


PR00830D 8.08 2.552e-20 767- 
787 PR00830A 8.41 7.545e-18
375-395 PR00830E 13.94 8.500e- 
15 790-809 PR00830C 8.47 
2.837e-13 737-757 PR00830B 
14.73 7.429e-13 654-671 


1490 


BL01128


Shikimate kinase proteins. 


BL01128A 18.84 8.027e-12 371-
405 


1490 


PR00300 


ATP-DEPENDENT CLP PROTEASE ATP- 
BINDING SUBUNIT SIGNATURE


PR00300A 9.56 1.254e-10 371- 
390 


1490 


PR00819 


CBXX/CFQX SUPERFAMILY 
SIGNATURE 


PR00819B 10.83 2.350e-10 370-
386


1490 


BL00674 


AAA-protein family proteins.


BL00674B 4.46 8.071e-10 368- 
390 


1490 


PR00364 


DISEASE RESISTANCE PROTEIN 
SIGNATURE 


PR00364A 8.19 1.818e-09 370- 
386 


1490


BL00113


Adenylate kinase proteins.


BL00113A 12.74 7.369e-09 372-
389 


1491 


BL00824 


Elongation factor 1 beta/beta'/delta chain
proteins. 


BL00824B 9.21 2.338e-09 150- 
170 


1495 


BL00615 


C-type lectin domain proteins. 


BL00615A 16.68 3.880e-11 47-65
BL00615B 12.25 2.286e-10 149- 
163 


1498 


PR00119 


P-TYPE CATION-TRANSPORTING 
ATPASE SUPERFAMILY SIGNATURE 


PR00119B 13.94 8.714e-12 35-50 
PR00119E 8.48 7.716e-11 420-
440
1498


PR00120 


H+-TRANSPORTING ATPASE (PROTON
PUMP) SIGNATURE 


PR00120C 9.90 7.037e-10 420- 
437 


1498 


BL00154 


E1-E2 ATPases phosphorylation site proteins. 


BL00154E 20.37 5.275e-19 263- 
304 BL00154F 8.23 6.175e-19 
417-441 BL00154C 12.38 4.326e- 
13 31-50 BL00154D 12.57 
5.935e-09 191-202 


1499 


BL00039 


DEAD-box subfamily ATP-dependent
helicases proteins. 


BL00039D 21.67 3.455e-33 476- 
522 BL00039A 18.44 8.548e-23 
145-184 BL00039C 15.63 8.500e- 
16 277-301 BL00039B 19.19 
1.837e-12 191-217 


1499 


DM01537 


kw SKI2W SKI2 NUCLEOLAR HELICASE. 


DM01537B 21.63 8.990e-12 450- 
497 


1499 


PF00271 


Helicases conserved C-terminal domain 
proteins. 


PF00271 7.99 5.500e-10 507-515 


1501 


BL00115 


Eukaryotic RNA polymerase II heptapeptide 
repeat proteins. 


BL00115Z 3.12 9.669e-09 116- 
165 


1502 


PF00168 


C2 domain proteins. 


PF00168B 11.83 8.000e-10 38-49


1502 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360A 14.59 6.806e-10 43-56
PR00360B 13.61 2.227e-09 67-81 
PR00360B 13.61 5.909e-09 223- 
237 


1503 


PF00791 


Domain present in ZO-1 and Unc5-like netrin 
receptors. 


PF00791B 28.49 4.165e-13 780- 
835 PF00791B 28.49 6.767e-10 
888-943 PF00791C 20.98 8.059e- 
09 794-833 


1504 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 5.875e-10 437- 
453 PF00023A 16.03 7.000e-10 
563-579 PF00023A 16.03 8.500e- 
10 248-264 PF00023A 16.03 
9.250e-10 95-111 PF00023A
16.03 3.250e-09 596-612 
PF00023A 16.03 3.893e-09 716- 
732 PF00023A 16.03 6.786e-09 
62-78 PF00023A 16.03 9.036e-09 
496-512 


1504 


PD00078 


REPEAT PROTEIN ANK NUCLEAR 
ANKYR. 


PD00078B 13.14 2.957e-09 88- 
101 PD00078B 13.14 5.696e-09 
556-569 PD00078B 13.14 9.217e- 
09 742-755 


1504 


PF00791 


Domain present in ZO-1 and Unc5-like netrin 
receptors. 


PF00791B 28.49 5.024e-15 215-
270 PF00791B 28.49 7.750e-14
62-117 PF00791B 28.49 9.795e-
14 530-585 PF00791B 28.49 
9.505e-13 683-738 PF00791B 
28.49 7.253e-12 95-150 
PF00791B 28.49 2.636e-11 716-
771 PF00791C 20.98 5.090e-11
697-736 PF00791B 28.49 3.359e- 
10 404-459 PF00791B 28.49 
5.369e-10 248-303 PF00791B 
28.49 6.767e-10 563-618
PF00791C 20.98 8.052e-10 544- 
583 PF00791C 20.98 3.382e-09 
229-268 PF00791B 28.49 7.275e- 
09 371-426 PF00791C 20.98 
9.912e-09 385-424
1505


BL00479 


Phorbol esters /diacylglycerol binding 
domain proteins. 


BL00479B 12.57 8.714e-09 143-
159 


1506 


BL00479 


Phorbol esters /diacylglycerol binding 
domain proteins. 


BL00479B 12.57 8.714e-09 167- 
183


1507 


BL00518 


Zinc finger, C3HC4 type (RING finger), 
proteins. 


BL00518 12.23 5.286e-10 239- 
248 


1507


BL01282


BIR repeat proteins.


BL01282B 30.49 1.900e-09 220- 
259 


1507 


BL00264 


Neurohypophysial hormones proteins. 


BL00264 8.98 4.884e-09 341-368 


1510 


BL00122 


Carboxylesterases type-B serine proteins. 


BL00122G 11.67 2.500e-15 15-26


1511 


PR00910 


LUTEOVIRUS ORF6 PROTEIN
SIGNATURE 


PR00910A 2.51 1.986e-11 340-
353 PR00910A 2.51 1.986e-11
342-355 PR00910A 2.51 1.986e-
11 344-357 PR00910A 2.51
9.778e-10 346-359 PR00910A
2.51 1.107e-09 338-351
PR00910A 2.51 3.464e-09 336-
349 


1511 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 4.508e-09 324- 
357 


1512 


DM01970 


0 kw ZK632.12 YDR313C ENDOSOMAL
III. 


DM01970B 8.60 8.475e-15 175- 
188 


1514 


PR00833 


POLLEN ALLERGEN POA PI 
SIGNATURE 


PR00833H 2.30 8.375e-10 149-
164 PR00833H 2.30 2.846e-09
147-162 


1514 


PR00308 


TYPE I ANTIFREEZE PROTEIN 
SIGNATURE 


PR00308A 5.90 9.630e-11 150-
165 PR00308C 3.83 8.892e-10
104-114 PR00308C 3.83 8.892e-
10 105-115 PR00308C 3.83
8.892e-10 151-161 PR00308C
3.83 8.892e-10 152-162
PR00308C 3.83 8.892e-10 153-
163 PR00308C 3.83 8.892e-10
154-164 PR00308C 3.83 7.545e-
09 103-113 PR00308C 3.83
7.896e-09 150-160 PR00308B
4.28 8.397e-09 150-162 
PR00308A 5.90 9.047e-09 101- 
116 


1514 


PR00456 


RIBOSOMAL PROTEIN P2 SIGNATURE 


PR00456E 3.06 7.188e-10 144-
159 PR00456E 3.06 1.684e-09
145-160 PR00456E 3.06 7.949e-
09 97-112 PR00456E 3.06
9.430e-09 98-113 


1515 


PF00992 


Troponin.


PF00992A 16.67 3.368e-09 448- 
483 


1521 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019A 11.19 7.333e-09 322-
336 PR00019B 11.36 9.280e-09
319-333 


1522 


BL00315 


Dehydrins proteins. 


BL00315A 9.35 7.197e-10 93-121 


1524 


PD00930 


PROTEIN GTPASE DOMAIN 
ACTIVATION. 


PD00930B 33.72 4.240e-16 235- 
276 


1524 


PR00234 


HIV-1 MATRIX PROTEIN SIGNATURE 


PR00234E 11.78 7.268e-09 361-
375 


1525 


BL00183 


Ubiquitin-conjugating enzymes proteins. 


BL00183 28.97 8.338e-14 44-92 


1527 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019B 11.36 6.850e-10 132- 
146 PR00019A 11.19 2.667e-09 
135-149 PR00019B 11.36 9.640e-
09 180-194 PR00019B 11.36
1.000e-08 277-291


1529 


BL00625 


Regulator of chromosome condensation 
(RCC1) proteins. 


BL00625A 16.21 2.033e-16 567- 
596 BL00625B 17.69 4.205e-12
561-595 BL00625B 17.69 9.423e- 
11 93-127 BL00625B 17.69 
1.444e-10 152-186 BL00625A 
16.21 1.759e-10 99-128 
BL00625A 16.21 2.739e-09 515- 
544 BL00625B 17.69 3.172e-09
43-77 BL00625A 16.21 4.170e- 
09 158-187 


1529 


PR00633 


CHROMOSOME CONDENSATION 
REGULATOR RCC1 SIGNATURE 


PR00633B 13.02 3.535e-09 561- 
575 PR00633A 9.32 6.260e-09 
527-544 PR00633F 10.03 7.949e- 
09 528-543 


1530 


BL00414 


Profilin proteins.


BL00414A 13.85 6.344e-13 2-16 
BL00414E 15.46 6.283e-09 121- 
136 


1530 


PR00392 


PROFILIN SIGNATURE 


PR00392C 18.98 2.350e-12 41-62 
PR00392A 14.10 4.176e-12 4-14 
PR00392D 12.00 5.250e-11 63-78
PR00392F 17.40 7.955e-11 122-
140 PR00392E 12.06 6.500e-09 
109-123 


1531 


BL00414 


Profilin proteins.


BL00414A 13.85 6.344e-13 2-16 
BL00414E 15.46 6.283e-09 105- 
120 


1531 


PR00392 


PROFILIN SIGNATURE 


PR00392C 18.98 2.350e-12 41-62 
PR00392A 14.10 4.176e-12 4-14 
PR00392D 12.00 5.250e-11 63-78
PR00392F 17.40 7.955e-11 106-
124 PR00392E 12.06 8.833e-09
93-107 


1532 


PD00301 


PROTEIN REPEAT MUSCLE CALCIUM- 
BL 


PD00301A 10.24 8.200e-09 131- 
142 


1533 


DM01930 


2 kw FINGER SMCX SMCY YDR096W.


DM01930F 14.16 1.310e-27 24-60 


1534 


BL00411 


Kinesin motor domain proteins. 


BL00411G 21.39 2.200e-39 77-
119 BL00411H 15.66 8.800e-33 
125-156 BL00411F 14.77 6.250e- 
18 33-58 


1534 


PR00380 


KINESIN HEAVY CHAIN SIGNATURE 


PR00380D 9.93 7.923e-26 126- 
148 PR00380C 13.18 1.000e-21 
76-95 PR00380B 12.64 1.621e-16 
42-60 


1534 


BL00893 


mutT domain proteins. 


BL00893 18.99 8.826e-09 176- 
201 


1536 


BL00600 


Aminotransferases class-III pyridoxal-
phosphate attachment si. 


BL00600E 16.43 5.725e-15 164- 
1<tt BL00600G 12.43 7.000e-14
242-261 BL00600F 8.77 7.480e-
11 207-220 BL00600D 8.71
1.750e-10 143-157 


1537 


BL00838 


Interleukins -4 and -13 proteins. 


BL00838A 12.35 8.696e-09 136- 
155 


1537 


PD01847 


PHOTOSYSTEM II PROTEIN REACTION 
CENTRE I TRANSM. 


PD01847 9.59 8.946e-09 137-173 


1539 


PR00121 


SODIUM/POTASSIUM-TRANSPORTING 
ATPASE SIGNATURE 


PR00121D 16.72 3.012e-12 261- 
283 
1539


BL00154 


E1-E2 ATPases phosphorylation site proteins. 


BL00154E 20.37 8.468e-16 532-
573 BL00154C 12.38 3.520e-12
264-283 


1539 


BL01228 


Hypothetical cof family proteins. 


BL01228D 17.44 6.400e-11 660-
685 


1539 


PR00119 


P-TYPE CATION-TRANSPORTING 
ATPASE SUPERFAMILY SIGNATURE


PR00119B 13.94 3.333e-11 268-
283 PR00119D 9.56 6.063e-10 
548-559 


1540 


BL00289 


Pentaxin family proteins. 


BL00289A 30.36 9.031e-09 331- 
362 


1542 


BL01279 


Protein-L-isoaspartate(D-aspartate) O-
methyltransferase signa. 


BL01279A 24.27 1.000e-11 67-
115 


1542 


BL00422 


Granins proteins. 


BL00422C 16.18 7.176e-09 303-
331 


1545 


BL00027 


'Homeobox' domain proteins.


BL00027 26.43 4.462e-32 244- 
287 


1545 


PR00025 


HOMEOTIC ANTENNAPEDIA PROTEIN 
SIGNATURE 


PR00025B 11.94 3.143e-12 230-
246 


1545 


PR00024 


HOMEOBOX SIGNATURE 


PR00024C 7.49 3.500e-12 276- 
286 PR00024A 11.87 7.000e-12
251-263 PR00024B 11.27 1.409e- 
10 266-277 


1545 


PR00031 


LAMBDA AND OTHER REPRESSOR 
HELIX-TURN-HELIX SIGNATURE 


PR00031B 16.29 4.414e-10 267- 
284 


1545 


BL00032 


'Homeobox' antennapedia-type protein. 


BL00032B 10.83 1.675e-37 233- 
272 BL00032C 11.28 4.429e-21 
272-290 BL00032A 18.38 
5.750e-10 193-216


1546 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 6.400e-16 245- 
258 PD00066 13.92 8.615e-15 
329-342 PD00066 13.92 6.000e- 
13 301-314 PD00066 13.92 
4.857e-12 217-230 PD00066
13.92 1.346e-10 273-286 
PD00066 13.92 8.200e-09 357- 
370 


1546 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 5.950e-13 313- 
330 BL00028 16.07 7.261e-12 
229-246 BL00028 16.07 3.077e- 
11 16-33 BL00028 16.07 3.769e-
11 285-302 BL00028 16.07
9.308e-11 341-358 BL00028
16.07 3.100e-10 397-414
BL00028 16.07 5.800e-10 201-
218 BL00028 16.07 6.400e-10
369-386 BL00028 16.07 7.600e-
10 257-274 BL00028 16.07
8.800e-10 72-89 BL00028 16.07 
9.229e-09 101-118 


1546 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 3.118e-12 310-
324 PR00048B 6.02 5.000e-12 
326-336 PR00048A 10.52 6.294e- 
12 13-27 PR00048B 6.02 1.692e- 
11 242-252 PR00048A 10.52 
3.842e-11 338-352 PR00048A
10.52 5.263e-11 366-380
PR00048A 10.52 8.579e-11 226-
240 PR00048A 10.52 8.579e-11
254-268 PR00048A 10.52 i.J4oe-

10 394-408 PR00048A 10.52 

a rstiA mow oO£ DPAAAAftTi 

6.02 7.188e-10 298-308 

PR00048A 10.52 9.640e-09 98- 
112 


1547 


BL00585 


Ribosomal protein S5 proteins. 


340 BL00585A 28.43 4.286e-16 


1548 


PR00482 


OMPTIN SERINE PROTEASE 
SIGNATURE 


842 


1549 


PR00500 


POLYCYSTIC KIDNEY DISEASE 
PROTEIN SIGNATURE 


PR00500B 7.74 7.359e-10 56-77 


1551 


PR00917 


SMALL ROUND STRUCTURED VIRUS 
(C37) CYSTEINE PROTEASE FAMILY 
SIGNATURE 


PR00917G 10.598.990e-09 812- 
830 


1553 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 9.486e-09 109- 
126 


1555 


PF00638 


RanBP1 domain proteins.


PF00638 11.91 4.600e-18 67-82 


1555 


DM01269 


303 kw ACTIVATING RAN GTPASE 
ISOZYME. 


DM01269A 23.35 1.600e-20 68- 
96 DM01269B 11.71 3.323e-09 
138-148 


1556 


BL00406 


Actins proteins. 


BL00406E 8.44 8.541e-28 323- 
373 BL00406B 5.47 1.375e-27 
82-137 BL00406D 12.58 3.160e- 
26 266-321 BL00406C 6.75 
6.943e-25 141-196 BL00406A 
9.95 2.575e-20 7-42 


1556 


PR00190 


ACTIN SIGNATURE 


PR00190F 7.80 3.647e-13 139- 
159 PR00190C 11.49 2.029e-12 

233-250 


1558 


BL00048 


Protamine P1 proteins.




1558 


DM01206


CORONAVIRUS NUCLEOCAPSID 
PROTEIN. 


DM01206B 10.69 9.328e-11 157-
177 DM01206B 10.69 1.907e-10
236-256 DM01206B 10.69
7.721e-10 188-208 DM01206B
10.69 6.582e-09 234-254 


1559 


PR00315 


GTP-BINDING ELONGATION FACTOR
SIGNATURE 


140 


1559


PR00326 


GTP1/OBG GTP-BINDING PROTEIN 
FAMILY SIGNATURE 


PR00326A 8.75 1.000e-09 127- 
148 


1559 


PR00449 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449A 13.20 6.431e-09 125- 
147 


1559 


PR00755 


AFLATOXIN BIOSYNTHESIS 
REGULATORY PROTEIN SIGNATURE 


PR00755F 10.99 9.722e-09 30-52 


1563 


PF00780


Domain found in NIK1-like kinases, mouse


PF00780B 23.03 9.908e-09 14-57 


1567 


BL00162 


Eukaryotic-type carbonic anhydrases proteins. 


BL00162C 17.78 1.000e-40 88- 
125 BL00162E 14.93 7.231e-39 
171-204 BL00162F 22.68 5.050e- 
31 208-242 BL00162A 22.92 
8.714e-30 16-47 BL00162D 
15.06 7.158e-24 126-151 
BL00162B 21.43 1.375e-19 51-74


1568


PR00457 


ANIMAL HAEM PEROXIDASE 
SIGNATURE 


PR00457E 20.67 1.621e-24 414- 
441 PR00457D 16.81 8.258e-21 
389-410 PR00457B 13.29 3.455e-
18 223-239 PR00457G 17.45 
7.000e-18 595-616 PR00457C 
19.25 4.414e-16 371-390 
PR00457H 15.90 8.650e-14 666- 
681 PR00457A 15.80 5.645e-12 
169-181 PR00457F 13.69 8.875e-
11 467-478 


1569 


BL01304 


ubiH/COQ6 monooxygenase family proteins. 


BL01304A 8.05 3.571e-11 50-64


1569 


PR00368 


FAD-DEPENDENT PYRIDINE 
NUCLEOTIDE REDUCTASE SIGNATURE 


PR00368A 17.76 3.769e-10 50-73 


1569 


PR00757 


FLAVIN-CONTAINING AMINE OXIDASE 
SIGNATURE 


PR00757A 6.64 5.552e-10 50-70 


1569 


BL00623 


GMC oxidoreductases proteins. 


BL00623A 12.60 2.929e-09 50-69 


1569 


PR00420 


AROMATIC-RING HYDROXYLASE 
(FLAVOPROTEIN MONOOXYGENASE) 
SIGNATURE 


PR00420A 14.78 6.455e-09 50-73 


1569 


BL00064 


L-lactate dehydrogenase proteins. 


BL00064A 21.16 7.203e-09 50-88 


1569 


PR00370 


FLAVIN-CONTAINING 
MONOOXYGENASE (FMO) SIGNATURE 


PR00370A 3.35 9.772e-09 50-66


1571 


BL00019 


Actinin-type actin-binding domain proteins. 


BL00019D 15.33 3.880e-17 145- 
175 


1573 


BL00893 


mutT domain proteins. 


BL00893 18.99 5.500e-16 127- 
152 


1573 


PR00502 


MUTT DOMAIN SIGNATURE 


PR00502B 15.92 4.600e-13 138- 
154 PR00502A 15.06 2.636e-09 
124-139 


1574 


PF00632 


HECT-domain (ubiquitin-transferase). 


PF00632B 18.45 7.000e-16 488- 
516 PF00632C 20.66 7.851e-14 
533-565 


1576 


PR00239 


MOLLUSCAN RHODOPSIN C- 
TERMINAL TAIL SIGNATURE 


PR00239E 1.58 9.566e-10 292- 
304 


1576 


BL00115 


Eukaryotic RNA polymerase II heptapeptide
repeat proteins. 


BL00115Z 3.12 5.632e-09 243-
292 


1576 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 6.118e-11 296-
329 DM00215 19.43 9.647e-11
327-360 DM00215 19.43 8.232e- 
10 322-355 DM00215 19.43 
2.068e-09 291-324 DM00215 
19.43 2.983e-09 265-298 
DM00215 19.43 4.356e-09 292- 
325 DM00215 19.43 7.712e-09 
275-308 DM00215 19.43 8.017e- 
09 266-299 DM00215 19.43 
8.475e-09 271-304 DM00215 
19.43 8.780e-09 286-319


1582 


BL01280 


Glucose inhibited division protein A family 
proteins. 


BL01280A 15.97 6.727e-36 69- 
110 BL01280B 23.56 8.105e-27 
128-180 


1582 


BL00076 


Pyridine nucleotide-disulphide 
oxidoreductases class-I. 


BL00076A 18.83 6.745e-12 68-98 


1582 


BL00836 


Alanine dehydrogenase & pyridine nucleotide 
transhydrogenase. 


BL00836D 22.30 9.576e-12 69- 
106 


1582 


BL00504 


Fumarate reductase / succinate dehydrogenase 
FAD-binding site proteins.


BL00504A 10.76 3.870e-11 69-91


1582 


BL00977 


FAD-dependent glycerol-3-phosphate
dehydrogenase proteins. 


BL00977A 20.76 8.583e-11 69-
121
1582


PR00411 


PYRIDINE NUCLEOTIDE DISULPHIDE 
REDUCTASE CLASS-I SIGNATURE 


PR00411A 15.95 1.000e-10 69-92


1582 


BL00982 


Bacterial-type phytoene dehydrogenase 
proteins. 


BL00982A 18.41 2.151e-10 71-
103 


1582 


PR00368 


FAD-DEPENDENT PYRIDINE 
NUCLEOTIDE REDUCTASE SIGNATURE 


PR00368A 17.76 8.846e-13 69-92 
PR00368C 15.74 5.263e-10 69-95


1582 


PR00419 


ADRENODOXIN REDUCTASE FAMILY 
SIGNATURE 


PR00419A 14.89 3.571e-09 69-92 


1582 


PR00757 


FLAVIN-CONTAINING AMINE OXIDASE 
SIGNATURE 


PR00757A 6.64 6.226e-09 69-89 


1582 


PR00469 


PYRIDINE NUCLEOTIDE DISULPHIDE 
REDUCTASE CLASS-II SIGNATURE 


PR00469A 15.46 1.851e-10 69-92 
PR00469F 16.51 8.063e-09 65-90 


1582 


BL00623 


GMC oxidoreductases proteins. 


BL00623A 12.60 8.586e-09 69-88


1586 


PR00413 


HALOACID DEHALOGENASE/EPOXIDE 
HYDROLASE FAMILY SIGNATURE 


PR00413E 15.78 6.714e-09 70-87 


1587 


PD01861 


PROTEIN NUCLEAR 
RIBONUCLEOPROTEIN SMALL MRNA 
RNA. 


PD01861A 14.06 6.318e-10 60-84


1588 


PD00289 


PROTEIN SH3 DOMAIN REPEAT 
PRESYNA. 


PD00289 9.97 6.586e-09 46-60 


1588 


PF00595 


PDZ domain proteins (Also known as DHR 
or GLGF). 


PF00595 13.40 9.400e-09 43-54 


1591 


BL00914 


Syntaxin / epimorphin family proteins. 


BL00914 24.91 1.250e-29 184- 
234 


1592 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010C 11.16 6.667e-11 363-
374 


1592 


BL01187 


Calcium-binding EGF-like domain proteins 
pattern proteins. 


BL01187B 12.04 6.538e-16 358- 
374 BL01187A 9.98 3.250e-09
278-290 


1592 


BL00022 


EGF-like domain proteins. 


BL00022B 7.54 8.200e-09 367- 
374 


1593 


PR00625 


DNAJ PROTEIN FAMILY SIGNATURE 


PR00625A 12.84 4.600e-20 14-34 
PR00625B 13.48 8.759e-20 46-67 


1593 


BL00636 


Nt-dnaJ domain proteins. 


BL00636A 8.07 4.176e-18 18-35
BL00636B 15.11 1.000e-15 46-67 


1594 


PD02448 


TRANSCRIPTION PROTEIN DNA- 
BINDIN. 


PD02448A 9.37 3.854e-09 351- 
390 


1598 


PD02448 


TRANSCRIPTION PROTEIN DNA- 
BINDIN. 


PD02448A 9.37 1.511e-20 50-89
PD02448B 10.17 8.071e-19 89- 
137 


1602 


PR00403 


WW DOMAIN SIGNATURE 


PR00403B 12.19 9.816e-11 144-
159 PR00403B 12.19 8.167e-10
103-118


1602 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 6.644e-09 23-56 


1602 


BL01159 


WW/rsp5/WWP domain proteins. 


BL01159 13.85 5.224e-10 144-
159 BL01159 13.85 6.891e-09 
103-118 


1602 


PR00571 


ENDOTHELIN-B RECEPTOR
SIGNATURE 


PRfift^71fi S 16 7 750e-09 107- 
126 


1603 


PR00403 


WW DOMAIN SIGNATURE 


PR00403B 12.19 9.816e-11 107-
122 


1603 


BL01159 


WW/rsp5/WWP domain proteins. 


BL01159 13.85 5.224e-10 107- 
122 


1603 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 6.644e-09 23-56 


1605 


PR00929 


AT-HOOK-LIKE DOMAIN SIGNATURE 


PR00929B 4.38 4.600e-10 358-
370 


1605 


DM00303 


6 LEA 11-MER REPEAT REPEAT.


DM00303A 13.20 7.708e-10 306- 
356 DM00303A 13.20 2.912e-09
304-354 DM00303A 13.20 
7.212e-09 300-350 DM00303A 
13.20 7.212e-09 311-361


1605 


BL00354 


HMG-I and HMG-Y DNA-binding domain
proteins (A+T-hook).


BL00354B 3.16 7.722e-09 357- 
370 


1606


PD02379


AMINOTRANSFERASE BIOSYNTHESIS
PHOSPHOSERINE SER.


PD02379E 11.43 1.000e-40 194- 
236 PD02379F 18.62 6.029e-35 
245-284 PD02379H 16.03 
5.235e-33 352-385 PD02379B 
12.05 3.613e-31 80-113 
PD02379A 15.57 2.800e-25 29-60
PD02379C 13.34 3.700e-21 119- 
139 PD02379D 11.83 9.419e-16 
168-181 PD02379G 10.62
2.537e-14 313-328 


1606 


PR00800 


AROMATIC-L-AMINO-ACID 
DECARBOXYLASE SIGNATURE 


PR00800G 11.29 1.889e-09 97-
118 


1607 



PD02379 


AMINOTRANSFERASE BIOSYNTHESIS 
PHOSPHOSERINE SER. 


PD02379E 11.43 1.000e-40 194-
236 PD02379F 18.62 6.029e-35
245-284 PD02379B 12.05 3.613e-
31 80-113 PD02379A 15.57
2.800e-25 29-60 PD02379H
16.03 7.864e-23 306-339
PD02379C 13.34 3.700e-21 119-
139 PD02379D 11.83 9.419e-16
168-181 


1607 


PR00800 


AROMATIC-L-AMINO-ACID 
DECARBOXYLASE SIGNATURE 


PR00800G 11.29 1.889e-09 97- 
118 


1610 


PR00874 


FUNGI-IV METALLOTHIONEIN 
SIGNATURE 


PR00874C 4.37 6.625e-09 33-48 


1614 


BL00035 


'POU' domain proteins.


BL00035B 14.46 6.236e-09 683- 
704 


1616 


PF00168 


C2 domain proteins. 


PF00168C 27.49 8.412e-13 634- 
660 


1616 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 8.105e-10 651- 
665 


1617 


PF00168 


C2 domain proteins. 


PF00168C 27.49 8.412e-13 115- 
141 


1617 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 8.105e-10 132- 
146 


1619 


PR00566 


DOPAMINE 1B RECEPTOR SIGNATURE


PR00566E 13.44 5.255e-18 466-
483 PR00566A 9.32 3.000e-17
200-214 PR00566D 9.35 1.600e-
12 446-455 PR00566C 11.44
2.184e-12 401-412 PR00566B
8.20 3.053e-11 341-351


1619 


PR00242 


DOPAMINE RECEPTOR SIGNATURE 


PR00242E 13.29 1.000e-12 424- 
439 PR00242B 11.77 8.650e-11
257-267 


1619 


BL00237 


G-protein coupled receptors proteins. 


BL00237C 13.19 6.786e-20 364- 
391 BL00237A 27.68 9.710e-15 
266-306 BL00237B 5.28 5.263e- 
10 309-321 


1619 


PR00237 


RHODOPSIN-LIKE GPCR SUPERFAMILY 
SIGNATURE 


PR00237F 13.57 5.800e-19 369-
394 PR00237B 13.50 6.250e-19 
236-258 PR00237E 13.03 9.500e- 
15 301-325 PR00237C 15.69 
3.925e-09 280-303 PR00237A
11.48 7.387e-09 202-227


1620 


PR00169 


POTASSIUM CHANNEL SIGNATURE


PR00169A 16.77 7.851e-11 46-66


1621 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 7.529e-11 183-
216 


1621 


PF00685 


Sulfotransferase proteins. 


PF00685C 26.03 5.100e-09 118- 
164 


1621 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE


213 


1622 


BL00951 


ER lumen protein retaining receptor proteins. 


d] nno^lR Id IX 1 670e-09 43-74 


1623 


BL00292 


Cyclins proteins. 


BL00292B 20.31 3.925e-11 120-
151 


1624 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


125 


1624 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


131 BL00028 16.07 3.571e-09
145-162 


1625 


BL01226 


Hydroxymethylglutaryl-coenzyme A synthase 
proteins. 


BL012261 25.06 8.560e-09 256- 
304 


1629 


BL00030 


Eukaryotic RNA-binding region RNP-1 
proteins. 


BL00030A 14.39 1.563e-12 72-91 
BL00030A 14.39 2.125e-12 156- 
175 


1637 


BL00740 


MAM domain proteins. 


BL00740B 19.76 3.813e-09 637- 
658 


1637 


PR00597 


GELSOLIN FAMILY SIGNATURE 


PR00597G 8.55 6.586e-27 637- 
660 PR00597A 12.96 5.846e-26 
326-348 PR00597E 13.46 2.000e- 
22 523-544 PR00597F 16.29 
9.526e-22 582-602 PR00597D
12.77 1.000e-20 469-490
PR00597B 9.78 2.500e-20 415- 

436-455 PR00597H 15.32 7.577e- 
19 666-686 PR00597D 12.77 
3.392e-10 94-115 PR00597B 9.78 

7.875e-09 61-80 PR00597A 12.96 
8.027e-09 689-711 


1641 


BL00027 


'Homeobox' domain proteins. 


BL00027 26.43 7.000e-11 93-136


1641 


PR00887 


STRUCTURE-SPECIFIC RECOGNITION 
PROTEIN SIGNATURE


PR00887D 15.12 8.909e-09 337- 

^1 
jji 


1646 


PR00259 


TRANSMEMBRANE FOUR FAMILY 
SIGNATURE 


PR00259A 9.27 3.308e-18 19-43 
PR00259C 16.40 9.800e-18 88-
117 PR00259D 13.50 2.756e-15
238-265 


1646 


BL00421 


Transmembrane 4 family proteins. 


BL00421A 11.79 5.263e-14 15-34 
BL00421E 20.97 4.632e-13 235- 
265 


1651 


PR00669 


INHIBIN ALPHA CHAIN SIGNATURE 


PR00669F 5.57 9.899e-09 223- 
241 


1652 


DM01292 


ESICULAR LUMEN DOMAIN. 


DM01292L 12.54 9.505e-09 240- 
265 


1653 


PR00128 


COLIPASE SIGNATURE 


PR00128D 9.77 6.250e-25 47-66 
PR00128C 9.28 5.299e-20 24-47 


1653 


BL00121 


Colipase proteins. 


BL00121B 9.96 3.160e-33 15-64
BL00121A 14.56 2.107e-09 16-56 


1656 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 9.929e-10 384- 
399


1658 


BL01118 


Translation initiation factor SUI1 proteins. 


BL01118B 26.75 8.579e-26 94-
132 BL01118A 12.46 4.000e-13 
77-92 


1659 


BL00811 


Oleosins proteins. 


BL00811A 8.26 3.310e-09 120-
158


1660


BL00674


AAA-protein family proteins.


BL00674B 4.46 9.182e-11 184-
206 


1660


PR00830


ENDOPEPTIDASE LA (LON) SERINE 
PROTEASE (S16) SIGNATURE


PR00830A 8.41 8.544e-10 191- 
211 


1660 


PR00300 


ATP-DEPENDENT CLP PROTEASE ATP- 
BINDING SUBUNIT SIGNATURE 


PR00300A 9.56 9.416e-09 187-
206 


1660 


PR00051 


BACTERIAL CHROMOSOMAL 
REPLICATION INITIATOR (DNAA) 
SIGNATURE 


PR00051A 10.68 9.899e-09 184-
205 


1661 


DM01871 


hvSSRLIGASECYCLO 
FORMYLTETRAHYDROFOLATE. 


DM01871C 20.79 9.836e-10 270-
296 


1663 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 7.712e-09 95-110 


1665 


BL01181 


Ribosomal protein S21 proteins. 


BL01181 15.43 2.500e-10 13-49 


1666 


PR00259 


TRANSMEMBRANE FOUR FAMILY 
SIGNATURE 


PR00259C 16.40 6.824e-16 88- 
117 PR00259A 9.27 3.423e-14 
24-48 PR00259D 13.50 1.574e-13
238-265 PR00259B 14.81 8.714e- 
13 61-88 


1666 


BL00421 


Transmembrane 4 family proteins. 


BL00421B 17.62 4.600e-19 67- 
106 BL00421E 20.97 6.211e-13
235-265 BL00421A 11.79 
5.600e-12 20-39 


1668


PR00496 


NAPIN SIGNATURE


PR00496A 6.68 6.276e-09 21-43 


1671 


PR00237 


RHODOPSIN-LIKE GPCR SUPERFAMILY 
SIGNATURE 


PR00237G 19.63 2.543e-11 670-
697 PR00237A 11.48 3.000e-10 
424-449 


1671 


PR00373 


GLYCOPROTEIN HORMONE RECEPTOR 
SIGNATURE 


PR00373D 11.16 2.403e-09 503- 
518 


1671 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 6.600e-10 496- 
536 BL00237D 11.23 4.545e-09
680-697 


1671 


PR00910 


LUTEOVIRUS ORF6 PROTEIN
SIGNATURE 


PR00910A 2.51 7.429e-09 400- 
413 


1671 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019B 11.36 6.000e-11 94-
108 PR00019A 11.19 7.300e-11
215-229 PR00019B 11.36 6.850e-
10 46-60 PR00019A 11.19
8.043e-10 285-299 PR00019B
11.36 5.320e-09 212-226 
PR00019B 11.36 9.640e-09 70-84 


1672 


BL00972 


Ubiquitin carboxyl-terminal hydrolases 
family 2 proteins. 


BL00972A 11.93 7.500e-20 36-54
BL00972D 22.55 6.806e-16 296-
321 BL00972B 9.45 1.000e-13
116-126 BL00972E 20.72 8.773e- 
12 321-343 


1673 


PF00646 


F-box domain proteins. 


PF00646A 14.37 6.906e-09 92- 
106 


1675 


BL00933 


FGGY family of carbohydrate kinases 
proteins. 


BL00933D 24.01 7.545e-15 212- 
249 BL00933B 15.94 2.200e-09
54-65 BL00933E 13.80 3.543e-09 
439-455 BL00933A 17.50 






WO 02/070539 




SEQ ID 
NO: 


Database 
entry ID 


Description 


Results* 








4.857e-09 20-44 


1676 


PR00876 


NEMATODE METALLOTHIONEIN
SIGNATURE


PR00876B 7.66 1.887e-10 137-
151 


1676 


BL00026 


Chitin recognition or binding domain
proteins. 


BL00026 12.95 2.776e-09 55-76


1676 


BL01208 


VWFC domain proteins. 


BL01208B 15.83 2.946e-09 172-
187


1676 


BL01282 


BIR repeat proteins. 


BL01282B 30.49 4.471e-09 130-
169 


1676 


PR00875 


MOLLUSC METALLOTHIONEIN 
SIGNATURE 


145 


1676 


BL00956 


Fungal hydrophobins proteins. 


165 


1676 


PD00866 


GLYCOPROTEIN PROTEIN SPIKE E2 
PRECURSOR PEPLOMER. 


PD00866L 3.73 6.564e-10 1-11
PD00866L 3.73 1.443e-09 JJ-4J
PD00866L 3.73 2.918e-09 12-22 
PD00866L 3.73 2.918e-09 19-29 
FUUU0O0JL j./i 4.oJoe-uy ZHy- 
159 


1676 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU.


PD02283C 17.54 6.288e-09 141-
169


1676 


DM01724 


kw ALLERGEN POLLEN CIM1 HOL-LI. 


DM01724 8.14 7.465e-10 17-37 
DM01724 8.14 4.434e-09 19-39 
DM01724 8.14 6.684e-09 10-30 


1676 


PR00858 


CRUSTACEAN METALLOTHIONEIN 
SIGNATURE 


PR00858B 5.93 5.883e-09 155- 
174 PR00858B 5.93 8.085e-09 
136-155 


1676 


PR00874 


FUNGI-IV METALLOTHIONEIN 
SIGNATURE 


PR00874C 4.37 9.739e-10 125- 
140 PR00874C 4.37 9.000e-09 
135-150 


1676 


BL00243 


Integrins beta chain cysteine-rich domain 
proteins. 


BL00243I 31.77 3.779e-10 38-81
BL00243I 31.77 4.309e-10 68-111
BL00243I 31.77 5.235e-10 58-101
BL00243I 31.77 7.353e-10 98-141
BL00243I 31.77 1.000e-09 78-121
BL00243I 31.77 1.000e-09 88-131
BL00243I 31.77 1.380e-09 121-
164 BL00243I 31.77 2.648e-09
119-162 BL00243I 31.77 3.662e-
09 61-104 BL00243I 31.77
4.296e-09 131-174 BL00243I
31.77 4.676e-09 48-91 BL00243I
31.77 6.704e-09 109-152
BL00243I 31.77 7.845e-09 25-68
BL00243I 31.77 9.366e-09 134-
177


1676 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 3.885e-16 128-
174 BL00203 13.94 8.607e-13
123-169 BL00203 13.94 2.780e-
11 153-199 BL00203 13.94
3.571e-11 148-194 BL00203
13.94 4.363e-11 113-159
BL00203 13.94 5.451e-11 139-
185 BL00203 13.94 6.934e-11
144-190 BL00203 13.94 9.209e-
11 131-177 BL00203 13.94
2.436e-10 35-81 BL00203 13.94
4.255e-10 133-179 BL00203
13.94 6.745e-10 32-78 BL00203
13.94 7.032e-10 154-200
BL00203 13.94 2.929e-09 34-80
BL00203 13.94 3.388e-09 149-
195 BL00203 13.94 3.571e-09
136-182 BL00203 13.94 5.224e-
09 127-173 BL00203 13.94
5.776e-09 43-89 BL00203 13.94
6.878e-09 140-186 BL00203
13.94 7.796e-09 45-91 BL00203
13.94 9.541e-09 42-88


1679




KINASE DEHYDROGENASE 
TRANSFERASE. 


PD01976A 8.95 1.493e-09 83-96


1680 


BL00623 


GMC oxidoreductases proteins. 


BL00623A 12.60 9.859e-10 12-31


1680 


PR00419 


ADRENODOXIN REDUCTASE FAMILY 
SIGNATURE 


PR00419A 14.89 4.729e-09 12-35


1680 


PR00368 


FAD-DEPENDENT PYRIDINE 
NUCLEOTIDE REDUCTASE SIGNATURE 


PR00368A 17.76 9.357e-09 12-35


1683 


BL01172 


Ribosomal protein L44e proteins. 


BL01172B 14.10 8.909e-38 15-57 
BL01172C 16.78 7.188e-31 63- 
102 


1685 


DM01724 


kw ALLERGEN POLLEN CIM1 HOL-LI. 


DM01724 8.14 5.909e-11 11-31
DM01724 8.14 6.591e-11 41-61
DM01724 8.14 6.831e-10 39-59
DM01724 8.14 8.697e-09 55-75


1686 


PR00320 


G-PROTEIN BETA WD-40 REPEAT 
SIGNATURE 


PR00320A 16.74 8.463e-09 73-88


1690


PD02269 


CYTIDINE DEAMINASE HYDROLASE 
ZINC AMINOHY. 


PD02269C 16.36 7.882e-17 79-92 
PD02269A 10.06 1.000e-15 29-41
PD02269D 11.98 5.000e-14 110-
125 


1691 


BL00790 


Receptor tyrosine kinase class V proteins. 


BL00790D 12.41 8.297e-09 429- 
454 


1692 


BL00750 


Chaperonins TCP-1 proteins.


BL00750B 16.17 2.000e-39 69-
119 BL00750A 20.07 8.286e-36 
25-68 BL00750C 25.65 8.579e-23 
152-184 




PR00304


TAILLESS COMPLEX POLYPEPTIDE 1 
(CHAPERONE) SIGNATURE 


PR00304C 8.69 1.250e-18 86-106 
PR00304B 11.60 2.059e-17 56-75
PR00304A 9.20 3.605e-15 34-51


1692 


PR00298 


60 KD CHAPERONIN SIGNATURE 


PR00298B 13.59 7.353e-11 88-
116 


1692 


BL00296 


Chaperonins cpn60 proteins. 


BL00296B 15.98 4.115e-13 76-
130 BL00296A 17.20 5.648e-10 
12-66 


1694 


BL00415 


Synapsins proteins. 


BL00415N 4.29 4.710e-10 225-
269 


1694 


PD01234 


PROTEIN NUCLEAR BROMODOMAIN 
TRANS. 


PD01734R 15 53 5 R75p in o/fi 
261 


1694 


BL00795 


Involucrin proteins. 


BL00795C 17.06 7.698e-10 213-
258 


1694 


PR00208 


GLIADIN AND LMW GLUTENIN
SUPERFAMILY SIGNATURE 


PR00208A 12.59 9.384e-09 247-
265 


1694 


DM00406 


GLIADIN. 


DM00406 7.73 9.800e-09 245-258 


1696 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 1.000e-09 212-
229 BL00028 16.07 6.143e-09
365-382


1700


BL00030 


Eukaryotic RNA-binding region RNP-1 
proteins. 


BL00030A 14.39 $.143e-13 332-
351


1700 


PD02784 


PROTEIN NUCLEAR
RIBONUCLEOPROTEIN.


PD02784B 26.46 6.943e-09 442- 
485 


1701 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 5.500e-10 283-
299 PF00023A 16.03 9.625e-10
347-363 PF00023A 16.03 1.321e-
09 184-200 PF00023A 16.03
1.643e-09 150-166 


1701 


BL00906 


Uroporphyrinogen decarboxylase proteins. 


BL00906D 24.33 7.750e-09 212- 
256 


1701 


PF00791 


Domain present in ZO-1 and Unc5-like netrin 
receptors. 


PF00791B 28.49 8.159e-14 117- 
172 PF00791B 28.49 8.319e-13 
217-272 PF00791B 28.49 3.179e- 
12 184-239 PF00791B 28.49 
5.168e-12 347-402 PF00791B 
28.49 5.727e-11 250-305
PF00791B 28.49 2.817e-09 17-72
PF00791B 28.49 8.514e-09 84-
139 PF00791C 20.98 1.000e-08 
98-137 


1702 


BL00107 


Protein kinases ATP-binding region proteins. 


BL00107B 13.31 1.643e-10 202- 
218 


1702 


PF00992 


Troponin. 


PF00992A 16.67 9.526e-09 749- 
784 


1708 


PR00671 


INHIBIN BETA B CHAIN SIGNATURE


PR00671C 4.18 8.966e-09 212-
232 


1709 


PR00678 


PI3 KINASE P85 REGULATORY 
SUBUNIT SIGNATURE


PR00678H 9.13 7.805e-12 292- 
315 


1710 


PR00412 


EPOXIDE HYDROLASE SIGNATURE 


PR00412C 11.30 2.421e-12 169-
183 PR00412A 13.23 7.947e-12 
104-123 PR00412B 12.59 7.429e- 
10 123-139 


1711 


PR00217 


43 KD POSTSYNAPTIC PROTEIN 
SIGNATURE 


PR00217C 10.91 7.247e-10 293- 
309 


1712 


PR00320


G-PROTEIN BETA WD-40 REPEAT 
SIGNATURE 


PR00320A 16.74 9.122e-09 277- 
292 PR00320A 16.74 9.780e-09 
233-248 PR00320C 13.01 1.000e-
Vis 2j3-24o 


1713 


BL01230 


RNA methyltransferase trmA family proteins. 


BL01230E 15.79 2.918e-11 487-
503 


1719 


BL00039 


DEAD-box subfamily ATP-dependent 
helicases proteins. 


BL00039D 21.67 2.957e-09 434- 
480 


1719 


DM01537 


kw SKI2W SKI2 NUCLEOLAR HELICASE.


DM01537B 21.63 7.830e-09 408- 

A « 


1721 


PR00453 


VON WILLEBRAND FACTOR TYPE A 
DOMAIN SIGNATURE 


PR00453A 12.79 2.957e-10 33-51 


1721 


PR00527


GASTRIN RECEPTOR SIGNATURE


PR00527I 5.36 6.559e-09 419-439 


1721 


PR00014 


FIBRONECTIN TYPE III REPEAT 
SIGNATURE 


PR00014C 15.44 6.870e-09 381- 
400 


1721 


PR00477 


PHOSPHOGLYCERATE KINASE FAMILY 
SIGNATURE 


PR00477I 8.53 1.000e-08 168-186 


1725 


PR00493 


BREAST CANCER TYPE I 
SUSCEPTIBILITY PROTEIN SIGNATURE 


PR00493G 7.57 3.711e-14 693- 
714 


1726 


BL00443 


Glutamine amidotransferases class-II
proteins. 


BL00443F 16.68 8.714e-09 85- 
101 


1728 


DM01206 


CORONAVIRUS NUCLEOCAPSID
PROTEIN.


DM01206B 10.69 7.288e-10 167-
187




DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 3.411e-10 331-
364 DM00215 19.43 7.107e-10
336-369 DM00215 19.43 9.679e-
10 335-368 DM00215 19.43
3.136e-09 342-375 DM00215
19.43 5.119e-09 315-348
DM00215 19.43 8.322e-09 326- 
359 


1728 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 8.627e-09 335- 
350 


1728 


BL00048 


Protamine P1 proteins.


BL00048 6.39 5.026e-10 152-179 
BL00048 6.39 6.329e-10 173-200 
BL00048 6.39 8.224e-10 161-188 
BL00048 6.39 3.363e-09 155-182 
BL00048 6.39 3.475e-09 163-190
BL00048 6.39 3.925e-09 167-194
BL00048 6.39 4.150e-09 151-178
BL00048 6.39 4.150e-09 159-186
BL00048 6.39 4.825e-09 171-198
BL00048 6.39 5.838e-09 176-203 
BL00048 6.39 8.200e-09 177-204 
BL00048 6.39 9.550e-09 153-180 


1728 


PR00211


GLUTELIN SIGNATURE 


PR00211B 0.86 1.827e-11 345-
366 PR00211B 0.86 3.571e-11
339-360 PR00211B 0.86 6.917e-
09 325-346 PR00211B 0.86 
1.000e-08 351-372 


1731 


BL01221 


PMP-22 / EMP / MP20 family proteins. 


BL01221C 26.20 1.281e-34 59- 
104 BL01221D 13.99 5.966e-27 
136-163 BL01221A 17.26 
2.385e-26 1-29 BL01221B 13.29 
1.000e-14 38-52


1733




'Homeobox' domain proteins.


BL00027 26.43 4.000e-10 297- 
340 


1733


PR00024


HOMEOBOX SIGNATURE 


PR00024A 11.87 4.150e-09 289-
301 


1734 


BL00027 


'Homeobox' domain proteins. 


BL00027 26.43 4.000e-10 297- 
340 


1734 


PR00024 


HOMEOBOX SIGNATURE 


PR00024A 11.87 4.150e-09 289-
301 


1738


BL00303


S-100/ICaBP type calcium binding protein.


BL00303B 26.15 5.075e-13 73- 
110 


1738


BL00018


EF-hand calcium-binding domain proteins.


BL00018 7.41 3.400e-12 85-98 
BL00018 7.41 8.043e-09 49-62 


1738


PR00450


RECOVERIN FAMILY SIGNATURE 


PR00450C 12.22 6.582e-09 44-66 
PR00450C 12.22 9.772e-09 80- 
102 


1740 


PD01941 


TRANSMEMBRANE COTRANSPORTER
SYMP. 


PT\A 1 QA IT 10 OiC A t\£f\~. t £ « j 

ruviyQH* \y.yo 4.960e-I6 84- 
139 PD01941B 15.02 2.093e-11
4-51 


1742 


BL00672 


Serine proteases, V8 family, histidine 
proteins. 


BL00672B 9.84 3.554e-09 214- 
231 


1742 


PR00839 


V8 SERINE PROTEASE FAMILY 
SIGNATURE 


PR00839E 12.04 8.062e-09 213- 
230 


1745 


BL00674 


AAA-protein family proteins. 


BL00674B 4.46 7.814e-10 360- 
382 


1745 


DM01022 


LRR REPEAT. 


DM01022A 7.35 1.900e-09 954-
961


1745 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 6.186e-09 267-
282 


1746 


BL00674 


AAA-protein family proteins. 


BL00674B 4.46 7.814e-10 360- 


1746 


DM01022 


LRR REPEAT. 


DM01022A 7.35 1.900e-09 954-
961 


1746 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 6.186e-09 267-

282 


1747 


BL00215 


Mitochondrial energy transfer proteins. 


BL00215A 15.82 9.591e-18 206- 
231 BL00215A 15.82 4.000e-15 
104-129 BL00215A 15.82 
9.400e-15 7-32 BL00215B 10.44 
1.000e-10 154-167


1747 


PR00927 


ADENINE NUCLEOTIDE 
TRANSLOCATOR 1 SIGNATURE 


PR00927G 11.07 9.036e-11 158-
174 PR00927B 14.66 4.652e-10
239-261 


1747 


PR00926 


MITOCHONDRIAL CARRIER PROTEIN 
SIGNATURE 


PR00926F 17.75 2.826e-09 9-32
PR00926F 17.75 3.217e-09 208-
231 


1749 


BL01013 


Oxysterol-binding protein family proteins. 


BL01013A 25.14 5.500e-21 537- 
573 BL01013D 26.81 2.161e-18 
807-851 BL01013C 9.97 4.231e-
13 625-635 BL01013B 11.33
3.017e-11 603-614


1751 


BL00711 


Lipoxygenases iron-binding region proteins. 


BL00711I 18.56 8.630e-28 577-
615 BL00711E 19.66 3.550e-22
414-451 BL00711G 21.83
9.100e-22 503-535 BL00711L
20.75 5.959e-19 268-297
BL00711D 17.56 1.923e-16 347-
373 BL00711H 23.34 1.771e-12
535-574 BL00711F 19.79 2.086e-
10 484-501


1751 


PR00087 


LIPOXYGENASE SIGNATURE 


PR00087C 15.00 1.184e-17 423-
444 PR00087A 18.37 7.061e-12
385-403 rKUUUo/rJ 13.23 3.Wie- 
10 403-421 


1751 


PR00467 


MAMMALIAN LIPOXYGENASE 
SIGNATURE 


PRUU4o/b V.w J.4UUe-14 .544- 

-i/i/i DDftrt^n i/? /so a nR7#»-no 
3o4 r^KUU4o fjJ lo.oy f.uo^e-uy 

243-265 


1753 


PR00492 


RHO PROTEIN GDP DISSOCIATION 
INHIBITOR SIGNATURE 


Dt> AA/1 07/~ > O <Q 1 Onn^-77^ 177- 

139 PR00492B9.77 8.579e-23 
7/^0^ PR A n4<)9n 14 R2 8 200e-21 
139-155 PR00492A1 1.921. 643e- 
1 R 60-76 


1756 


BL00378 


Hexokinases proteins. 


BL00378A 19.01 8.500e-09 403- 
431 


1757


BL00027


'Homeobox' domain proteins.


BL00027 26.43 8.615e-33 35-78 


1757 


BL00032 


'Homeobox' antennapedia-type protein. 


BL00032B 10.83 4.259e-27 24-63 
BL00032C 11.28 5.909e-20 63-81


1757 


PR00025 


HOMEOTIC ANTENNAPEDIA PROTEIN 
SIGNATURE 


PR00025B 11.94 4.000e-11 21-37


1757 


PR00031 


LAMBDA AND OTHER REPRESSOR 
HELIX-TURN-HELIX SIGNATURE 


PR00031B 16.29 4.960e-11 58-75


1757 


PR00024 


HOMEOBOX SIGNATURE 


PR00024C 7.49 9.357e-13 67-77
PR00024B 11.27 3.500e-11 57-68
PR00024A 11.87 9.400e-11 42-54


1758 


PR00179 


LIPOCALIN SIGNATURE 


PR00179B 9.56 1.000e-12 102-

1H PRflrtl70P io no i nnhft in 
hj r;xv.uui/yv> ii/.uz i.UUUe-il/ 

130.14$ PR0O179A 13.78 5.6£0e- 

.10 37-50 : 


1758 


BL00213 


Lipocalin proteins. 


BL00213B 8.78 8.000e-10 102-
113 BL00213A 12.95 9.526e-10
37-51


1759 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 1.818e-11 164-
187


1762 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 7.158e-09 171- 
200 


1762 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 5.855e-10 57-85 
PD02283C 17.54 5.855e-10 87- 
115 PD02283C 17.54 6.566e-10
117-145 PD02283C 17.54 1.450e-
09 47-75 PD02283C 17.54
1.450e-09 77-105 PD02283C 
17.54 1.450e-09 107-135 
PD02283C 17.54 5.613e-09 67-95 
PD02283C 17.54 5.613e-09 97- 
125 PD02283C 17.54 6.175e-09
137-165 PD02283C 17.54 7.525e- 
09 37-65 PD02283C 17.54 
8.875e-09 147-175 


1762 


BL00203 


Vertebrate metallothioneins proteins.


BL00203 13.94 3.379e-12 95-141
BL00203 13.94 3.690e-12 65-111
BL00203 13.94 2.978e-11 35-81
BL00203 13.94 5.549e-11 39-85
BL00203 13.94 6.538e-11 55-101
BL00203 13.94 6.538e-11 85-131
BL00203 13.94 7.231e-11 34-80
BL00203 13.94 7.429e-11 125-
171 BL00203 13.94 7.527e-11
69-115 BL00203 13.94 8.220e-11
99-145 BL00203 13.94 1.670e-10
64-110 BL00203 13.94 2.053e-10
94-140 BL00203 13.94 2.149e-10
124-170 BL00203 13.94 2.819e-
10 159-205 BL00203 13.94
5.213e-10 54-100 BL00203 13.94
5.213e-10 84-130 BL00203 13.94
5.691e-10 59-105 BL00203 13.94
5.691e-10 89-135 BL00203 13.94
6.936e-10 129-175 BL00203
13.94 7.511e-10 115-161
BL00203 13.94 7.702e-10 49-95
BL00203 13.94 7.702e-10 79-125
BL00203 13.94 1.551e-09 44-90
BL00203 13.94 1.551e-09 74-120
BL00203 13.94 3.112e-09 29-75
BL00203 13.94 4.031e-09 30-76
BL00203 13.94 4.214e-09 90-136
BL00203 13.94 4.306e-09 60-106
BL00203 13.94 5.133e-09 145-
191 BL00203 13.94 6.235e-09
119-165 BL00203 13.94 6.327e-
09 50-96 BL00203 13.94 6.327e-
09 80-126 BL00203 13.94
6.694e-09 45-91 BL00203 13.94 
8.898e-09 104-150


1763 


BL00216 


Sugar transport proteins. 


TIT AAO 1 /CD 07 £LA < ft/lAA-HQ 1 4.1 - 

191 


1766 


BL00456 


Sodium:solute symporter family proteins.


13LrU04jOA Ll.yy z.Uoye-JU o^- 
138 BL00456C 24.55 3.721e-29 
22*1-276 BL00456B 18.94 1.000e-
21 159-189


1766 


PR00175 


SODIUM/ALANINE SYMPORTER
SIGNATURE


PR00175B 10.80 9.878e-09 226-
245


1767 


BL00142 


Neutral zinc metallopeptidases, zinc-binding 
region proteins. 


BL00142 8.38 1.857e-09 494-505 


1768 


BL00509 


Ras GTPase-activating proteins. 


BL00509B 10.28 1.643e-12 610- 
621 


1772 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 6.143e-13 252-
266 PR00048A 10.52 7.429e-13
476-490 PR00048A 10.52 3.118e-
12 336-350 PR00048A 10.52
3.118e-12 364-378 PR00048A
10.52 4.706e-12 504-518
PR00048A 10.52 8.412e-12 224-
238 PR00048A 10.52 3.842e-11
392-406 PR00048A 10.52 6.211e-
11 308-322 PR00048A 10.52
6.211e-11 448-462 PR00048B
6.02 7.231e-11 492-502
PR00048B 6.02 3.250e-10 240- 
250 PR00048A 10.52 6.870e-10 
420-434 PR00048B 6.02 2.421e- 
09 380-390 


1772 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 8.800e-14 327- 
340 PD00066 13.92 1.500e-13 
411-424 PD00066 13.92 5.500e- 
13 383-396 PD00066 13.92 
5.500e-13 439-452 PD00066 
13.92 7.500e-13 495-508
PD00066 13.92 9.000e-13 467- 
480 PD00066 13.92 3.571e-12 
355-368 PD00066 13.92 7.000e- 
12 271-284 PD00066 13.92 
7.923e-10 299-312 PD00066
13.92 2.500e-09 243-256 


1772 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 3.250e-13 395- 
412 BL00028 16.07 5.950e-13 
367-384 BL00028 16.07 6.478e- 
12 451-468 BL00028 16.07 
8.435e-12 339-356 BL00028
16.07 1.692e-11 255-272
BL00028 16.07 3.769e-11 227-
244 BL00028 16.07 5.154e-11
507-524 BL00028 16.07 2.200e-
10 479-496 BL00028 16.07 
9.400e-10 199-216 BL00028 
16.07 2.029e-09 423-440 
BL00028 16.07 3.571e-09 311- 
328 


1773 


PR00122 


VACUOLAR ATP SYNTHASE 16 KD
SUBUNIT SIGNATURE


PR00122D 9.97 7.214e-11 103-
127 PR00122C 8.20 9.526e-11
76-103


1773 


BL00605 


ATP synthase c subunit proteins. 


BL00605 27.67 4.977e-09 70-124 


1774 


BL00518 


Zinc finger, C3HC4 type (RING finger), 
proteins. 


BL00518 12.23 6.571e-10 30-39 


1776 


BL01160 


Kinesin light chain repeat proteins. 


BL01160B 19.54 1.610e-09 33-87
BL01160B 19.54 9.619e-09 65-


1783 


PD02411


PROTEIN TRANSCRIPTION 
REGULATION NUCLEAR. 


IfDWU 21.89 6.786e-15 3967- 
4001 


1783 


PF00856 


SET domain proteins. 


PF00856B 16.42 6.595e-19 3949- 
3971 PF00856A 26.14 4.125e-12
3896-3933 


1783 


PF00628 


PHD-finger. 


PF00628 15.84 3.455e-12 86-101 
PF00628 15.84 7.750e-10 38-53
PF00628 15.84 5.645e-09 164-179 


1783 


BL00115 


Eukaryotic RNA polymerase II heptapeptide 
repeat proteins. 


BL00115Z 3.12 4.971e-09 2575-
2624 BL00115Z 3.12 7.750e-09
2582-2631 


1784 


DM01970 


kw ZK632.12 YDR313C ENDOSOMAL

m. 


DM01970B 8.60 9.423e-10 111- 
124 


1785 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 2.174e-10 177- 
191 


1790 


PF00922 


Vesiculovirus phosphoprotein. 


PF00922A 19.17 7.724e-09 276- 
310 


1792 


PD02059 


CORE POLYPROTEIN PROTEIN GAG 
CONTAINS: P. 


PD02059A 28.10 5.950e-10 34-75 


1794 


BL00326 


Tropomyosins proteins. 


BL00326D 8.76 8.065e-09 165- 
206


1795 


BL00326 


Tropomyosins proteins. 


BL00326D 8.76 8.065e-09 173- 
214 


1797 


PR00563 


BETA-3 ADRENERGIC RECEPTOR 
SIGNATURE 


PR00563B 3.98 8.141e-09 8-28 


1799 


PR00450 


RECOVERIN FAMILY SIGNATURE 


PR00450C 12.22 1.570e-09 285- 
307 


1801 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 8.800e-14 290- 
303 PD00066 13.92 4.000e-13 
234-247 PD00066 13.92 4.429e- 
12 262-275 PD00066 13.92 
9.217e-11 206-219 PD00066
13.92 3.769e-10 505-518
PD00066 13.92 4.115e-10 449- 
462 PD00066 13.92 4.462e-10 
533-546 PD00066 13.92 6.538e- 
10 477-490 


1801 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 5.091e-15 243- 
257 PR00048A 10.52 7.750e-14 
542-556 PR00048A 10.52 3.647e- 
12 215-229 PR00048A 10.52 
4.176e-12 486-500 PR00048B
6.02 6.000e-12 231-241
PR00048B 6.02 6.000e-12 287-
297 PR00048A 10.52 7.353e-12
187-201 PR00048A 10.52 6.684e-
11 271-285 PR00048A 10.52
4.130e-10 299-313 PR00048A
10.52 3.520e-09 430-444 
PR00048A 10.52 3.880e-09 514- 
528 PR00048A 10.52 6.400e-09 
458-472 PR00048A 10.52 8.560e- 
09 159-173 


1801 


BL00028


Zinc finger, C2H2 type, domain proteins.


562 BL00028 16.07 1.450e-13
218-235 BL00028 16.07 3.348e- 
12 489-506 BL00028 16.07 
7.652e-12 274-291 BL00028 

UrtTO I8^ft.11 4/3*3-4.^0 1 

BL00028 16.07 4.115e-11 517-

Z1A DT fifiOOR \ (\ 07 ** 1 1 
jj*t DL^UvU^O lO.V/ J. A*/*tC-11 

246-263 BL00028 16.07 1.000e-

10 107 1\Q RT 0007R 16 07 

5.200e-10 461-478 BL00028 
t « 07 (\ 700^-10 1 90-207 
BL00028 16.07 1.257e-09 357- 
17 A RT 0009R 16 07 9 486e-09 
162-179 


1802 


BL00615 


C-type lectin domain proteins. 


BL00615A 16.68 8.920e-11 137-
155 


1802 


PR00770 


EOSINOPHIL MAJOR BASIC PROTEIN
SIGNATURE 


PR00770F 13.79 1.774e-09 198-
01 R 


1802 


PR00356


TYPE II ANTIFREEZE PROTEIN 
SIGNATURE 


PR00356B 14.85 2.521e-09 137- 

1 

1 


1803 


BL00615 


C-type lectin domain proteins. 


BL00615A 16.68 8.920e-11 176-

1 7*1 


1803 


PR00770 


EOSINOPHIL MAJOR BASIC PROTEIN 
SIGNATURE 


PR00770F 13.79 1.774e-09 237- 


1803 


PR00356 


TYPE II ANTIFREEZE PROTEIN
SIGNATURE 


PR00356B 14.85 2.521e-09 176- 
104 


1804 


BL00018 


EF-hand calcium-binding domain proteins. 


BL00018 7.41 9.000e-14 65-78 


1806 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE


PR00019B 11.36 1.000e-09 219-
233 


1808 


PD02474 


SYNTHASE SMALL SUBUNIT
ACETOLACT. 


PD02474B 21.08 8.568e-09 199-
238 


1809 


PF00922 


Vesiculovirus phosphoprotein.


PF00922A 19.17 1.000e-08 249- 
283 


1812 


PR00289 


DISINTEGRIN SIGNATURE


PR00289B 11.79 1.947e-09 522-
535 


1814 


PF00242 


DNA polymerase (viral) N-terminal domain 
proteins. 


PF00242F 12.18 8.522e-09 197- 
219 


1815 


PR00780 


LEUSERPIN 2 SIGNATURE 


PR00780B 4.89 4.491e-09 262- 
285 


1816 


BL00226 


Intermediate filaments proteins. 


BL00226D 19.10 8.027e-13 208-
255 


1817 


PD01876 


ANTIGEN MELANOMA-ASSOCIATED 
MULTIGENE FAMILY TUM.


PD01876C 21.73 3.326e-15 481-
534 PD01876C 21.73 3.045e-10 
735-788 


1818 


PR00747 


GLYCOSYL HYDROLASE FAMILY 47 
SIGNATURE 


PR00747C 12.06 8.767e-09 337- 
356 


1820 


DM01782 


HYDROGENASE (FE) LARGE CHAIN.


DM01782C 13.88 4.400e-19 349-
368 DM01782F 9.01 4.375e-18
499-515 DM01782B 17.29 
3.412e-10 294-327 


1821 


BL00226 


Intermediate filaments proteins.


BL00226D 19.10 7.375e-38 321-
368 BL00226B 23.86 7.107e-32 
155-203 BL00226C 13.23 3.100e- 
19 220-251 BL00226A 12.77
7.000e-15 55-70 BL00226D 
19.10 7.800e-09 254-301 


1822 


DM01415 


6 SALIVARY GLUE PROTEIN. 


DM01415B 13.78 9.518e-10 4-52 


1822 


BL00026 


Chitin recognition or binding domain 
proteins. 


BL00026 12.95 3.013e-09 43-64 






PROTEIN SPORULATION REPEAT
PRECU. 


PD02283C 17.54 3.588e-09 110-
138 PD02283C 17.54 3.588e-09
120-148 


1822 


PD00866 


GLYCOPROTEIN PROTEIN SPIKE E2 
PRECURSOR PEPLOMER. 


PD00866L 3.73 1.443e-09 21-31
PD00866L 3.73 2.770e-09 97-107
PD00866L 3.73 2.776e-09 146-
156 PD00866L 3.73 2.918e-09 7-
17 PD00866L 3.73 2.918e-09 14-
24 PD00866L 3.73 4.541e-09 4-
14


1822 


PR00858 


CRUSTACEAN METALLOTHIONEIN 
SIGNATURE 


PR00858B 5.93 2.819e-09 84-103 
PR00858B 5.93 5.021e-09 114-
133 PR00858B 5.93 5.021e-09
124-143 


1822 


PR00875 


MOLLUSC METALLOTHIONEIN 
SIGNATURE 


PR00875B 5.24 6.595e-09 85-93 


1822 


PR00874 


FUNGI-IV METALLOTHIONEIN
SIGNATURE 


PR00874C 4.37 9.739e-10 73-88 
PR00874C 4.37 7.250e-09 83-98


1822 


BL00243 


Integrins beta chain cysteine-rich domain 
proteins. 


BL00243I 31.77 3.143e-11 13-56
BL00243I 31.77 3.647e-10 26-69
BL00243I 31.77 6.426e-10 106-
149 BL00243I 31.77 7.088e-10
96-139 BL00243I 31.77 9.338e-
10 36-79 BL00243I 31.77 1.254e-
09 46-89 BL00243I 31.77 6.451e-
09 3-46 BL00243I 31.77 6.704e-
09 77-120 BL00243I 31.77
7.211e-09 67-110 BL00243I
31.77 7.592e-09 116-159
BL00243I 31.77 8.606e-09 92-135


1822 


BL00198 


4Fe-4S ferredoxins, iron-sulfur binding 
region proteins. 


BL00198 10.43 9.700e-09 6-18 


1822 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 8.024e-14 102-
148 BL00203 13.94 7.750e-13
127-173 BL00203 13.94 1.103e-
12 97-143 BL00203 13.94
3.172e-12 103-149 BL00203
13.94 3.379e-12 92-138 BL00203
13.94 4.207e-12 98-144 BL00203
13.94 4.207e-12 116-162
BL00203 13.94 5.345e-12 71-117
BL00203 13.94 5.345e-12 107-
153 BL00203 13.94 9.897e-12
106-152 BL00203 13.94 1.791e-
11 118-164 BL00203 13.94
2.879e-11 126-172 BL00203
13.94 3.176e-11 87-133 BL00203
13.94 4.758e-11 61-107 BL00203
13.94 5.846e-11 113-159
BL00203 13.94 6.044e-11 112-
158 BL00203 13.94 7.231e-11
93-139 BL00203 13.94 1.287e-10
128-174 BL00203 13.94 2.245e-
10 76-122 BL00203 13.94
3.102e-10 23-69 BL00203 13.94
4.926e-10 78-124 BL00203 13.94
6.362e-10 81-127 BL00203 13.94
6.553e-10 82-128 BL00203 13.94
6.840e-10 111-157 BL00203
13.94 8.851e-10 43-89 BL00203
13.94 8.851e-10 96-142 BL00203
13.94 2.837e-09 8-54 BL00203
13.94 3.296e-09 72-118 BL00203
13.94 3.847e-09 117-163
BL00203 13.94 3.939e-09 86-132
BL00203 13.94 5.592e-09 123-
169 BL00203 13.94 5.776e-09
108-154 BL00203 13.94 6.143e-
09 30-76 BL00203 13.94 6.143e-
09 79-125 BL00203 13.94
6.969e-09 121-167 BL00203

I t\A *y /r 1 o — r\c\ i c an dt aaoai 

13.94 7.61 2e-0y lo-W. s>\jj\}J.\jo 
13.94 7.796e-09 101-147 
BL002U3 ©.io^e-uy 
BL002O3 13.94 9.63 3 e-09 77-123 
dt nnom no/ii nnrwriR 19 


1824 


PR00860 


VERTEBRATE METALLOTHIONEIN 
SIGNATURE 


PR00860B 7.04 2.929e-20 74-88 
PR00860C 9.61 2.400e-12 88-98 


1824 


PR00858 


CRUSTACEAN METALLOTHIONEIN 
SIGNATURE 


PR00858B 5.93 1.419e-ll 70-89 
PR00858B 5.93 7.070e-ll 65-84 


1824 


PR00874 


FUNGI-IV METALLOTHIONEIN 
SIGNATURE 




1824 


BL00243 


Integrins beta chain cysteine-rich domain 
proteins. 


BL00243I 31.77 4.803e-09 65-108


1824 


PR00876 


NEMATODE METALLOTHIONEIN 
SIGNATURE 


T>T3AAB7^n ^ 77 9 lOtp-IO 67-7S 

PR00876A 6.60 5.886e-09 61-74 


1824 


BL00203 


Vertebrate metallothioneins proteins.


BL00203 13.94 7.429e-09 54-100
BL00203 13.94 8.071e-09 52-98
BL00203 13.94 8.806e-09 55-101


1824 


PR00875 


MOLLUSC METALLOTHIONEIN 
SIGNATURE 


PR00875D 5.00 9.471e-09 59-70


1825 


PR00360 


C2 DOMAIN SIGNATURE 


586 


1825 


PR00399 


SYNAPTOTAGMIN SIGNATURE 


PTJDn^OOA Q ^9 8 R75e-09 360- 
376 


1829 


DM01206 


CORONAVIRUS NUCLEOCAPSID 
PROTEIN. 


DM01206B 10.69 8.767e-10 567-
587 DM01206B 10.69 1.000e-09
563-583 




PD01351


PROTEIN REPEAT NEUROFILAMENT 
TRIPL. 


PD01351B 13.72 6.786e-10 196- 
222 PD01351B 13.72 2.597e-09 
198-224 


1829 


BL00035


'POU' domain proteins.


BL00035B 14.46 3.127e-09 634- 
655 


1829 


BL00229 


Tau and MAP proteins tubulin-binding
domain proteins.


BL00229A 23.57 3.182e-09 178-
217 


1829 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 3.647e-11 204-
219 PR00049D 0.00 9.471e-11
209-224 PR00049D 0.00 8.500e- 



WO 02/070539 PCT/US02/05095

10 206-221 PR00049D 0.00
8.500e-10 207-222 PR00049D
0.00 3.746e-09 182-197


1829 


BL00319 


Amyloidogenic glycoprotein extracellular 
domain proteins. 


BL00319C 17.12 3.132e-09 265-
299 BL00319C 17.12 4.553e-09
1013-1047 BL00319C 17.12
5.618e-09 1019-1053 BL00319C
17.12 7.395e-09 267-301 
BL00319C 17.12 7.632e-09 1017- 
1051 


1829 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 3.089e-10 185- 
218 DM00215 19.43 8.393e-10 
190-223 DM00215 19.43 2.373e-
09 186-219 DM00215 19.43
7.102e-09 183-216 DM00215
19.43 8.169e-09 188-221


1829 


BL00422 


Granins proteins. 


BL00422C 1618 fMRRfi-OQ 
291 


1829 


BL00412 


Neuromodulin (GAP-43) proteins. 


BL00412D 16 S4 9 18?<*-1 1 inn* 
1056 BL00412D 16.54 9.12be-10 
1004-1055 BL00412D 16.54
4.857e-09 1003-1054 BL00412D
16.54 8.347e-09 1008-1059
BL00412D 16.54 9.449e-09 1001-
1052 


1829 


PR00832 


PAXILLIN SIGNATURE 


PR00832B 9.87 9.526e-09 377- 
401 


1829 


PD02059 


CORE POLYPROTEIN PROTEIN GAG 
CONTAINS: P.


PD02059B 24.48 9.620e-09 196-
231 


1829 


PF00992 


Troponin. 


PF00992A 16.67 9.882e-09 1005-
1040 


1829 


BL00048 


Protamine P1 proteins.


BL00048 6.39 6.949e-15 569-596
BL00048 6.39 1.885e-14 568-595
BL00048 6.39 3.361e-14 570-597
BL00048 6.39 8.377e-14 577-604
BL00048 6.39 8.377e-14 578-605
BL00048 6.39 3.631e-13 571-598
BL00048 6.39 4.738e-13 576-603
BL00048 6.39 7.369e-13 582-609
BL00048 6.39 2.456e-12 575-602
BL00048 6.39 3.118e-12 573-600
BL00048 6.39 3.515e-12 567-594
BL00048 6.39 5.235e-12 581-608
BL00048 6.39 5.632e-12 583-610
BL00048 6.39 7.221e-12 557-584
BL00048 6.39 2.250e-11 572-599
BL00048 6.39 2.875e-11 585-612
BL00048 6.39 4.875e-11 586-613
BL00048 6.39 5.375e-11 591-618
BL00048 6.39 7.375e-11 589-616
BL00048 6.39 7.500e-11 580-607
BL00048 6.39 8.625e-11 588-615
BL00048 6.39 2.895e-10 563-590
BL00048 6.39 5.382e-10 574-601
BL00048 6.39 5.500e-10 566-593
BL00048 6.39 5.618e-10 584-611
BL00048 6.39 9.171e-10 561-588
BL00048 6.39 9.882e-10 592-619



BL00048 6.39 1.450e-09 590-617 
BL00048 6.39 3.925e-09 560-587 
BL00048 6.39 4.488e-09 562-589 
BL00048 6.39 4.938e-09 579-606 
BL00048 6.39 5.275e-09 564-591 
BL00048 6.39 5.725e-09 558-585 
BL00048 6.39 5.725e-09 587-614
BL00048 6.39 6.625e-09 555-582 
BL00048 6.39 7.075e-09 556-583 
BL00048 6.39 9.438e-09 559-586 
BL00048 6.39 9.888e-09 600-627 


1829 


PD01234 


PROTEIN NUCLEAR BROMODOMAIN
TRANS. 


PD01234B 15.53 1.000e-08 201- 
219 


1830 


BL00092 


N-6 Adenine-specific DNA methylases 
proteins. 


BL00092 5.35 2.000e-09 136-145 


1831 


PR00511 


TEKTIN SIGNATURE 


PR00511A 13.59 3.700e-14 113- 
130 


1833 


PR00764 


COMPLEMENT C9 SIGNATURE 


PR00764F 16.89 2.286e-09 158- 
179 


1833 


BL00022


EGF-like domain proteins. 


BL00022B 7.54 4.600e-09 138- 
145 


1833 


BL01187 


Calcium-binding EGF-like domain proteins 
pattern proteins. 


BL01187B 12.04 4.086e-10 167- 
183 BL01187B 12.04 4.600e-09
104-120


1833 


BL01185 


C-terminal cystine knot proteins. 


BL01185B 21.14 6.929e-09 146- 
195 


1833 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010C 11.16 7.429e-09 109-
120 PR00010C 11.16 7.643e-09 
172-183 PR00010A 11.79 8.846e- 
09 190-202 


1833 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011A 14.06 4.822e-09 208-
227 PR00011D 14.03 8.957e-09 
67-86 


1833 


BL00243 


Integrins beta chain cysteine-rich domain 
proteins. 


BL00243I 31.77 1.000e-08 34-77


1835 


BL01279 


Protein-L-isoaspartate(D-aspartate) O-
methyltransferase signa. 


BL01279A 24.27 3.691e-09 419-
467 


1835 


BL01131 


Ribosomal RNA adenine dimethylases 
proteins. 


BL01131A 26.62 4.600e-09 421-
467 


1835 


BL01230 


RNA methyltransferase trmA family proteins. 


BL01230E 15.79 6.607e-11 571-
587 BL01230A 17.88 8.962e-10 
409-428 BL01230B 11.62 8.475e- 
09 436-449 


1835 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 8.780e-09 598- 
613 


1837 


PD01719 


PRECURSOR GLYCOPROTEIN SIGNAL 
RE. 


PD01719A 12.89 2.603e-11 259-
287 PD01719A 12.89 8.105e-10 

t yy-JL** i 


1838 


BL01162 


Quinone oxidoreductase / zeta-crystallin 
proteins. 


BL01162C 22.80 1.269e-18 151- 
195 BL01162A 15.38 1.265e-11 
64-87 


1838 


BL00279 


Membrane attack complex components / 
perforin proteins. 


BL00279C 31.64 3.156e-09 134- 
188 


1838 


BL00059 


Zinc-containing alcohol dehydrogenases 
proteins. 


BL00059B 16.08 7.273e-09 93- 
121 


1841 


PD01066 


PROTEIN ZINC FINGER ZINC-FINGER 
METAL-BINDING NU. 


PD01066 19.43 5.415e-26 46-85 


1841 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 5.737e-11 140- 
154 PR00048A 10.52 6.087e-10 
224-238 


1841 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 4.130e-12 227- 
244 BL00028 16.07 7.577e-11 
352-369 BL00028 16.07 3.400e- 
10 380-397 BL00028 16.07 
6.400e-10 199-216 BL00028 
16.07 1.257e-09 143-160 
BL00028 16.07 2.029e-09 171- 
188 BL00028 16.07 5.886e-09 
408-425 


1841 


PR00967 


ACUTE MYELOID LEUKEMIA 1 
PROTEIN SIGNATURE 


PR00967I 12.41 8.130e-09 466- 
481 


1841 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 2.038e-10 187- 
200 PD00066 13.92 1.600e-09 
396-409 PD00066 13.92 9.400e- 
09 215-228 


1841 


BL00479 


Phorbol esters / diacylglycerol binding 
domain proteins. 


BL00479A 19.86 6.553e-09 214- 
237 BL00479A 19.86 9.809e-09 
367-390 


1842 


PD02910 


TRANSCRIPTION PROTEIN FACTOR 
REGULATION A. 


PD02910A 15.43 9.839e-09 62-97 


1843 


PD02199 


SUBUNIT HYDROGEN ION TRANSPORT 
T. 


PD02199A 20.58 1.000e-40 10-61 
PD02199D 13.18 1.000e-40 364- 
405 PD02199F 15.02 1.000e-40 
440-482 PD02199J 11.42 1.000e- 
40 723-762 PD02199K 15.22 
1.000e-40 792-831 PD02199G 
9.43 4.447e-24 531-555 
PD02199B 27.90 1.474e-22 263- 
306 PD02199H 13.62 2.636e-21 
576-599 PD02199E 7.56 8.642e- 
19 405-424 PD02199C 17.60 
8.085e-14 313-329 PD02199I 
8.90 4.780e-09 616-624 


1844 


BL00218 


Amino acid permeases proteins. 


BL00218E 23.30 5.920e-10 343- 
383 


1845 


BL00048 


Protamine P1 proteins. 


BL00048 6.39 9.526e-10 160-187 


1845 


BL00422 


Granins proteins. 


BL00422C 16.18 4.000e-09 590- 
618 


1845 


PR00833 


POLLEN ALLERGEN POA PI 
SIGNATURE 


PR00833H 2.30 8.385e-09 943- 
958 


1845 


PF00992 


Troponin. 


PF00992A 16.67 7.900e-13 568- 
603 PF00992A 16.67 4.090e-11 
566-601 PF00992A 16.67 5.817e- 
10 570-605 PF00992A 16.67 
8.479e-10 579-614 PF00992A 
16.67 2.000e-09 564-599 
PF00992A 16.67 4.789e-09 575- 
610 PF00992A 16.67 4.908e-09 
532-567 PF00992A 16.67 6.803e- 
09 536-571 PF00992A 16.67 
7.632e-09 562-597 PF00992A 
16.67 8.697e-09 585-620 
PF00992A 16.67 9.053e-09 583- 
618 PF00992A 16.67 9.289e-09 
516-551 


1845 


PF01140 


Matrix protein (MA), p15. 



PF01140D 15.54 8.500e-12 505- 
540 PF01140D 15.54 7.120e-11 
584-619 PF01140D 15.54 9.760e- 
11 586-621 PF01140D 15.54 
3.813e-10 588-623 PF01140D 
15.54 4.938e-10 563-598 
PF01140D 15.54 6.738e-10 519- 
554 PF01140D 15.54 8.313e-10 
503-538 PF01140D 15.54 9.325e- 
10 549-584 PF01140D 15.54 
9.663e-10 567-602 PF01140D 
15.54 9.775e-10 565-600 
PF01140D 15.54 1.000e-09 582- 
617 PF01140D 15.54 2.884e-09 
575-610 PF01140D 15.54 3.198e- 
09 551-586 PF01140D 15.54 
3.302e-09 517-552 PF01140D 
15.54 4.140e-09 533-568 
PF01140D 15.54 4.872e-09 578- 
613 PF01140D 15.54 6.860e-09 
581-616 PF01140D 15.54 7.174e- 
09 594-629 PF01140D 15.54 
7.384e-09 576-611 PF01140D 
15.54 7.593e-09 572-607 
PF01140D 15.54 8.640e-09 570- 
605 PF01140D 15.54 8.744e-09 
507-542 PF01140D 15.54 8.744e- 
09 596-631 PF01140D 15.54 
9.163e-09 577-612 PF01140D 
15.54 9.267e-09 579-614 
PF01140D 15.54 9.791e-09 574- 
609 PF01140D 15.54 1.000e-08 
531-566 


1848 


BL00811 


Oleosins proteins. 


BL00811B 10.37 9.791e-09 307- 
336 


1852 


BL00415 


Synapsins proteins. 


BL00415N 4.29 4.153e-09 301- 
345 


1852 


DM00668 


ZEIN. 


DM00668B 22.01 8.018e-09 291- 
343 


1853 


BL00107 


Protein kinases ATP-binding region proteins. 


BL00107A 18.39 3.250e-17 133- 
164 


1853 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109E 14.41 3.045e-11 254- 
277 


1853 


PF00791 


Domain present in ZO-1 and Unc5-like netrin 
receptors. 


PF00791B 28.49 2.317e-16 689- 
744 PF00791B 28.49 3.753e-15 
523-578 PF00791B 28.49 4.316e- 
12 656-711 PF00791B 28.49 
1.727e-11 589-644 PF00791B 

PF00791C 20.98 4.913e-11 570- 
609 PF00791B 28.49 6.330e-10 
722-777 PF00791C 20.98 5.853e- 
09 703-742 


1853 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 5.200e-13 722- 
738 PF00023B 14.20 1.000e-12 
652-662 PF00023A 16.03 2.000e- 
12 755-771 PF00023A 16.03 
7.857e-11 656-672 PF00023A 
16.03 8.286e-11 622-638 
PF00023B 14.20 4.682e-09 519- 
529 PF00023A 16.03 6.143e-09 
589-605 PF00023A 16.03 6.786e- 
09 689-705 




PD00078 


REPEAT PROTEIN ANK NUCLEAR 
ANKYR. 


PD00078B 13.14 9.000e-11 582- 
595 PD00078B 13.14 8.435e-09 
649-662 


1854 


PR00449 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449A 13.20 4.971e-14 4-26 


1856 


BL00317 


WAP-type 'four-disulfide core' domain 
proteins. 


BL00317B 14.58 3.550e-13 48-70 




PR00003 


4-DISULPHIDE CORE SIGNATURE 


PR00003C 7.69 6.357e-09 54-64 


1857 


BL01019 


ADP-ribosylation factors family proteins. 


BL01019B 19.49 7.517e-21 95- 
150 


1857 


BL01020 


SAR1 family proteins. 


BL01020C 15.35 2.301e-18 79- 
130 


1857 


PR00328 


GTP-BINDING SAR1 PROTEIN 
SIGNATURE 


PR00328C 13.16 2.841e-10 78- 
104 


1858 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 2.385e-15 128- 
141 PD00066 13.92 5.714e-12 
100-113 


1858 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048B 6.02 7.188e-10 97-107 
PR00048B 6.02 3.842e-09 125- 
135 PR00048A 10.52 6.040e-09 
137-151 


1859 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 1.000e-11 174- 
188 PR00048B 6.02 1.692e-11 
162-172 


1859 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 3.739e-11 165- 
178 


1859 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 2.385e-11 177- 
194 BL00028 16.07 3.769e-11 
121-138 BL00028 16.07 8.269e- 
11 149-166 


1860 


PF00651 


BTB (also known as BR-C/Ttk) domain 
proteins. 


PF00651 15.00 2.895e-11 45-58 


1860 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 2.038e-10 419- 
432 PD00066 13.92 7.231e-10 
391-404 


1860 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 5.696e-10 400- 
414 PR00048A 10.52 8.435e-10 
428-442 


1860 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 8.269e-11 403- 
420 BL00028 16.07 3.400e-10 
375-392 BL00028 16.07 3.057e- 
09 431-448 


1861 


DM00547 


1 kw CHROMO BROMODOMAIN 
SHADOW GLOBAL. 


VM[)\)5Wb 23.43 7.643e-34 606- 
653 DM00547B 11.28 7.907e-16 
155-169 DM00547C 17.30 
8.650e-14 209-231 DM00547D 
11.60 6.500e-13 277-291 
DM00547E 13.94 1.000e-11 307- 
330 


1861 


BL00039 


DEAD-box subfamily ATP-dependent 
helicases proteins. 


BL00039D 21.67 6.379e-10 590- 
636 


1862 


BL01115 


GTP-binding nuclear protein ran proteins. 


BL01115A 10.22 5.330e-11 18-62 


1862 


BL01019 


ADP-ribosylation factors family proteins. 


BL01019A 13.20 1.809e-09 52-92 


1862 


PR00449 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449C 17.27 3.647e-19 59-82 
PR00449A 13.20 7.000e-15 18-40 
PR00449D 10.79 *£75^l* 121- 
135 PR00449E 13.50 8.920e-14 
157-180 PR00449B 14.34 8.500e- 
09 41-58 


1867 


BL01283 


T-box domain proteins. 


BL01283D 11.70 7.868e-31 59-92 
BL01283C 13.05 2.537e-14 25-39 


1867 


PR00937 


T-BOX DOMAIN SIGNATURE 


PR00937D 13.41 5.378e-15 24-39 
PR00937F 12.53 1.450e-12 83-92 
PR00937E 11.86 5.592e-12 62-76 
PR00937C 10.51 5.219e-10 5-15 


1870 


DM01803 


1 HERPESVIRUS GLYCOPROTEIN H. 


DM01803A 10.51 8.699e-09 100- 
121 


1872 


BL00470 


Isocitrate and isopropylmalate 
dehydrogenases proteins. 


BL00470A 16.25 5.179e-14 10-31 
BL00470C 15.43 4.103e-10 223- 
238 BL00470E 16.52 1.900e-09 
287-297 


1873 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 3.893e-09 44-60 
PF00023B 14.20 9.182e-09 40-50 


1874 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 3.893e-09 72-88 
PF00023B 14.20 9.182e-09 68-78 


1877 


PD01066 


PROTEIN ZINC FINGER ZINC-FINGER 
METAL-BINDING NU. 


PD01066 19.43 2.019e-26 51-90 


1877 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 6.786e-16 427- 
444 BL00028 16.07 1.900e-13 
287-304 BL00028 16.07 3.700e- 
13 481-498 BL00028 16.07 
1.000e-12 315-332 BL00028 
16.07 1.000e-12 399-416 
BL00028 16.07 3.348e-12 453- 
470 BL00028 16.07 4.522e-12 
371-388 BL00028 16.07 6.885e- 
11 343-360 BL00028 16.07 
4.600e-10 509-526 


1877 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 8.500e-17 424- 
438 PR00048A 10.52 7.000e-14 
312-326 PR00048A 10.52 1.643e- 
13 396-410 PR00048A 10.52 
3.571e-13 478-492 PR00048B 
6.02 9.000e-12 300-310 
PR00048A 10.52 1.000e-11 506- 
520 PR00048A 10.52 5.737e-11 
340-354 PR00048A 10.52 1.391e- 
10 284-298 PR00048B 6.02 
6.063e-10 412-422 PR00048B 
6.02 1.474e-09 494-504 



1877 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 4.600e-14 415- 
428 PD00066 13.92 7.000e-14 
469-482 PD00066 13.92 5.500e- 
13 303-316 PD00066 13.92 
4.429e-12 331-344 PD00066 
13.92 9.217e-11 497-510 
PD00066 13.92 2.038e-10 387- 
400 PD00066 13.92 6.400e-09 
359-372 


1878 


PR00830 


ENDOPEPTIDASE LA (LON) SERINE 
PROTEASE (S16) SIGNATURE 


PR00830A 8.41 4.927e-13 222- 
242 


1878 


PR00300 


ATP-DEPENDENT CLP PROTEASE ATP- 
BINDING SUBUNIT SIGNATURE 


PR00300A 9.56 1.545e-11 218- 
237 


1878 


PR00051 


BACTERIAL CHROMOSOMAL 
REPLICATION INITIATOR (DNAA) 
SIGNATURE 


PR00051A 10.68 3.647e-10 215- 
236 


1878 


PR00819 


CBXX/CFQX SUPERFAMILY 
SIGNATURE 


PR00819B 10.83 9.213e-10 217- 
233 


1878 


BL00674 


AAA-protein family proteins. 


BL00674B 4.46 9.000e-24 215- 
237 BL00674C 22 60 8 44fte-9fl 
248-291 BL00674D 23.41 
5.140e-18 308-355 BL00674E 
15.24 9.217e-16 390-410 
BL00674A 16.91 5.304e-09 181- 
202 


1878 


BL00113 


Adenylate kinase proteins. 


BL00113A 12 74 8 615e.no 710 
236 


1879 


BL00456 


Sodium:solute symporter family proteins. 


BL00456A 22.59 1.957e-32 35-90 

BL00456B 1 8.94 9.780e-17 111- 
141 


1880 


BL00456 


Sodium:solute symporter family proteins. 


BL00456A 22.59 1.957e-32 35-90 
BL00456C 24.55 1.225e-11 173- 
228 BL00456B 18.94 9.780e-17 
111-141 


1884 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 1.818e-11 159- 
182 


1885 


PR00830 


ENDOPEPTIDASE LA (LON) SERINE 
PROTEASE (S16) SIGNATURE 


PR00830A 8.41 5.897e-10 352- 
372 


1885 


BL00847 


MCM family proteins. 


BL00847D 15.16 8.568e-25 343- 
384 BL00847B 24 76 8 971e«9<; 
194-237 BL00847C 18.79 9.270e- 
10 301-335 


1885 


PR00449 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449A 13.20 1.931e-09 346- 
368 


1885 


BL00674 


AAA-protein family proteins. 


BL00674B 4.46 5.378e-09 345- 
367 


1886 


PF00651 


BTB (also known as BR-C/Ttk) domain 
proteins. 


PF00651 15.00 7.000e-10 50-63 


1886 


PR00501 


KELCH REPEAT SIGNATURE 


PR00501A 8.25 5.295e-09 507- 
521 


1886 


DM00099 


4 kw A55R REDUCTASE TERMINAL 
DIHYDROPTERIDINE. 


DM00099B 14.73 6.625e-09 479- 
489 


1887 


PD00306 


PROTEIN GLYCOPROTEIN PRECURSOR 
RE. 


PD00306A 10.26 6.625e-13 544- 
558 


1887 


PD01364 


MUCIN GLYCOPROTEIN PRECURSOR 
MEM. 


PD01364B 13.94 5.500e-09 544- 
560 


1888 


PD00306 


PROTEIN GLYCOPROTEIN PRECURSOR 
RE. 


PD00306A 10.26 6.625e-13 544- 
558 


1888 


PD01364 


MUCIN GLYCOPROTEIN PRECURSOR 
MEM. 


PD01364B 13.94 5.500e-09 544- 
560 


1889 


PD00306 


PROTEIN GLYCOPROTEIN PRECURSOR 
RE. 


PD00306A 10.26 6.625e-13 544- 
558 


1889 


PD01364 


MUCIN GLYCOPROTEIN PRECURSOR 
MEM. 


PD01364B 13.94 5.500e-09 544- 
560 


1890 


PF00938 


Lipoprotein. 


PF00938E 19.50 6.096e-09 272- 
307 


1891 


PF00925 


GTP cyclohydrolase II. 


PF00925F 13.23 9.850e-09 356- 
367 


1893 


BL00226 


Intermediate filaments proteins. 


BL00226A 12.77 5.555e-13 139- 
154 


1895 


PF00035 


Double-stranded RNA binding motif. 


PF00035B 12.08 7.750e-09 273- 
287 


1896 


PF00622 


Domain in SPla and the RYanodine Receptor. 


PF00622B 21.00 9.250e-11 170- 
192 



TABLE 4 



SEQ ID 


Model 


Description 


E-value 


Score 


Repeats 


Position 


950 


UCH-2 


Ubiquitin carboxyl-terminal hydrolase 
family 


o.oe-zo 


on .c 


i 
i 




950 


UCH-1 


Ubiquitin carboxyl-terminal hydrolases 
famil 


O 1 <v 

z.je-13 




i 
i 




952 


efhand 


EF hand 






i 

X 


322-350 


952 


Adeno_E 
1A 


Early E1A protein 


8.9 


-168.3 


1 


298-448 


953 


SH2 


SH2 domain 


1.5e-16 


68.4 


1 


320-396 


954 


SH2 


SH2 domain 


1.5e-16 


68.4 


1 


347-423 


955 


RCC1 


Regulator of chromosome condensation 
(RCC1) 


1.3e-13 


58.6 


4 


148- 

197:200- 
249:318- 
367:370- 
418 


958 


BRCT 


BRCA1 C Terminus (BRCT) domain 


7.2 


1.2 


1 


45-130 


962 


UQ_con 


Ubiquitin-conjugating enzyme 


5.4e-39 


143.0 


1 


2-117 


963 


UQ_con 


Ubiquitin-conjugating enzyme 


1.6e-57 


204.5 


1 


2-132 


971 


kinesin 


Kinesin motor domain 


2.2e- 
154 


526.3 


1 


47-372 


971 


WD40 


WD domain, G-beta repeat 


1e-53 


191.9 


7 


1327- 

1361:1367- 

1402:1432- 

1466:1472- 

1511:1523- 

1557:1564- 

1600:1606- 

1640 


971 


filament 


Intermediate filament protein 


2.6 


-195.6 


1 


423-805 


971 


HDV ag 


Hepatitis delta virus delta antigen 


6.2 


-47.5 


1 


703-880 


971 


PFEMP 


Plasmodium falciparum erythrocyte 
membrane p 


8.9 


-86.8 


1 


475-585 


971 


G6PD 


Glucose-6-phosphate dehydrogenase, 
NAD bindi 


9.6 


-123.8 


1 


912-1049 


971 


DUF232 


Putative transcriptional regulator 


9.7 


-30.1 


1 


616-750 


975 


Kelch 


Kelch motif 


5.7e-62 


219.3 


4 


267- 

312:314- 

359:361- 

406:408- 

453 


975 


BTB 


BTB/POZ domain 


3.1e-38 


140.4 


1 


23-130 


977 


kinesin 


Kinesin motor domain 


9.7e- 
153 


520.8 


1 


53-353 


977 


FHA 


FHA domain 


0.078 


16.9 




470-534 


977 


KMP11 


Kinetoplastid membrane protein 11 


3.5 


-3.2 


■ | 


355-438 


977 


SART-1 


SART-1 family 


7.1 


-360.1 


1 


325-881 


977 


Transposa 
se 22 


L1 transposable element 


8.7 


-180.7 


It— 


322-62? 


979 


Ribosoma 
l_L10 


Ribosomal protein L10 


6 


-33.3 




70-173 


980 


Aa trans 


Transmembrane amino acid transporter 


1.6e-75 


264.3 




69-479 


980 


oxidored_ 
q1 


NADH-Ubiquinone/plastoquinone 


7.6 


-169.3 


- 


63-326 


980 


xan_ur_pe 
rmease 


Permease family 


R A 


-ZVl.,5 




ljy-4/1 


980 


Trp_Tyr_ 
perm 


Tryptophan/tyrosine permease family 


9.6 


-297.9 




70-474 


982 


HYR 


HYR domain 


ze-jj 


101 1 

1 J 1.1 


z 


lUO- 

187:188- 

ZO/ 


982 


EGF 


EGF-like domain 






'i 


7/17 
/HZ- 

773:780- 

61 l.o lo- 

849 


982 


sushi 


Sushi domain (SCR repeat) 


0.28 






lftd-777 

332 


982 


TNFR c6 


TNFR/NGFR cysteine-rich region 


0.74 


9.6 




563-601 


982 


laminin E 
GF 


Laminin EGF-like (Domains III and V) 


6.6 


-12.8 


^— 


746-791 1 


982 


metalthio 


Metallothionein 


7.6 


-11.6 


-r : — — 


HA A OA/1 


982 


HMG_Co 
A_synt 


Hydroxymethylglutaryl-coenzyme A 
synthas 


8.7 


-342.7 


-j 


JjO-DZj 


986 


MHC_II_ 
alpha 


Class II histocompatibility antigen, alp 


8.8e-13 






37- 1 f)£ 


986 




Immunoglobulin domain 


8e-05 






179.1 Rrt 


987 


LRR 


Leucine Rich Repeat 


2.6e-12 


54.3 


4 


O0--/1 .yz- 
114:115- 
137:138- 
159 


987 


UVR 


UvrB/uvrC motif 


5.3 


-1.5 


1 


453-486 


990 


UVR 


UvrB/uvrC motif 


3.8 


-0.3 


1 


552-588 


991 


CK_II_be 
ta 


Casein kinase II regulatory subunit 


4.5e-69 


242.9 


1 


5-124 


994 


RNAj>oi 
B 


RNA polymerase beta subunit 


0 


1199.4 


1 


26-1010 


994 


PHD 


PHD-finger 


5.9 


-17.0 


1 


1013-1048 


995 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


4.9e-05 


30.2 


1 


10-59 


995 


zf-B box 


B-box zinc finger 


0.0022 


24.7 


1 


92-134 


996 


HLH 


Helix-loop-helix DNA-binding domain 


2e-07 


38.1 


1 


276-327 


997 


ras 


Ras family 


3.3e-12 


2.0 


1 


23-145 


998 


pkinase 


Protein kinase domain 


3.1e-08 


-16.4 


1 


1-139 


1000 


ig 


Immunoglobulin domain 


2.7e-06 


34.4 


2 


42-95:225- 
281 


1001 


Y_phosph 
atase 


Protein-tyrosine phosphatase 


4.2 


-86.2 


I 


180-409 


1002 


PX 


PX domain 


5.8e-19 


76.4 


1 


22-138 


1002 


Peptidase 
S21 


Assemblin (Peptidase family S21) 


8.9 


-173.6 


I 


76-352 


1003 


Y_phosph 
atase 


Protein-tyrosine phosphatase 


1.1 


-79.3 


1 


98-327 


1004 


Y_phosph 
atase 


Protein-tyrosine phosphatase 


4.2 


-86.2 




180-409 


1008 


oxidored_ 
q1 


NADH-Ubiquinone/plastoquinone 
(complex I) 


5.2 


-165.9 


i 

i • ' ■ ■ ' 


64-311 


1010 


filament 


Intermediate filament protein 


iff 


-1 Of) ^ 


— 


331-647 


1010 


Tektin 


Tektin family 


1.5 


99R S 




192-507 


1010 


bZIP 


bZIP transcription factor 


4.4 


-3.7 


i 


253-317 


1010 


spectrin 


Spectrin repeat 


o.o 


-lo.U. 


-: ~— 


390-49 Q 


1010 


SART-1 


SART-1 family 


o 
O 








1010 


MyosinJ 
ail 


Myosin tail 


9.8 


-555.1 


1 


6-734 


1013 


Defensin_ 
propep 


Defensin propeptide 


1.2e-26 


102.0 




45-97 


1013 


defensins 


Mammalian defensin 


7.3e-14 


59.5 




110-138 


1014 


filament 


Intermediate filament protein 


0.69 


-180.5 




314-579 


1014 


PolyA_po 
1 


Poly A polymerase family 


2.5 


-64.5 




348-463 


1014 


ERM 


Ezrin/radixin/moesin family 


8.7 


-223.3 




226-487 


1014 


Transposa 
se 12 


Transposase 


9.4 


-152.0 




155-465 


1015 


zf-C2H2 


Zinc finger, C2H2 type 


1.2e-55 


198.2 


13 


129- 

152:349- 
371:379- 
401:407- 
429:446- 
468:474- 
496:505- 
527:533- 
556:562- 
58d:903- 
yZDjyji- 

981:987- 
1010 


1015 


60s_ribos 
omal 


60s Acidic ribosomal protein 


0.23 


-21.0 


1 


61-194 


1015 


TFIIS 


Transcription factor S-II (TFIIS) 


0.82 


2.1 


1 


446-484 


1015 


rubredoxi 
n 


Rubredoxin 


2.8 


-8.3 


1 




1015 


zf-BED 


BED zinc finger 


9 


-7.0 


1 


y /z-iui i 


1021 


SSF 


Sodium:solute symporter family 


1.7e-05 


-65.8 


1 


->-lo4 


1028 


zf-C2H2 


Zinc finger, C2H2 type 


4.3e-30 


113.4 




1 (\C\ 

122:132- 
182:188- 

910-916- 
238 


1028 


KRAB 


KRAB box 


7.1e-24 


92.8 


1 


4-44 


1028 


zf-BED 


BED zinc finger 


0.63 


3.4 


1 


78-123 


1029 


fn3 


Fibronectin type III domain 


1.8e-13 


58.2 


1 


146-231 


1029 


ig 


Immunoglobulin domain 


0.0013 


25.5 


1 


275-335 


1031 


polypreny 
lsynt 


Polyprenyl synthetase 


0.013 


-82.6 


1 


119-332 


1032 


Defensin_ 
propep 


Defensin propeptide 


1.2e-26 


102.0 


1 


68-120 


1032 


defensins 


Mammalian defensin 


7.3e-14 


59.5 


1 


133-161 


1033 


MAGE 


MAGE family 


3.8e-34 


126.8 


1 


1-208 


LRR 


Leucine Rich Repeat 


2.3e-17 


71.1 


6 


62-84:87- 

110:111- 

134:135- 

158:159- 

181:186- 

209 


1034 


LRRCT 


Leucine rich repeat C-terminal domain 


7.4e-05 


29.6 




221-271 


1034 




Immunoglobulin domain 


0.061 


19.9 




283-343 


1034 


LRRNT 


Leucine rich repeat N-terminal domain 


0.19 


18.3 




33-60 


1035 


CoaE 


Dephospho-CoA kinase 


3e-93 


323.2 




359-537 


1035 


Cytidylylt 
ransf 


Cytidylyltransferase 


8.3e-06 


31.4 




191-315 


1035 


SKI 


Shikimate kinase 


0.64 


-65.2 




356-510 


1035 


ArgK 


ArgK protein 


7.5 


-212.4 


i 


341-541 


1038 


lipase 


Lipase 


1.1e-12 


49.8 




1-198 


1044 


homeobox 


Homeobox domain 


2.6e-30 


114.1 


1 


155-211 


1048 


SKI 


Shikimate kinase 


0.49 


-63.6 




6-185 


1049 


fn3 


Fibronectin type III domain 


4.7e-78 


272.7 


5 


159- 

245:257- 
343:360- 
459:480- 
565:577- 
665 


1049 


ig 


Immunoglobulin domain 


2.5e-05 


31.1 


-j 


79-137 


1050 




Immunoglobulin domain 


0.019 


21.5 




1-58:93- 
142 


1051 


MHC_I 


Class I Histocompatibility antigen, 
domains 


2.1 


-83.5 




24-116 




zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


0.88 


0,8 


1 


198-232 


1 AC7 


p450 


Cytochrome P450 


5.7e-08 


-1.4 




66-377 


1060 


Ribosoma 
l_S21 


Ribosomal protein S21 


0.5 


-7.2 


1 


753-810 


1060 


Tropomyo 
sin 


Tropomyosin 


9 


-120.2 


— 


208-412 


1064 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


0.88 


0.8 


T 


141-175 


1065 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


0.88 


0.8 




191-225 


1066 


aa_perme 
ases 


Amino acid permease 


3.2e-06 


-173.3 




46-455 


1066 


Aa trans 


Transmembrane amino acid transporter 


0.4 


-172.1 




48-434 


1066 


Trp_Tyr_ 
perm 


Tryptophan/tyrosine permease family 


2.6 


-282.5 




48-392 


1066 


oxidored_ 
q1 


NADH-Ubiquinone/plastoquinone 


4.6 


-164.9 




170-421 


1066 


xan_ur_pe 
rmease 


Permease family 


9.7 


-202.9 




60-397 


1068 


zf-C2H2 


Zinc finger, C2H2 type 


6.8e- 
125 


428.3 


22 


101- 

128:169- 
191:197- 
219:225- 
247:253- 
275:281- 
303:309- 
324:330- 
352:358- 
380:386- 
408:414- 
436:442- 
464:470- 






WO 02/070539 




PCT/US02/05095 

















492:524- 
546:552- 
565:571- 

A'ii'A.'i'i 

OH J*. 030- 

705:711- 
733:739- 
761 


1068 


KRAB 


KRAB box 


4.1e-25 






4-44 


1068 


GATA 


GATA zinc finger 


9 




-j 


970-^95 


1068 


LIM 


LIM domain 


4.3 


-17.1 




311-368 


1068 


zf-BED 


BED zinc finger 


4.7 


-4.4 




315-353 


1068 


zf-TRAF 


TRAF-type zinc finger 


4.9 


-6.5 




163-212 


1068 


FYVE 


FYVE zinc finger 


6.7 


-24.5 




245-320 


1068 


TFIIS 


Transcription factor S-II (TFIIS) 


7.3 


-6.0 




281-319 


1070 


C2 


C2 domain 


1.1e-38 


142.0 


2 


178- 

263:319- 
406 


1071 


ig 


Immunoglobulin domain 


1.9 


12.9 


1 


19-66 


1075 


WD40 


WD domain, G-beta repeat 


1.7e-27 


104.8 


6 


12-53:59- 

96:160- 

194:200- 

236:245- 

281:287- 

322 


1078 


WD40 


WD domain, G-beta repeat 


2.2e-25 


97.8 


5 


16-52:60- 

96:106- 

142:191- 

227:234- 

270 


1078 


FYVE 


FYVE zinc finger 


2e-22 


87.9 


1 


279-353 


1078 


DnaJ CX 

XCXGX 

G 


DnaJ central domain (4 repeats) 


3.1 


-45.6 


1 


304-360 


1079 


PH 


PH domain 


1.5e-25 


98.3 


2 


120- 

215:298- 
391 


1081 


KRAB 


KRAB box 


6e-21 


83.0 


1 


45-81 


1082 


Ribosoma 
l_L11 


Ribosomal protein L11 


1e-64 


228.4 


1 


117-248 


1084 


zf-C2H2 


Zinc finger, C2H2 type 


6.4e- 
185 


627.7 


25 


109- 

131:137- 

159:165- 

187:221- 

243:249- 

271:277- 

299:305- 

327:333- 

355:361- 

383:389- 

411:445- 

467:473- 

495:501- 

523:529- 

551:557- 

579:585- 

607:641- 
663:669- 

691:697- 
719:753- 

775:781- 
803:809- 

831:837- 

859:865- 

887:893- 

915 


1084 


CBM 1 


Fungal cellulose binding domain 


2.6 


4.4 


1 


38-66 


1084 


LIM 


LIM domain 


3 


-15.7 


1 


783-847 


1084 


zf-BED 


BED zinc finger 


5.9 


-5.3 


2 


205- 

244:737- 

776 


1085 


Band 7 


SPFH domain / Band 7 family 


8.1e-42 


152.3 


1 


39-214 


1087 


TPR 


TPR Domain 


4.5e-16 


66.8 


2 


58-91:92- 
125 


1090 


WH1 


WH1 domain 


0.0017 


11.6 


1 


11-119 


1091 


zf-DHHC 


DHHC zinc finger domain 


0.033 


-11.5 


1 


120-158 


1094 


Calx-beta 


Calx-beta domain 


0.19 


-11.6 


1 


23-117 


1095 


zf-C2H2 


Zinc finger, C2H2 type 


5.4e-82 


285.8 


12 


288- 

311:337- 

359:365- 

387:393- 

415:421- 

443:449- 

471:477- 

499:505- 

527:533- 

555:561- 

583:589- 

611:617- 


1095 


SCAN 


SCAN domain 


1.5e-54 


194.6 


1 


46-141 


1095 


zf-BED 


BED zinc finger 


3.3 


-3.0 


9 


472:574- 


1097 


7tm_2 


7 transmembrane receptor (Secretin 
family) 


6.8e-21 


82.8 


1 


325-580 


1097 


GPS 


Latrophilin/CL-l-like GPS domain 


9.5e-13 


55.8 


1 


273-323 


1097 


Srg 


C.elegans Srg family integral 
membrane prote 


4.5 


-217.5 


1 


309-565 


1099 


lectin c 


Lectin C-type domain 


0.0011 


7.2 


1 
I 


a inn 


1100 


PDZ 


PDZ domain (Also known as DHR or 
GLGF) 


0.0014 


25.3 


1 


12-91 


1100 


Tymo 45 
kd 70kd 


Tymovirus 45/70Kd protein 


1.8 


-283.9 


1 


1-398 


1101 


cadherin 


Cadherin domain 


8.9e-95 


328.3 


5 


64- 

156:170- 
265:279- 
381:394- 
485:498- 
595 


1101 


Cadherin_ 
C term 


Cadherin cytoplasmic region 


4.7e-80 


279.4 


1 


643-794 


1103 


COesteras 
e 


Carboxylesterase 


0.98 


-265.9 


1 


31-265 


1104 


DSPc 


Dual specificity phosphatase, catalytic 
doma 


9.7e-30 


112.2 


1 


133-315 










1105 


efhand 


EF hand 


3.1e-21 


84.0 


3 


124- 

152:160- 
188:208- 
236 


1108 


zf-C2H2 


Zinc finger, C2H2 type 


3.5e-68 


239.9 


10 


189- 

212:240- 
262:268- 
290:296- 
319:325- 
347:353- 

375:382- 
404:909- 
931:937- 
960:966- 
988 


1108 


SET 


SET domain 


0.0012 


-18.9 


1 


37-175 


1108 


zf-BED 


BED zinc finger 

■ 


0.1 


10.4 


2 

_ 


270- 
320:922- 

Vol 


1108 


FYVE 


FYVE zinc finger 


6.8 


-24.6 


J 


O/CO 1&A 


1109 


Nucleosid 
e_tra2 


Na+ dependent nucleoside transporter 


2.5e- 

1 OO 


635.7 





too tztn 


1109 


TLC 


TLC ATP/ADP transporter 


3.0 


-Joz.u 






1109 


ATP- 
synt_A 


ATP synthase A chain 


6.5 


-79.5 




365-503 


1110 


PHD 


PHD-finger 


5.2 


-16.5 


■ 


77-120 


1113 


Peptidase 
C48 


Ulp1 protease family, C-terminal cataly 


6.5e-34 


126.1 




254-415 


1114 


ras 


Ras family 


5.5e-31 


116.4 




ca ooo 


1114 


arf 


ADP-ribosylation factor family 


0.0054 


-80.3 


i 


37-213 


1115 


SPRY 


SPRY domain 


7.3e-10 


46.2 


1 


281-419 


1115 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


0.00076 


26.2 




88-134 


1116 


LRR 


Leucine Rich Repeat 


2.1e-05 


31.4 


6 


37-64:65- 
92:93- 
120:121- 
147:175- 
198:255- 
278 


1117 


RasGAP 


GTPase-activator protein for Ras-like 
GTPase 


6.4e-28 


106.2 


1 


268-440 


1117 


PH 


PH domain 


U.Zo 


13./ 





7-7K 
/-/ o 


1117 


C2 


C2 domain 


1 o 






01-171 


1117 


bZIP 


bZIP transcription factor 


9 


-6.7 




997-1054 


1118 


COX3 


Cytochrome c oxidase subunit III 


1 *7 
I./ 


-zzo.y 


— 


77-105 


1118 


sugar tr 


Sugar (and other) transporter 


O O 

z.z 


1 TO 'X 


J 




1120 


LMWPc 


Low molecular weight phosphotyrosine 
protein 


y. /e-jo 


1 08 6 




7-1 3R 


1 17? 


1V1 


M protein repeat 


6.6 


13.1 


2 


148- 

168:216- 
236 


1123 


lectin c 


Lectin C-type domain 


4.3e-11 


50.3 


1 


579-646 


1123 


Tropomyo 
sin 


Tropomyosin 


0.17 


-90.1 


1 


304-500 


1123 


filament 


Intermediate filament protein 


0.17 


-164.6 


1 


287-537 


1123 


spectrin 


Spectrin repeat 


5.8 


-18.2 


1 


422-523 


1127 


vwa 


von Willebrand factor type A domain 


4.1e-63 


223.1 


1 


252-450 


1127 


trypsin 


Trypsin 


2.1e-43 


157.6 


1 


463-734 


1127 


sushi 


Sushi domain (SCR repeat) 


2.4e-24 


94.3 


3 


37-85:86- 
140:147- 
200 


1128 


Neur_cha 
n_memb 


Neurotransmitter-gated ion-channel tra 


1e-88 


308.2 


1 


84-334 


1128 


oxidored 
ql N 


NADH-Ubiquinone oxidoreductase 


7 


-14.7 


1 


123-183 


1129 


C4 


C-terminal tandem repeated domain in 
type 4 


1.4e- 
148 


507.0 


2 


477- 

584:585- 

699 


1129 


Collagen 


Collagen triple helix repeat (20 copies) 


1.9e-60 


214.3 


7 


20-78:84- 

142:143- 

202:205- 

265:266- 

325:329- 

388:405- 

464 


1132 


filament 


Intermediate filament protein 


2.1 


-193.2 


1 


90-330 


1132 


Tropomyo 
sin 


Tropomyosin 


7.7 


-119.0 


1 


151-353 


1135 


zf-C2H2 


Zinc finger, C2H2 type 


8.3e-26 


99.2 


•j 


Z/O- 

303:312- 

369:375- 
399:405- 
429 


1136 


Ribosoma 
l_S2 


Ribosomal protein S2 


1.7e-78 


274.2 


1 


34-198 


1137 


ATP- 
synt ab 
N 


ATP synthase alpha/beta family, beta- 
ba 


5.2e-26 


99.8 


1 


uj- iji,y 


1139 


ATP- 
synt_ab 


ATP synthase alpha/beta family, 
nucleot 


3.1e-06 


-33.1 


1 


71-183 


1139 


ATP- 
synt ab 
N 


ATP synthase alpha/beta family, beta- 

ba * 


0.015 


14.4 


1 


10-68 


1140 


ATP- 
synt ab 
N 


ATP synthase alpha/beta family, beta- 
ba 


5.2e-26 


99.8 


1 


63-129 


1140 


ATP- 
synt_ab 


ATP synthase alpha/beta family, 
nucleot 


2.6e-06 


-31.8 


1 


132-261 


1141 


ank 


Ankyrin repeat 


6.7e-34 


126.0 


3 


463- 
495:496- 
528:529- 
561 


1141 


BRCT 


BRCA1 C Terminus (BRCT) domain 


1.5e-15 


65.1 


2 


578- 

689:705- 

812 


1141 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


U.UoJ 


ft T 

9.7 


1 


86-122 


1143 


GTP EFT 
U 


Elongation factor Tu GTP binding 
domain 


0.031 


-59.5 


1 


68-140 


1145 


RF-1 


Peptidyl-tRNA hydrolase domain 


3.3e-05 


10.5 


1 


46-159 


1146 


RF-1 


Peptidyl-tRNA hydrolase domain 


6.4 


-51.5 


1 


46-114 


1148 


WD40 


WD domain, G-beta repeat 


6.9e-07 


36.3 


1 


44-80 


1149 


Band 41 


FERM domain (Band 4.1 family) 


1.1e-77 


271.5 


1 


45-235 


1150 


Metallothi 
o PEC 


Plant PEC family metallothionein 


5 


-38.3 


1 


90-139 


1153 


pkinase 


Protein kinase domain 


1.7e-90 


314.1 


1 


43-299 


1155 


SCP 


SCP-like extracellular protein 


6.2e-14 


56.1 


1 


28-200 


1156 


ras 


Ras family 


1.5e-15 


40.0 


1 


5-98 


1159 


6PF2K 


6-phosphofructo-2-kinase 


5.2e- 
152 


518.4 


1 


26-249 


1159 


PGAM 


Phosphoglycerate mutase family 


6e-100 


345.5 


1 


250-435 


1160 


zf-C2H2 


Zinc finger, C2H2 type 


1.3e- 
104 


361.0 


13 


223- 

245:251- 

273:279- 

301:307- 

329:335- 

357:363- 

385:391- 

413:419- 

441:447- 

469:475- 

497:501- 

523:529- 

551:557- 

579 


1160 


PHD 


PHD-finger 


0.55 


-7.5 


1 


448-509 


1160 


zf-BED 


BED zinc finger 


1.4 


0.3 


2 


264- 

302:542- 

580 


1160 


DnaJ CX 

XGXGX 

G 


DnaJ central domain (4 repeats) 


2.9 


-45.3 


1 


509-572 


1160 


LIM 


LIM domain 


7.7 


-19.2 


1 


477-539 


1160 


TFIIS


Transcription factor S-II (TFIIS)


7.9 


-6.3 


1 


450-485 


1162 


Patatin 


Patatin-like phospholipase 


0.00033


-0.6 


1 


1-171 


1163 


pkinase 


Protein kinase domain 


2.3e-94 


326.9 


1 


53-303 


1163 


RIO1


RIO1/ZK632.3/MJ0444 family 


0.37 


-100.2 


1 


47-245 


1164 


Oxysterol 
BP 


Oxysterol-binding protein 


3.8e-47 


170.0 


1 




1166 


OATP_C


Organic Anion Transporter Polypeptide 
(OATP) 


9.9e- 
193 


660.3 


1 


Oo-44J 


1166 


OATP_N 


Organic Anion Transporter Polypeptide 
(OATP) 


le-o7 




i 
i 




1166 


7tm 5 


7TM chemoreceptor 


O.I 


-10 / .*f 


i 
i 




1166 


sugar tr 


Sugar (and other) transporter 


7.8 


-195.1 


1 


48-570 


1167 


pentaxin 


Pentaxin family 


2.3e-07


-/.J 


1 


9* Oft 


1168 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


1.7e-05 


Jl./ 


i 
l 


DO /-J 


1169 


Peptidase 
M1


Peptidase family M1


4e-156 


532.1 


1 


69-458 


1170 


ig 


Immunoglobulin domain 


0.0016 


25.1 


1 


30-109 


1172 


BTB 


BTB/POZ domain 


8.7e-31


1 OO A 


1 




1173 


F-box 


F-box domain 


0.0019 


24.9 


1 


16-64 


1174 


TPR 


TPR Domain 


4.5 


9.3


1 


301-334


1175 


2OG-
FeII_Oxy


2OG-Fe(II) oxygenase superfamily


1.6e-06 


35.1 


1 


527-648 


1175 


TPR 


TPR Domain 


4.5 


9.3 


1 


301-334 


1176 


2OG-
FeII_Oxy


2OG-Fe(II) oxygenase superfamily


1.6e-06 


35.1 


1 


557-678 


1177 


Na_Ca_E

X 


Sodium/calcium exchanger protein 


1.1e-23


92.1 


1 


236-381 


1179 


lactamase 
B 


Metallo-beta-lactamase superfamily 


0.059 


-4.5 


1 


13-212 


1180 


fibrinogen 
C 


Fibrinogen beta and gamma chains, C- 
term 


1.6e-32 


121.5 


1 


207-416 
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SEQ II> 
1181 


Model 

MIF 


Description 

Macrophage migration inhibitory factor 
(MIF) 


E-value 

7.6e-67 


Score 
235.5 


Repeats 
1


Position
2-115


1182 


SSF 


Sodium:solute symporter family


1.7e-
234 


792.4 


1


69-503 


1182 


xan_ur_pe 
rmease 


Permease family 


4.3 


-193.8


1


143-514 


1182 


PNTB


NAD(P) transhydrogenase beta subunit


5 


-389.7 


1


33-325 


1182 


60KD_IM
P 


60Kd inner membrane protein 


6.2 


-129.6 


1


113-241


1182 


oxidored 
ql 


NADH-Ubiquinone/plastoquinone 


7.4 


-169.1 


1


87-298 


1183 


Cation_ef 
flux 


Cation efflux family 


2.3e-58 


207.3 


1


73-311


1185 


AAA 


ATPase family associated with various 
cel


4e-89 


309.5 


2 


236- 

421:500- 

620 


1185 


HypB Ur 
eG 


HypB/UreG nucleotide-binding domain


3.2 


-65.8 




234-337 


1185 


ArsA_AT 
Pase 


Anion-transporting ATPase 


8.5 


-195.7 


1


234-482 


1186 


HCO3_co
transp


HCO3- transporter family


0 


1389.6 




141-1023 


1186 


xan_ur_pe
rmease 


Permease family 


0.33 


-164.5 




518-985 


1187 


homeobox 


Homeobox domain 


2.4e-16 


67.7 




51-107 


1188 


efhand 


EF hand 


6.7 


9.0 




13-41 


1191 


GST_C 


Glutathione S-transferase, C-terminal 
domain 


0.93 


0.3 




134-326 


1194 
1195 


PPR 
thiored 


PPR repeat 
Thioredoxin 


0.0019 


24.9 




14-48 


1197 


ENV_pol 
yprotein 


ENV polyprotein (coat polyprotein)


0. 018 

1 n A f\o 

1. ze-Uo 


-13.7 

O A 1 

-24. 1 




390-497 
86-529 


1200 


UQ con 


Ubiquitin-conjugating enzyme 


1.3e-23 


91.9 




60-190 


1202 


7tm_1


7 transmembrane receptor (rhodopsin 
family) 


2.1e-23 


91.2 




59-306 


1202


7tm 5 


7TM chemoreceptor 


4.3 


-164.7 




37-314 


1203 


7tm_1


7 transmembrane receptor (rhodopsin 
family) 


5.9e-37 


136.2 




59-341 


1203


7tm 5 


7TM chemoreceptor


2.2 


-159.4 




37-338 


1 OAii 

1204 
lzU4 


SH3 

UBA | 


SH3 domain 
UBA/TS-N domain 


2.5e-05 


31.1




257-317 




PGAM 


Phosphoglycerate mutase family 


0.00013 
0.00044 


28.8 
-75.5 




36-76 
438-625 


1205 


heme_1


Heme/Steroid binding domain 


0.00053 


19.5 




37-112 


1207 


transmem 
brane4 


Tetraspanin family 


0.29 


-69.8 




11-110 


1208 


OATP_C 


Organic Anion Transporter Polypeptide 


1.3e- 
135 


464.0




148-524 


1208 


kazal 


Kazal-type serine protease inhibitor d 


0.11 


4.9 




555-601 


1208 


sugar tr 




0.15


-145.7 




128-626 


1208 


lig chan 


Ligand-gated ion channel 


3.3 


-153.6 




193-524 


1208 


7tm 1 


7 transmembrane receptor 


4.2 


-115.5




177-473 


1208 


Cytidylylt 
rans 


Phosphatidate cytidylyltransferase 


4.6 


-87.5 




177-268 


1215 


thyroglob 
ulin_1


Thyroglobulin type-1 repeat


3.2e-35 


130.4 


2 


90- 

153:216- 
281 


1215 
1215 


kazal 

efhand


Kazal-type serine protease inhibitor 
EF hand 


7.5e-09 
0.057 


42.8 
20.0 


1

2 


40-84 
351- 
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379:388- 
416 


1216 


annexin 


Annexin 


7e-61 


215.7 


2 


72- 

139:144- 
211 


1217 


annexin 


Annexin 


2.2e-86 


300.4 


3 


44- 

111:116- 
183:199-
267 


1218 


annexin 


Annexin 


7e-61 


215.7 


2 


44- 

111:116-


1219 


Armadillo 
_seg 


Armadillo/beta-catenin-like repeat 


5.3e-30 


113.1 


5 


351- 
jyi:4U5- 

498:499- 
541:543- 
585 


1219 


GA 


GA module 


8.3 


-5.7 


1 


180-224 


1221 


p450 


Cytochrome P450 


1e-122


421.1


1 


30-483 


1222 


p450 


Cytochrome P450 


1.7e-06 


-35.8 


1 


30-328 


1223 


A2M_N 


Alpha-2-macroglobulin family N- 
terminal regi 


4.3e-12 


-71.6 


1 


1-468 


1225 


Thymosin 


Thymosin beta-4 family 


2.3e-16


67.8 


1 


2-41 


1227 


WD40 


WD domain, G-beta repeat 


1.6e-37 


138.1 


7 


115- 

151:165- 
201:207- 
244:250- 
286:293- 
328:334- 
370:391- 
431 


1236 


F-box 


F-box domain 


5e-07 


36.8 


1 


210-258 


1236 


UvrD- 
helicase 


UvrD/REP helicase 


0.00011 


-157.2 


1 


441-920 


1237 


LRR 


Leucine Rich Repeat 


2.3e-25 


97.7 


8 


42-65:66- 

88:89- 

111:112- 

134:135- 

157:158- 

180:181- 

203:204- 

227 


1238 


TPR 


TPR Domain 


9.6e-54 


192.0 


10 


22-55:56- 

86:87- 

120:121- 

154:155- 

188:189-

222:223- 

255:290- 

323:328- 

361:362- 

395 


1241 


cadherin 


Cadherin domain 


0.00011 


29.0 


2 


48- 

151:165- 
254 


1243 


TTL 


Tubulin-tyrosine ligase family 


3.6e-31 


117.0 


1 


1-225 


1245 


UCH-1 


Ubiquitin carboxyl-terminal hydrolases 
famil 


4.2e-08 


40.4 


1 


190-221 
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1245 


zf-UBP 


Zn-finger in ubiquitin-hydrolases and
other 


1.2e-05 


31.2


1 


62-148 


1246 


ank 


Ankyrin repeat 


3.8e-92 


319.5 


11 


12-44:45- 

77:79- 

111:112- 

144:145- 

177:179- 

211:212- 

244:245- 

277:278- 

310:312- 

344:345- 

374 


1250 


C1q


C1q domain


0.00033 


-3.8 


1 


827-946 


1250 


filament 


Intermediate filament protein 


1.3 


-187.6 


1 


360-645 


1250 


spectrin 


Spectrin repeat 


1.9 


-12.5 


1 


492-591 


1250 


TNF 


TNF(Tumor Necrosis Factor) family 


4.1 


-20.5 


1 


835-946 


1250 


Apolipopr 
otein 


Apolipoprotein A1/A4/E family 


5.7 


-113.3 


1 


200-469 


1250 


sigma70 


Sigma-70 factor 


9 


-116.2 


1 


462-650 


1252 


laminin_
Nterm 


Laminin N-terminal (Domain VI) 


3.2e-52 


186.9 


1 


1-223 


1252 


laminin E 
GF 


Laminin EGF-like (Domains III and V) 


2e-38 


141.0 


3 


225- 
292:295-
355:358- 
409 


1252 


NTR 


NTR/C345C module 


7.2e-30 


112.6 


1 


479-591 


1252 


Keratin B 
2 


Keratin, high sulfur B2 protein 


6.7 


-81.2


1 


318-451 


1253 


enolase 


Enolase


0.038 


-162.0 


1 


11-136 




HIT 


HIT family 


1.5e-55


198.0 


1 


51-162 


1257 


ank 


Ankyrin repeat 


5.9e-24 


93.0 


4 


39-67:68- 
100:101- 
133:134- 
164 


1258 


lectin c 


Lectin C-type domain 


1.3e-21 


85.2 




51-181 




lectin c 


Lectin C-type domain 


1.3e-21 


85.2 




51-181 


1262 


mito_carr 


Mitochondrial carrier protein 


8.7e-67 


235.3 




7-100:102- 

221:224- 

319 


1263


serpin 


Serpin (serine protease inhibitor) 


8.7e- 
109 


374.8 




87-463 


1264


MHC_I


Class I Histocompatibility antigen,
domains


1.3e-
141 


483.9 




25-203 


1265 


MHC_I


Class I Histocompatibility antigen, 
domains 


2.3e- 
130 


446.5 




25-203 


1266 


MHC_I


Class I Histocompatibility antigen, 
domains 


2.7e- 
117 


403.1 




22-187 


1267 


MHC_I


Class I Histocompatibility antigen,
domains


1.1e-

115 


307 7 
07 1. 1 




25-196 


1268 


MHC_I


Class I Histocompatibility antigen, 
domains 


2.9e- 
118 


406.3 




25-196 


1268 


ig 


Immunoglobulin domain 


9.5e-08 


39.2 




221-286 


1269 


MHC_I


Class I Histocompatibility antigen, 
domains 


4.2e- 
132 


452.3 




25-204 


1271 


MHC_I


Class I Histocompatibility antigen,
domains


1e-144


494.2 




25-203 


1272 


MHC_I


Class I Histocompatibility antigen, 


2.8e-95 


329.9 




25-204 
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domains 










1273 


MHC_I


Class I Histocompatibility antigen, 
domains 


3.3e- 
140 


479.2 


1 


25-203 


1274 


Kelch 


Kelch motif 


1.4e-91 


317.7 


6 


271- 

J J O.J Jo- 

366:368- 
413:415- 

*iOU.*fOZ- 

507:509- 
554 


1274 


BTB 


BTB/POZ domain 


3.1e-38 


140.4 


J 




1275 


IQ 


IQ calmodulin-binding motif


0.0037 


23.9 


1 


394-414 


1276 


Glycos_tr 

ansf_2 


Glycosyl transferase 


2.2e-22 


87.8 


1 


lZO-JUo 


1276 


Ricin_B_l
ectin 


QXW lectin repeat 


0.0045 


23.7 


2 


a no 

478- 
<i Q«<on 

^7 


1280 


ig 


Immunoglobulin domain 


3.4e-05 


3U./ 




oz- 

14«*'174_ 

240 


1281 


UCH-1 


Ubiquitin carboxyl-terminal hydrolases
famil 


1.5e-12 


55.1 


1 


100-131 


1285 


UBX 


UBX domain 


2.9e-22 


87.4 




205-284 


1286 


UBX 


UBX domain 


2.9e-22 


87.4 




251-330 


1287 


DnaJ 


DnaJ domain 


9e-38 


138.9 




8-70 


1288 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


7.9e-09 


42.8 




9-57 


1288 


zf-B_box


B-box zinc finger 


1.8e-06 


34.9 




177-217 


1288 


SPRY 


SPRY domain 


0.0016 


14.8 




479-605 


1288 


zf-UBR1


Putative zinc finger in N-recognin 


2.2 


-18.4 




180-237 


1289 


PKD 


PKD domain 


0 


1026.8 


16 


293- 

372:397- 

483:485- 

568:572- 

677:679- 

763:765- 

846:849- 

931:933- 

1017:1019- 

1099:1101- 

1153. lloD- 

10£G«1771 
JZ07.1Z/1- 

ljjj.ijjj 

1 40 c.i 440- 
1522*1524- 
1612:1614- 
1696 


1289 


REJ 


REJ domain 


290 


zr i o.\J 


1 


1723-2248 


1289


PLAT 


PLAT/LH2 domain


2.6e-25 


97.5 




2673-2789 


1289 


GPS 


Latrophilin/CL-1-like GPS domain


1.1e-15


65.5 




2566-2615 


1289 


lectin c 


Lectin C-type domain 


0.59 


-23.7 




2-87 


1289 


DUF26 


Domain of unknown function DUF26 


7.8 


-16.5 




2220-2262 


1290 


CNH 


CNH domain 


2.7e-24 


94.2 




69-375 


1292 


RhoGAP 


RhoGAP domain 


1.8e-59 


211.0 




125-279 


1293 


Peptidase 
M10 


Matrixin 


8.2e- 
110 


378.2 




48-211


1293 


fn2


Fibronectin type II domain


3.5e-84 


293.1 


3 


230- 

271:288- 
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329:347- 
388 




hemopexi
n


Hemopexin 


1e-33


125.5 


3 


486- 
530:608- 
654:656- 
699 


1294 


Peptidase
M10


Matrixin


8.2e-
110


378.2 


1 


48-211 


1294 


fn2


Fibronectin type II domain 


3.5e-84 


293.1 


3 


230- 
271:288- 
329:347- 
388 


1297 


UQ_con


Ubiquitin-conjugating enzyme 


3.3e-20 


80.6 


1 


400-557 


1297 


TT ORF2 


TT viral ORF2 


3.6 


-92.9 


1 


546-667 


1297 


UQ con 


Ubiquitin-conjugating enzyme 


3.3e-20 


80.6 


1 


400-557 


1297 


TT ORF2 


TT viral ORF2 


3.6 


-92.9 


1 


546-667 


1298 


ig 


Immunoglobulin domain 


2.9e- 
236 


798.3 


21 


25-84:119- 

177:245- 

303:339- 

397:432- 

490:524- 

583:618- 

676:709- 

768:801-

859:894-

954:990- 

1048:1121- 

1179:1217- 

1275:1382- 

1440:1477- 

1535:1569- 

1627:1664- 

1722:1756- 

1816:1851- 

1911:1947- 

2005:2040- 

2098 


1298 


Adeno_E
3_CR1


Adenovirus E3 region protein CR1 


0.062 


-3.7 


1 


1212-1288 


1299 


cNMP_binding


Cyclic nucleotide-binding domain 


6.2e-28 


106.2 


1 


363-459 


1299 




Ion transport protein 


8.9e-21 


82.5 


1 


69-265 


1299 


ATP- 
synt DE 


ATP synthase, Delta/Epsilon chain, 

Inno* 


6.8 


6.0 


1 


478-525 


1301 


RGS 


Regulator of G protein signaling 

domain


1.6e-49 


177.9 


1 


56-172


1302 


THF DH 
G CYH 
C 


Tetrahydrofolate
dehydrogenase/cyclohyd


1e-99


344.7 


1 


60-235 


1303 


gln-synt 


Glutamine synthetase 


1.3e- 
179 


610.1 


1 


1-321 


1304 


gln-synt 


Glutamine synthetase 


0.13 


-214.4 


1 


1-95 


1305 


SCAN 


SCAN domain 


7.5e-61 


215.6 


1


42-137 


1306 


Methyltra
nsf 5 


MraW methylase family 


4.6e-06 


-96.4 


1 


70-205 


1307 


ion trans 


Ion transport protein 


8.3e-18 


72.6 


1


438-672 


1307 


ank 


Ankyrin repeat 


1.9e-06 


34.8 


2 


77- 

108:163- 
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195 


1307 


Srg 


C.elegans Srg family integral 
membrane prot 


8.1 


-222.4 


1 


418-669 


1310 


Ammoniu 
m transp 


Ammonium Transporter Family 


1.9e-56 


200.9 


1 




1310 


FecCD 


FecCD transport family 


0.89 


-200.6 


1 


97-331 


1311 


Kelch 


Kelch motif 


2.6e-60 


213.8 


6 


311- 

jDy.joi- 

411:413- 

43<S.40U- 
DUD.DU /- 
jDO.Jjy- 
DUO 


1311 


BTB 


BTB/POZ domain 


1 Co. OC 


mQ i 


i 
i 


91-1 Rl 


1313 


zf-B_box 


B-box zinc finger 


9.6e-30 


112.2 


2 


92- 

459 


1313 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger)


l.Je-zi 


01 ft 




IJ>- J7. Jt 1 

385 


1313 


SPRY 


SPRY domain 


3.2e-23 


90.6 




672-813 


1313 


PHD 


PHD-finger 


0.97 


-9.8 




14-62 


1313 


NB-ARC 


NB-ARC domain 


9.5 


-151.3 




48-311 


1313 


zf-UBR1


Putative zinc finger in N-recognin 


9.8 


-24.7 




421-470 


1314 


SRCR 


Scavenger receptor cysteine-rich 
domain 


1.7e-25 


98.1 




37-133 


1315 


adh short 


short chain dehydrogenase 


5.8e-33 


122.9 


\ .- 


38-293 


1317 


ANP 


Atrial natriuretic peptide 


1.2e-51 


185.0 




43-150 


1318 


EGF 


EGF-like domain 


3.6e-26 


100.4 


6 


512- 

547:553- 

J7L.J7/" 

634:640- 

/V7 5*747- 

781:787- 
827 


1318 


ig 


Immunoglobulin domain 


Z.. 1 C'UU 


14 7 




170-227 


1318 


TIL 


Trypsin Inhibitor like cysteine rich dom 


1.9 


-6.8 




741-787 


1318 


ldl_recept 
a 


Low-density lipoprotein receptor 
domain 


0.0 


R R 
-o.o 




551-593 


1318 


TILa 


TILa domain 


O.V 


© « 
-o.J 




7H-792 


1321 


FKBP 


FKBP-type peptidyl-prolyl cis-trans 
isomeras 


5.7e-45 


162.8 


_ 


234-336 


1322 


PX 


PX domain 


1 1*» 9*. 
i. ie-zj 


OR 8 

70.0 




^73-382 


1322 


SH3 


SH3 domain 


O. JC" 1Z 


f.l fl 


~~ r 


3-59 


1323 


ras 


Ras family 


5.2e-16 


45.4 




35-218 


1323 


GTP EFT 
U 


Elongation factor Tu GTP binding
domain 


U.D J 


-7^ 1 






1323 


MobB 


Molybdopterin guanine dinucleotide 
synthesis 


A ft 


4*i R 






1 OLD 


c 


Cell division protein


5.7 


-217.6 




36-247 


1324 


pkinase 


Protein kinase domain 


4.6e-51 


183.1 




199-527 


1325 


pkinase 


Protein kinase domain 


4.6e-51 


183.1 




199-527 


1327 


Peptidase 
C1


Papain family cysteine protease 


7.3e- 
110 


378.4 




73-349 


1328 


Peptidase 
C1


Papain family cysteine protease 


7.3e- 
110 


378.4 


1 


114-390 


1330 


ig


Immunoglobulin domain 


4.6e-07 


36.9 


2 


52- 

108:145- 
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201 


1332 


Sec7 


Sec7 domain 


Oe-71 


251.3 


1 


345-536 


1332 


PH 


PH domain


0.25 


16.2 


1 


567-676 


1332 


IQ


IQ calmodulin-binding motif 


0.35 


17.4 


1 


13-33 


1333 


cadherin 


Cadherin domain 


1.6e-94 


327.4 


5 


68- 

159:173- 
268:282- 
386:399- 
490:503- 
600 


1333 


Cadherin_
C term 


Cadherin cytoplasmic region 


1.6e-76 


267.6 


1 


648-819 


1335 


lipocalin 


Lipocalin / cytosolic fatty-acid binding 
pr 


7.5e-35 


129.2 


1 


5-133 


1336 


SH3 


SH3 domain 


2.4e-10 


47.8 


1 


495-549 


1336 


PID 


Phosphotyrosine interaction domain 
(PTB/PID) 


0.29 


-13.3 


1 


52-181 


1336 


SAM PN 
T 


Sterile alpha motif (SAM)/Pointed 
domain 


6 


-25.0 


1 


608-687 


1336 


SH2 


SH2 domain 


8.6 


-25.5 


1 


268-347 


1338 


UCH-2 


Ubiquitin carboxyl-terminal hydrolase 


1.5e-20 


81.7 


1 


350-411 


1338 


UCH-1 


Ubiquitin carboxyl-terminal hydrolases 


1.1e-12


55.5 


1 


111-142 


1338 


TFIIA 


Transcription factor IIA, alpha/beta s 


1.7 


-151.4 


1 


386-698 


1338


Tymo_45
kd 70kd


Tymovirus 45/70Kd protein


3.3 


-291.1 


1 


310-647 


1338 


MARCKS 


MARCKS family 


6.8 


-95.6 


1 


573-865 


1340 


RCC1 


Regulator of chromosome condensation 
(RCC1) 


5.6e-10


46.6 


6 


77- 

136:140- 
192:195- 
245:248- 
298:301- 
356:359- 
406 


1342 


Peptidase 
M1


Peptidase family M1


4.6 


-194.5 




159-470 


1342 


ubiquitin 


Ubiquitin family 


5.4 


-4.4 




538-616 


1345 


crystall 


Beta/Gamma crystallin 


3.3e-38 


140.4 




1-53:61- 
144 


1345 


WD40 


WD domain, G-beta repeat 


6.7e-09 


43.0 




269-305 


1348 


AAA 


ATPase family associated with various 
cellul 


0.97 


-33.9 




131-307 


1349 


AAA 


ATPase family associated with various 
cellul 


0.97 


-33.9 




131-307 


1351 


PHD 


PHD-finger 


6.3e-15 


63.0 




190-237 


1352


PHD 


PHD-finger 


6.3e-15 


63.0 




190-237 


1353 


CUB 


CUB domain


3.3e-13 


57.3 




416-524 


1353 


sushi 


Sushi domain (SCR repeat) 


1.5e-05 


31.9 




357-412 


1355 


SPRY 


SPRY domain 


3.4e-19 


77.2 




396-519 


1355 


fn3


Fibronectin type III domain


1e-08


42.4 




259-345 


1355 


zf-B box 


B-box zinc finger 


8.9e-07 


35.9 




44-86 


1356 


Patatin


Patatin-like phospholipase 


0.59 


-42.1 




10-118 


1358 


ELM2 


ELM2 domain 


3.2e-23 


84.0 




195-256 


1358 


myb_DN
A-binding 


Myb-like DNA-binding domain 


1.1e-09


45.6 




299-345 


1359 


Reprolysi 
n 


Reprolysin (M12B) family zinc metallo 


1.4e-12


55.3 




239-457 


1359 


Pep_M12 
B_propep


Reprolysin family propeptide 


3.6e-12 


53.9 




90-216 
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1359 


tsp_1


Thrombospondin type 1 domain 


3.4e-07 


37.4 


5 


551-
601:829- 
884:888- 
y 44:5*40- 

1002:1007- 
1057 


1359 


EB 


EB module 


8.4 


-11.7 


1 


452-531 


1363 


ank 


Ankyrin repeat 


6.2e-26 


99.6 


5 


72- 

104:105- 
137:138- 
170:180- 
212:222- 
255 


1366 


C2 


C2 domain 


le-75 


264.9 


2 


161- 

247:293- 
Jol 


1367 


C2 


C2 domain 


le-75 


264.9 


2 


161- 

Jol 


1368 


p450 


Cytochrome P450 


5e-110


j/o.y 




Al ^09 


1370 


aa_perme 
ases 


Amino acid permease 


1.3e-08 


-108.1 


i 


49-452 


1370 


Neur_cha
n memb 


Neurotransmitter-gated ion-channel fe- 


0.37 


-96.5 


i 

— 


330-585 


1370 


ion trans 


Ion transport protein


1.4 


-2.3 


J 


zoo-Ol J 


1370 


Transp_cy 
t pur 


Permease for cytosine/purines, uracil 


1.5 


1 TO C 

-1 78.0 




Al AAO 


1370 


Aa trans 


Transmembrane amino acid transporter 


1.9 


-iyu.3 




cn Af\Q 

o/-*ruy 


1370 


DUF140 


Domain of unknown function DUF140 


2.6 


-156.6 


i 


109-312 


1370 


Nucleosid 
e tran 


Nucleoside transporter 


3.2 


-154.9 


1 


JD /-OJO 


1370 


xan_ur_pe 
rmease 


Permease family 


4.1 


-193.2 




56-429 


1370 


DUF6 


Integral membrane protein DUF6 


7.1 


-22.4 




536-671 


1370 


NADHdh 


NADH dehydrogenase 


7.3 


-213.9 




212-616 


1370 


SNF 


Sodium:neurotransmitter symporter
fam 


9.2 


-458.6 




117-450 


1372 


kinesin 


Kinesin motor domain 


2.1e- 
135 


463.2 


1 


53-341 


1372 


Translin 


Translin family 


10 


-82.4 




QIC ACSl 

315-462 


1373 


LRR 


Leucine Rich Repeat 


1.9e-27 


104.6 


7 


6U-oJ.o4- 
1 0R- 

l\J l * x WO 

155:157- 
180:181- 
204:205- 
225 


1373 


ig 


Immunoglobulin domain 


1.2e-05 


32.1 


1 


310-368 




LRRCT


Leucine rich repeat C-terminal domain


4.7e-05 


30.2 




249-294 


1373 


fn3


Fibronectin type III domain


7.1e-05


29.6 




425-505 


1373 


LRRNT 


Leucine rich repeat N-terminal domain 


0.83 


13.7 




27-58 


1374 


kinesin 


Kinesin motor domain 


9.5e-14 


1.2 




1-131 


1375 


zf-DHHC 


DHHC zinc finger domain 


9.1e-34 


125.6 




101-165 


1376 


EGF 


EGF-like domain 


3.2e-45 


163.6 


9 


49-84:90- 

126:132- 

167:177- 

213:217- 

252:286- 
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321:327- 
362:368- 
401:407- 
442 


1376 


CUB 


CUB domain 


9.6e-18 


72.4 


-7— 


809-918 


1376


TIL 


Trypsin Inhibitor like cysteine rich 
domai 


0.73 


-2.0 


1 


84-132 


1376 


Keratin_B 
2 


Keratin, high sulfur B2 protein 


0.9 


-67.6 


1 


111-242 


1376


granulin 


Granulin 


1.1 


-12.1 


1 


285-323 


1376


metalthio 


Metallothionein 


5.9 


-10.3 




363-431


1376 


DUF141 


Domain of unknown function DUF141 


6.7 


-15.1 




799-922 


1380 


ion trans 


Ion transport protein 


0.066 


16.8 




153-338 


1380 


ABC2_m 
embrane 


ABC-2 type transporter 


6.1 


-130.1 


1 


145-334 


1380 


oxidored 
ql 


NADH-Ubiquinone/plastoquinone 


6.2 


-167.5 


1 


46-317 


1380 


OATP C 


Organic Anion Transporter Polypeptide 


9.5 


-236.4 


-j 


75-316 


1381 


pkinase 


Protein kinase domain 


9.7e-80 


278.3 


1 


205-486 


1384 


ig 


Immunoglobulin domain 


0.00034 


27.4 




65-142 


1388 


LRR 


Leucine Rich Repeat 


4.9e-16 


66.7 




20-43:44- 

67:68- 

91:92-115 


1388 


LRRCT 


Leucine rich repeat C-terminal domain 


7.4e-09 


42.9 




125-175


1388 


GPS 


Latrophilin/CL-1-like GPS domain


0.0041 


20.8 


1 


641-693 


1388 


HRM 


Hormone receptor domain 


0.0076 


16.3 




285-354 


1388 


7tm_2


7 transmembrane receptor (Secretin 
family) 


0.01 


-96.0 




704-981 


1388 


ig 


Immunoglobulin domain 


3.3 


10.9 




196-265 


1389 


MACPF 


MAC/Perforin domain 


0.016 


-71.3 




30-333 


1391 


HD 


HD domain 


8.3e-07 


36.1 




32-127 


1392 


efhand 


EF hand


1.5e-05 


31.8 


2 


1-25:33-61 


1394 


MORN 


MORN repeat 


1.1e-32


122.1 


7 


39-61:62- 

85:86- 

108:109-

131:132- 
154:155- 
177:178- 
200 


1395 


MORN 


MORN repeat 


3.1e-31 


117.2 


6 


39-61:62- 

85:86- 

108:143- 

165:166- 

188:189- 

211


1396 


EPH_lbd


Ephrin receptor ligand binding domain 


4.6e- 
135 


462.1 


1 


31-204 


1396 


pkinase 


Protein kinase domain 


8e-74 


258.7 


1 


635-892 


1396 


fn3


Fibronectin type III domain


z.oe-jj 


117.5 


2 


329- 

425:437- 

524 


1396 


SAM 


SAM domain (Sterile alpha motif) 


2.5e-21 


84.3 


1 


928-992 


1396 


PHD 


PHD-finger 


3.7 


-15.1 


1


259-296 


1397 


Exo_endo 
_phos


Endonuclease/Exonuclease/phosphatase 
fa 


0.34 


-11.2 


1 


357-643 


1397 


sugar tr 


Sugar (and other) transporter 


8.8 


-196.7 


1 


7-343 


1397 


Exo_endo 
_phos 


Endonuclease/Exonuclease/phosphatase
fa 


0.34 


-11.2 


1 


357-643 
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Donate 

Repeats


Position


1397 


sugar tr 


Sugar (and other) transporter 


ft ft 

o.o 




i 
i 




1401 


homeobox


Homeobox domain 


n no7 


-O.J 


i 
l 




1403 


Tropomyo 
sin 


Tropomyosin 






i 

X 




1403 


filament 


Intermediate filament protein 


2.6 


-195.5 


1 


412-760 


1403 


UvrD- 
helicase 


UvrD/REP helicase 


Q 

o 


O/tO A 


1 

1 




1403 


Myosin t 
ail 


Myosin tail 


8.9 


-551.2 


1 


20-777 


1404 


C1q


C1q domain




-J.o 


i 

X 


ft77 QAA 


1404 


filament 


Intermediate filament protein 


i i 
1.3 


-lo/.O 


1 
1 




1404 


spectrin 


Spectrin repeat 


l.y 




1 
I 




1404 


TNF 


TNF(Tumor Necrosis Factor) family 


4.1 


-20.5 


1 


835-946 


1404 


Apolipopr 
otein 


Apolipoprotein A1/A4/E family 


5.7 


-113.3 


l 


200-469 


1404 


sigma70 


Sigma-70 factor 


9 


-116.2 


l 


462-650 


1405 


Cache 


Cache domain 


6.5e-12 


53.0 


2 


402- 

481:721- 

813 


1406 


ASC 


Amiloride-sensitive sodium channel 


2e-125 


430.1 




159-579 


1407 


pkinase 


Protein kinase domain 


0.22 


-115.5 


l 


5-217 


1408 


PBP 


Phosphatidylethanolamine-binding 
protein 


1.8e-71 


250.9 


I 


1-167 


1410 


abhydrola 

se


alpha/beta hydrolase fold 


1.4 


-12.1 


l 


75-318 


1412 


rrm 


RNA recognition motif. 


2.7e-12 


54.3 


3 


259- 
329:360- 
433:477- 
550 


1414 


DEP 


Domain found in Dishevelled, Egl-10, 
and Ple


3.5e-32 


120.3 


2 


173- 

247:275- 
349 


1414 


PH 


PH domain 


le-09 


45.7 


I 


29-144 


1414 


PDZ 


PDZ domain (Also known as DHR or 
GLGF) 


0.013 


18.7 


2 


375- 

A CCA Cf\ 

4:> 6:460- 
531


1418 


SCAN 


SCAN domain 


2.6e-51 


183.9 


l 


36-131 


1418 


zf-C2H2 


Zinc finger, C2H2 type 


4.6e-25 


96.7 


5 


406- 

428:435- 

457:463- 

485:522- 

545:553- 

575 


1418 


KRAB 


KRAB box 


0.023 


14.2 


l 


220-260 


1418 


zf-BED 


BED zinc finger 


10 


-7.4 


l 


438-486


1419 


EGF 


EGF-like domain 


5.1e-28 


106.5 


9 


13-40:45- 

72:77- 

104:109- 

136:141- 

168:173- 

200:205- 

232:237- 

264:269- 

296 


1419 


Keratin_B 
2 


Keratin, high sulfur B2 protein 


1.2 


-69.3 


l 


100-251 


1419 


Gamma- 
thionin 


Gamma-thionins family 


4.8 


-8.7 


I 


124-163 
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1421 


Ferric_red
uct 


Ferric reductase like transmembrane 
com 


2.8e-64 


227.0 


1 


63-564 


1421 


NAD_bin 
ding 


Oxidoreductase NAD-binding domain


4 


-34.1 


1 


381-551 


1421 


FAD_bin 
ding_6


Oxidoreductase FAD-binding domain 


5 


-28.7 


1 


245-335 




rubredoxi 
n 


Rubredoxin 


6.9 


-11.0 


1 


409-436


1422 


zf-C2H2 


Zinc finger, C2H2 type 


5e-12 


53.4 


2 


1057- 

1079:1085- 
1109 


1422 


zf-TRAF 


TRAF-type zinc finger


5 


-6.6 


1 


1056-1101 


1422 


Tymo_45 
kd 70kd 


Tymovirus 45/70Kd protein 


6 


-298.2 


1 


343-700 


1422 


zf-BED 


BED zinc finger 


7.1 


-6.0 


1 


1070-1110


1423 


7tm 5 


7TM chemoreceptor 


3.3 


-162.5 


1 


181-451 


1423 


Cytidylylt 
rans 


Phosphatidate cytidylyltransferase 


5 


-87.9 


1 


21-135 


1424 


CAP GL 
Y 


CAP-Gly domain 


7.2e-46 


165.8 


2 


196- 

238:398- 
440 


1424 


ank 


Ankyrin repeat 


9.2e-09 


42.5 


3 


1-40:42-
76:79-111


1425 


PAP2 


PAP2 superfamily 


1.5e-08 


41.8 


1 


166-313


1426 


SCAN 


SCAN domain 


6.2e-70 


245.7 


1 


33-128 


1426 


zf-C2H2 


Zinc finger, C2H2 type


1.5e-44 


161.4 


6 


239- 

261:267-

289:295- 

317:323- 

345:351- 

373:379- 

401 


1426 


zf-BED 


BED zinc finger 


0.67 


3.1 


1 


280-318 


1426 


DC1 


DC1 domain 


5.1 


3.6 


1 


295-338 


1426 


zf-C4 


Zinc finger, C4 type (two domains) 


9.7 


-55.1 


1 


323-364 


1427 


xan_ur_pe 
rmease 


Permease family


7.1 


-199.5 


1 


104-453 


1428 


LRR 


Leucine Rich Repeat 


2.3e-16 


67.8 


4 


80- 

103:104- 
127:128- 
151:152- 
175 


1428 




Leucine rich repeat C-terminal domain 


0.00079 


26.2 




185-234 


1431 


PH 


PH domain


7.6e-15 


62.8 




19-117 




PH


PH domain 


6.4e-21 


82.9 




55-153 


1434 


filament


Intermediate filament protein 


2.9 


-196.8 




128-488 


1434 


K-box 


K-box region 


4.1 


-38.4 




277-357 


1434 


OspD 


Borrelia outer surface protein D 


5.3 


-69.3 




151-409 


1434 


Apolipopr 
otein 


Apolipoprotein A1/A4/E family


5.7 


-113.3 




56-318 


1437 


RhoGAP 


RhoGAP domain 


5.7e-57 


202.7 




1152-1305 


1437 


PH 


PH domain 


3.5e-18 


73.8 




922-1030 


1437 


PDZ 


PDZ domain (Also known as DHR or 
GLGF) 


0.0013 


25.4 




28-128 


1441 


annexin 


Annexin 


4.1e- 
109 


375.9 


4 


18-79:80- 
135:151- 
219:227- 
294 
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SEQ ID 


Model 


Description 


E-value






Position 


1443 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger)


ft ftA7< 


18 7 

1 o. / 




1256-1307 


1443 


PHD 


PHD-finger 


0.18 


-2.9 


1 


1255-1310 


1443 


WD40 


WD domain, G-beta repeat 


7 Q 




— 


189-225 


1443 


K tetra 


K+ channel tetramerisation domain 


J.J 


-1Q ^ 


- — : 




1443 


Clathrin 


Region in Clathrin and VPS 


Q O 

o.y 


9^ 7 


— 




1444 


RIIa


Regulatory subunit of type II PKA R-s 


a o i 
0.31 


in s 
lU.o 


— 


17-S4 


1444 


Alpha_ad 
aptin_C 


Alpha adaptin AP2, C-terminal domain 


7 


£1 A 


— ' 


01-1 55 


1445 


TK 


Thymidine kinase 


1 /to Oft 






61-231 


1446 


GDPD 


Glycerophosphoryl diester phosphodies 


r At* ftfi 

D.4e-Uo 


Aft ft 
nU.v 


-j 




154-401 


1446 


Glycos_tr 
ansf 4 


Glycosyl transferase


A Q 

4.9 


87 & 




99-145 


1447 


adh short 


short chain dehydrogenase 


f\ no 

0.9 o 


-yi.o 






1448 


K-box 


K-box region 


0.64 


-28.5 


i 


61-126 


1449 


PHD 


PHD-finger 


A A1 

0.01 


o.4 


J 




1452 


rrm 


RNA recognition motif. 


A C 1 A 

4.5 e- 19 


/O.o 


1 


77 1 


1454 


rvt 


Reverse transcriptase 


3.6e-34 






OQC C7A 


1454 


Gag MA 


Matrix protein (MA), p15


0.0018 


-21.0 


i 


10-131 


1454 


Gag p30 


Gag P30 core shell protein 


0.54 


-80.3




oil ion 
Zl 1-3VU 


1458 


COX5A 


Cytochrome c oxidase subunit Va 


1.2e-55 


198.3 


l 


42-131 


1459 


Guanylate 
kin 


Guanylate kinase 


6.2e-38 


139.4 




etc fS*A 

515-624 


1459 


PDZ 


PDZ domain (Also known as DHR or 
GLGF) 


6.8e-11


49.6 


1 


ac/T IOC 

256-335 


1459 


SH3 


SH3 domain 


0.027 


5.9 




1AQ A 1 C 

J48-4 1 j 


1459 


L27 


L27 domain 


0.049 


20.1 


i 


186-238 


1459 


Caulimo_ 
mov 


Caulimovirus movement protein 


7.1 


-185.3 


* 


420-673 


1459 


A_deamin 
ase 


Adenosine/AMP deaminase 


7.8 


-138.5 


i 


64-421 


1461 


hexokinas 
e 


Hexokinase 


4.3e- 
284 


957.2 


i 


53-499 


1463 


Occludin 


Occludin/ELL family 


6.3 


-249.1 


i 


33-394 j 


1464 


trypsin 


Trypsin 


4.7e-72 


252.8 




30-ziz 


1466 


DDHD 


DDHD domain 


8.6e- 
117 


401.4 


i 




613-860 


1466 


DUF203 


Domain of unknown function 


8.7 


/CO © 

-oy.o 


J 


7^/l_zl.Aft 
ZO4-*f0U 


1467 


Glycos_tr
ansf_1


Glycosyl transferases group 1


1.8e-27 


in/1 n 

104.7 


1 


ZoO-4 /U 


1468 


EMP24_ 
GP25L 


emp24/gp25L/p24 family 


3.5e-70 


246.6 


i 


5-183 


1469 


EMP24 
GP25L 


emp24/gp25L/p24 family 


3.5e-81 


283.1 


1 


5-208 


1470 


14-3-3 


14-3-3 protein 


142 


4oO.D 




5-94Q 

J-Z*ry 


1471 


filament 


Intermediate filament protein 


0.53 


-177.6 


i 


2-249 


1471 


spectrin 


Spectrin repeat 


7.1 


1 A 1 

-19.1 




a/l ion 


1472 


MtN3 slv 


MtN3/saliva family 


5.4 


-31.6 




35-139 


1472 


ATP- 
synt_A 


ATP synthase A chain


7 4 


-80.2 




91-214 


1474 


zf-CCCH 


Zinc finger C-x8-C-x5-C-x3-H type 


4.3e-07 


37.0 


2 


531- 

557:663- 

688 


1474 


zf-C2H2 


Zinc finger, C2H2 type 


6.3 


11.3 


2 


205- 

229:618- 

642 


1475 


SpoU_me
thylase 


SpoU rRNA Methylase family 


2.1e-27 


104.5 


1 


145-301 





1476


filament 


Intermediate filament protein 


2.4 


-194.5 


1 


427-720 


1476 


Tropomyo 
sin 


Tropomyosin 


3 


-111.7 


1 


539-761 


1476


spectrin 


Spectrin repeat 


3.3


-15.3 


1 


427-526 


1476


K-box


K-box region 


7.1 


-41.3 


1 


248-335 


1476


Borrelia_
orfA


Borrelia ORF-A


9.8 


-102.9 


1 


440-736 


1477


MMR_HS
R1


GTPase of unknown function 


1.5e-90


314.2 


1 


178-521 


1477


DUF258


Protein of unknown function, DUF258 


9.6 


-84.6 


1 


343-465 


1479


RNase_P
H


3' exoribonuclease family 


1.2e-96 


334.5 


2 


48-

251:358- 

581 


1479 


S1


S1 RNA binding domain


0.057 


19.9 


1 


675-750 


1479 


KH- 
domain


KH domain 


0.35 


9.3 


1 


609-651 


1482


COLFI


Fibrillar collagen C-terminal domain 


5.8e-29 


109.6 


1 


97-277 


1482 


Collagen 


Collagen triple helix repeat (20 copies) 


8.5e-05 


29.4 


1 


2-61 


1483 


COLFI 


Fibrillar collagen C-terminal domain 


1.6e-35


131.4


1


110-293 


1483


Collagen 


Collagen triple helix repeat (20 copies) 


8.5e-05 


29.4 


1 


2-61 


1484 


CH 


Calponin homology (CH) domain 


5.6e-14


59.9 


1 


4-104 


1485 


MoaE 


MoaE protein 


7 


-55.7 


1 


21-96 


1486 


zf-C2H2 


Zinc finger, C2H2 type 


6.8 


10.9 


2 


871- 

896:904-
929 


1487 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


2.6 


-3.3 


1 


336-377 


1489 


FH2 


Formin Homology 2 Domain


0.00017 


-49.7 


1 


3-329 


1490


AAA 


ATPase family associated with various 


4e-45 


163.3 


1 


370-565 


1490 


SKI 


Shikimate kinase 


0.068 


-52.5 


1 


369-506 


1490 


Viral_heli
case1


Viral (Superfamily 1) RNA helicase 


1.8 


-67.0 


1 


371-563 


1490 


LON 


ATP-dependent protease La (LON) 
domai 


3.6 


-69.4 


1 


12-220 




Tropomod 
ulin 


Tropomodulin 


4.1e-78 


272.9 


1 


34-402 


1491


WH2 


WH2 motif 


0.83 


16.1 


1 


534-553 


1491


pkinase 


Protein kinase domain 


5.9 


-136.0 


1 


334-538 


1494 


xan_ur_pe
rmease 


Permease family 


2.9 


-189.3 


1 


72-377 


1494


Na_sulph 
_symp 


Sodium:sulfate symporter 
transmembran 


5.3 


-356.1 


1 


212-541 


1494


Glycos_tr
ansf_4


Glycosyl transferase 


7.3 


-90.3 


1 


374-528 


1494 


STE3 


Pheromone A receptor 


7.5 


-203.9 


1 


314-603 


1494


DUF221


Domain of unknown function DUF221 


9.6 


-234.2 


1 


196-576 


1494 


7tm_5


7TM chemoreceptor


9.7 


-171.0 


1 


122-365 


1494


oxidored_
q1


NADH-Ubiquinone/plastoquinone


9.7 


-171.5 


1 


42-264 


1495 


lectin c 


Lectin C-type domain 


4.7e-14


IZO.D 


1 


53-164 


1496 


Cytochro 
me B561 


Cytochrome b561 


2.1e- 
113 


390.2 


1 


1-240 


1498 


Hydrolase 


haloacid dehalogenase-like hydrolase 


0.0045 


16.9 


1 


31-443 


1498 


Cation_A 
TPase_C


Cation transporting ATPase, C-terminu 


0.26 


-25.8 


1 


535-706 


1498 


oxidored 
q4 


NADH-ubiquinone/plastoquinone 
oxidore 


4.4 


-34.0 


1 


631-705 


1499 


DEAD


DEAD/DEAH box helicase 


3e-64 


226.9 


1 


139-356 


1499 


helicase 


Helicase conserved C-terminal domain 


4.1e-32 


120.1 


1 


447-518 












1502 


C2 


C2 domain 


1.3e-51 


184.9 


2 


2o- 

109:184- 
264 


1503 


ank 


Ankyrin repeat 


1.6e-31 


118.2 


6 


743- 

775:776- 

807:808- 

840:842- 

874:875- 

915:916- 

948 


1504 


ank 


Ankyrin repeat 


1.4e- 
167 


570.1 


16


57-89:90- 

122:123- 

155:210- 

242:243- 

275:276- 

308:363- 

398:399- 

431:432- 

464:525- 

557:558- 

590:591- 

623:644- 

O/j.O/o- 

Tt c\^'^^ i 

/1U. /! l- 
116 


1504 


SAM 


SAM domain (Sterile alpha motif)


i no 




i 


872-934 


1504 


PARP 


Poly(ADP-ribose) polymerase catalytic 
domai 


0.022 


-59.4 


1 


954-1161 


1504 


3Beta_HS 
D 


3-beta hydroxysteroid 
dehydrogenase/isomera 


1.3 


-204.0 


1 


208-536 


1505 


PHD 


PHD-finger 


0.26 


-4.5 


1 


132-191 


1505 


DC1 


DC1 domain 


2.8 


5.8 


1 


131-159 


1506 


PHD 


PHD-finger 


0.26 


-4.5 


1 


156-215 


1506 


DC1 


DC1 domain 


2.8 


5.8 


1 


155-183 


1507 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


3.6e-08 


40.6 


1 


224-261 


1507 


LON 


ATP-dependent protease La (LON) 
domain 


0.007


-21.6 


1 


305-510 


1507 


TPR 


TPR Domain 


1.6 


13.6 


2 


41-74:75- 
108 


1508 


ig 


Immunoglobulin domain 


1e-76


268.3 


12 


78- 

131:171- 
245:276- 
330:364- 

516:552- 

OX J .0 J**' 

705:740- 

797:828- 

880:914- 

981:1012- 

1067:1101- 

1169 


1512 


FYVE 


FYVE zinc finger 


3.2e-14 


60.7 


1 


152-261 


1512 


HypA 


Hydrogenase expression/synthesis 
hypA 


0.81 


-51.2 


1 


97-194 


1512 


RNA PO 
L M 15K 


RNA polymerases M/15 Kd subunit 


2.1 


-1.7 


1 


160-212 












1516 


Gelsolin 


Gelsolin repeat 


0.76 


5.9 




1011-1052 


1518 


V1R 


Vomeronasal organ pheromone 
receptor family 


2.3e-11


41.1 




73-338 


1518 


7tm_1


7 transmembrane receptor (rhodopsin 
family) 


0.044 


-69.7 


1 


62-325 


1520 


SPRY 


SPRY domain 


5.2e-24 


93.2 


1 


95-230 


1521 


Tropomod 
ulin 


Tropomodulin


0.065 


-128.4 


1 


214-495 


1521 


LRR 


Leucine Rich Repeat 


1.8 


15.0 


4 


348- 

371:376- 

403:404- 

427:432- 

456 


1522 


zf-CCHC 


Zinc knuckle 


0.051 


15.9 


1 


13-30 


1523 


Skp1


Skp1 family


6.3e-10 


46.4 


1 


17-80


1524 


RhoGAP 


RhoGAP domain 


4.2e-31 


116.8 


1 


125-285 


1525 


UQ con 


Ubiquitin-conjugating enzyme 


3.4e-39 


144.9 


1 


1-126 


1527 


LRR 


Leucine Rich Repeat 


1.6e-35 


131.4 


9 


86- 

109:110- 
133:134- 
157:158- 
181:182- 
205:206- 
229:230- 
251:254- 
277:279- 
302 


1527 


LRRNT 


Leucine rich repeat N-terminal domain 


6.6e-06 


33.1 


1 


33-60 


1527 


LRRCT 


Leucine rich repeat C-terminal domain 


0.048 


17.9 


1 


312-362 


1528 


K tetra 


K+ channel tetramerisation domain 


0.0016 


-5.0 


1 


117-220 


1529 


MORN 


MORN repeat 


1.8e-24 


94.7 


7 


1049- 

1071:1072- 
1094:1100- 
1122:1123- 
1143:1151- 
1171:1198- 
1220:1221- 
1244 


1529 


VPS9 


Vacuolar sorting protein 9 (VPS9) 
domain 


8.3e-06 


32.7 


1 


1551-1656 


1529 


RCC1 


Regulator of chromosome condensation 
(RCC1) 


8.5e-06 


32.7


3 


168- 

216:527- 
574:579- 
625 


1529


RhoGEF


RhoGEF domain 


0.097 


-40.8 


1 


694-884 


1529


PH


PH domain 


0.23 


16.5 


1 


901-1005 


1530 


profilin 


Profilin 


1.4e-63 


224.6 


1 


3-135 


1531 


profilin 


Profilin 


7 A a. AQ 


172.4


1 


3-119 


1532 


60s_ribos
omal 


60s Acidic ribosomal protein 


3.3 


-34.2 


1 


39-153 


1533 


jmjC 


jmjC domain 


0.01 


-0.1 


1 


1-50 


1533 


PHD 


PHD-finger


0.042 


2.9 


2 


508- 

549:609- 

655 


1534 


kinesin


Kinesin motor domain 


3.5e-64 


226.7 


1 


1-177 


1536 


aminotran
_3


Aminotransferase class-III


7.8e-42 


152.4 


1 


1-373 


1536 


LEA 


Late embryogenesis abundant protein 


5.6 


-4.7 


1 


109-177 


1539


Hydrolase 


haloacid dehalogenase-like hydrolase 


0.0064 


15.3 


1 


264-685 


1539 


Cation_A 
TPase C 


Cation transporting ATPase, C-terminu 


4.6 


-46.1 


1 


784-916 


1541 


TPR 


TPR Domain 


0.00036 


27.3 


2 


135- 

168:204- 
237 


1542 


PCMT 


Protein-L-isoaspartate(D-aspartate) O-
methyl 


1.2e-11


21.8 


1 


9-224 


1543 


Peptidase 
C54 


Peptidase family C54 


3.1e-58 


206.9 


1 


76-364 


1545 


homeobox 


Homeobox domain 


4.8e-26 


100.0 


1 


233-286 


1546 


zf-C2H2 


Zinc finger, C2H2 type 


3.1e-85 


296.6 


14 


14-36:42- 

64:70- 

92:99- 

122:128- 

150:163- 

185:199- 

221:227- 

249:255- 

277:283- 

305:311- 

333:339- 

361:367- 

389:395- 

417 


1546 


TFIIS 


Transcription factor S-II (TFIIS) 


1.9 


-1.0 


1 


202-237 


1546 


zf-BED 


BED zinc finger 


2.3 


-1.7 


1 


324-362 


1546 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


4.6 


-5.4 


1 


341-372 


1547 


Ribosoma 
1 S5 


Ribosomal protein S5 


3.4e-09 


44.0 


1 


222-352 


1548 


IQ 


IQ calmodulin-binding motif 


1e-24


95.5 


5 


748- 

768:771- 

791:794- 

814:935- 

955:958- 

978 


1552 


DUF6 


Integral membrane protein DUF6 


0.14 


6.6 


1 


150-279 


1552 


SBF 


Sodium Bile acid symporter family 


9.2 


-75.2 


1 


143-321 


1553 


zf-C2H2 


Zinc finger, C2H2 type 


2.1e-05 


31.4 


3 


80- 

105:107- 
130:144- 
169 


1554 


F-box 


F-box domain 


7.7e-05 


29.5 


1 


4-52 


1555 


Ran_BP1


RanBP1 domain


1.1e-88


308.0 


1 


37-161 


1555 


WH1 


WH1 domain


6.8 


-26.8 


1 


45-159 


1556 


actin 


Actin 


8.4e- 
151 


514.4 


1 


1-372 


1557 


GTP EFT 
U 


Elongation factor Tu GTP binding 
doma 


yJ 


-7 J. O 


i 
i 


y i-jd j 


1557 


Defensin_ 
propep 


Defensin propeptide 


9.8 


-11.4 


1 


1-50 


1559 


GTP EFT 
U 


Elongation factor Tu GTP binding 
domain 


1.5e-11


51.8 


1 


125-348 


1559 


GTP EFT 
U D3 


Elongation factor Tu C-terminal 
domain 


8.1e-07


33.1 


1 


451-541 


1559 


GTP EFT 
U D2 


Elongation factor Tu domain 2 


1e-06


35.8 


1 


363-446 





1559 


ATP-bind 


Conserved hypothetical ATP binding 
protei 


8.5 


-132.9 


1 


126-312 


1559 


dynamin 


Dynamin family 


8.7 


-85.0 




110-278


1567 


carb_anhy
drase 


Eukaryotic-type carbonic anhydrase 


4.6e- 
170 


578.4 




5-241 


1568 


An_perox 
idase 


Animal haem peroxidase 


8.1e- 
164 


557.6


1


144-683 


1568 


DUF37 


Domain of unknown function DUF37 


6.5 


-36.0 


1 


462-518 


1569 


DAO 


FAD dependent oxidoreductase 


0.055 


-90.8 




49-381 


1571 


CH 


Calponin homology (CH) domain 


1.4e-25


98.4 


1 


126-233 


1573 


NUDIX 


MutT-like domain 


5.5e-12 


53.3 




96-221


1574 


HECT


HECT-domain (ubiquitin-transferase)


4.3e-16 


66.9 


1 


281-573 


1575 


ig 


Immunoglobulin domain 


2.8 


11.5 




122-1 fl7 


1577 


7tm 1 


7 transmembrane receptor 


3.4 


-113.2 




42-246 


1577 


Bac_rhod 
opsin 


Bacteriorhodopsin


9.6 


-139.9 




111-313 


1578 


fn3 


Fibronectin type III domain


021 


11 2 




121-21 1 


1579 


Acetyltran 
sf 


Acetyltransferase (GNAT) family


2.4e-14 


61.1 


_ 




1580 


Acetyltran 
sf 


Acetyltransferase (GNAT) family


2.4e-14 


61.1 






1580 


Acetyltran 
sf 


Acetyltransferase (GNAT) family 


2.4e-14 


61.1






1582 


GIDA


Glucose inhibited division protein A 


0.0017 


-414.5 


i 


68-196 


1583 


efhand 


EF hand 


1.2 


It. 7 






1585 


Dynein h 
eavy 


Dynein heavy chain 


3.6e-18 


Vi. I 


_ 




1587 


Sm 


Sm protein 


2e-07 


38.1 




4"?- 124 


1588 


PDZ 


PDZ domain (Also known as DHR or 
GLGF) 


3.1e-15 


64.1 




3-83 


1590 


MAPI L 
C3 


Microtubule associated protein 1A/1B, 
light 


0.04 


-35.1 




99-187 


1591 


Syntaxin 


Syntaxin 


2.3e-09 


38.1 


_ 


1-266 


1591 


synaptobr 
evin 


Synaptobrevin 


5.8 


-14.5 


. 


1 R4-772 


1591 


DUF148


Domain of unknown function DUF148 


7.7 


-38.1 




17-129 


1592 


laminin E 
GF 


Laminin EGF-like (Domains III and V) 


1.2


-4.2 




153-196 


1592 


EGF 


EGF-like domain 


2.4 


10.9 


3 


140- 
177:284- 
313:351- 
380 


1592 


metalthio 


Metallothionein 


4.8 


-9.3 


1 


288-348 


1593 


DnaJ 


DnaJ domain 


3.4e-40 


146.9 


1 


3-69 


1594 


HMG_bo 

X 


HMG (high mobility group) box 


2.6e-27 


104.1 


1 


346-414 


1598 


HMG_bo

X 


HMG (high mobility group) box 


3.4e-30 


113.8 


1 


45-113 


1600 


CUB 


CUB domain 


6.9 


-43.0 


1 


224-313 


1601 


DUF6 


Integral membrane protein DUF6 


1.1e-12


55.6 


2 


113- 

238:266- 
390 


1601 


sugar_tr


Sugar (and other) transporter 


5.7 


-191.4 


1 


55-405 


1601 


oxidored 
ql 


NADH-Ubiquinone/plastoquinone 
(complex I) 


6 


-167.3 


1


131-389 


1602 



FF 


FF domain 


2.3e-33 


124.3 


5 


272- 

321:339- 

388:406- 
461:486-
541:622- 
673 


1602 


WW 


WW domain 


4.3e-20 


80.2 


2 


oo 
85- 

11*7-100 

158 


1603 


FF 


FF domain 


2.3e-33 




D 


4. J J- 

284:302- 

J31.J07- 
Jlrr.JOJ" 

636 


1603 


WW 


WW domain 


1.1e-09


45.6 


1 


92-121 


1605 


AT hook 


AT hook motif 


1.9 


11.9 


1 


360-372 


1606 


aminotran 
5 


Aminotransferase class-V 


1.3e- 
127 


437.3 


1 


37-377 


1607 


aminotran 
5 


Aminotransferase class-V 


3.5e-94 


326.3 


1 


37-331 


1611 


Granin 


Granin (chromogranin or secretogranin) 


6.6 


-185.2 


1 


125-609 


1612 


PHD 


PHD-finger 


0.59 


-7.8 


1 


551-610 


1613 


Branch 


Core-2/I-Branching enzyme 


le-77 


271.6 


1 


46-313 


1614 


mbt 


mbt repeat 


3.2e- 
101 


349.7 


4 


78- 

153:192- 
265:304- 
381:412- 
486 


1614 


SAM_PN 
T 


Sterile alpha motif (SAM)/Pointed 
domain 


0.0021 


6.9 


1 


809-888 


1614 


SAM 


SAM domain (Sterile alpha motif) 


0.023 


20.6 


1 


822-885 


1615 


UPF0103 


Protein of unknown function DUF52 


4.7e-64 


226.2 


1 


n OTA 

y-270 


1616 


C2 


C2 domain 


6.8e-36 


132.7 


2 


ol/o- 


1617 


C2 


C2 domain 


7.8e-35 


129.2 


2 


87- 

176:236- 
323 


1618 


C2 


C2 domain 


U. 1 u 




i 
i 




1619 


7tm_1


7 transmembrane receptor (rhodopsin 
family) 


3.9e-20 


80.3 


1 


217-427 


1620 


K tetra 


K+ channel tetramerisation domain 


Ue-25 


98.7 


1 


3-101 


1620 


BTB 


BTB/POZ domain 


y 




i 


01 1 f\A 
Z 1 - 1 Uf 


1623 


cyclin 


Cyclin, N-terminal domain


0.057 


-1.4 


1 


46-149 


1624 


zf-C2H2 


Zinc finger, C2H2 type 



o i a 


n- o 
O.O 


A 

4 


3/1 ^0«01 
340/. /I- 

01 »1 19 

134:143- 


1627 


PDZ 


PDZ domain (Also known as DHR or
GLGF)


4.ze-io 




1 






MttA Hcf 
106 


mttA/Hcf106 family


2.5 


-14.8 


1 


54-107 


1628 


Tropomyo 
sin 


Tropomyosin 


4 


-114.1 


1 


74-272 


1628 


Syntaxin 


Syntaxin 


5.7 


-103.5 


1 


82-402 


1628 


PI3_PI4_
kinase 


Phosphatidylinositol 3- and 4-kinase 


8.5 


-118.0 


1 


125-342 


1628 


HlyD 


HlyD family secretion protein 


9.4 


-64.2 


1 


129-400 


1628 


UPF0089 


Uncharacterised protein family 


10 


-132.7 


1 


150-338 


1629 


rrm 


RNA recognition motif. 


1.7e-47 


171.2 


2 


72- 
142:156-
226 


1632 


Ribosoma 
1 L10 


Ribosomal protein L10 


4.2 


-31.6 


1 


70-173 


1632 


Ribosoma 
I L10 


Ribosomal protein L10 


4.2 


-31.6 


1 


70-173 


1635 


death 


Death domain 


1.5 


2.0 


1


261-348


1636 


death 


Death domain 


1.5 


2.0 


1 


73-160 


1637 


Gelsolin 


Gelsolin repeat 



4.1e-92 


319.4 


6 


27-76:148- 

188:265- 

307:398- 

451:523- 

564:626- 

668 


1639 


TBC 


TBC domain 


2e-08 


15.3 


1 


98-293 


1640 


TBC 


TBC domain 


2e-07 


1.3 


1 


98-297 


1641 


homeobox 


Homeobox domain 


0.0097 


7.0 


1 


83-135 


1646 


transmem 
brane4 


Tetraspanin family 


1.6e-75 


264.3 


1


18-264 


1652 


LacY_sy 
mp 


LacY proton/sugar symporter 


4.9 


-335.5 


1


66-299 


1652 


oxidored 

qi 


NADH-Ubiquinone/plastoquinone 
(complex I) 


7.6 


-169.3 


1 


53-279 


1653 


Colipase 
C 


Colipase, C-terminal domain 


4.3e-24 


93.5 


1 


21-65 


1654 


SH2 


SH2 domain 


0.0091 


3.5 


1 


1-83 


1658 


SUI1


Translation initiation factor SUI1 


4.4e-46 


166.5 




50-149 


1661 


jmjC 


jmjC domain 


0.00052 


13.9 




308-415 


1663 


SH3 


SH3 domain 


0.011 


9.5 




332-388 


1664 


UBA 


UBA/TS-N domain 1 


6.7e-06


33.0 




194-233 


1664 


TUDOR 


Tudor domain 


0.2 


-0.7 




506-627 


1665 


Ribosoma 
1 S21 


Ribosomal protein S21 


0.0039 


11.7 




10-62 


1666 


transmem 
brane4 


Tetraspanin family 


2.2e-71 


250.6 




23-264 


1669 


RuBisCO 
small 


Ribulose bisphosphate carboxylase, 
smal 


2.8 


-52.1 




354-430 


1671 


LRR 


Leucine Rich Repeat 


2e-50 


180.9 


12 


29-47:48- 
71:72-

95:96-

118:119- 

142:143- 

166:167- 

189:190- 

213:214- 

235:236- 

259:260- 

283:284- 

307 


1671 


7tm 1 


7 transmembrane receptor 


0.0032 


-43.2 




439-688 


1671 


Cytidylylt 
rans 


Phosphatidate cytidylyltransferase 


7.1 


-89.9 




520-617 


1671 


oxidored 
ql 


NADH-Ubiquinone/plastoquinone 


9.7 


-171.5 




475-685 


1671 


MerC 


MerC mercury resistance protein


9.8 


-87.5 




534-632 


1672 


UCH-2 


Ubiquitin carboxyl-terminal hydrolase 
family 


4e-28


106.9 




292-364 


1672 


UCH-1 


Ubiquitin carboxyl-terminal hydrolases 


1.9e-14


61.4 


1 


35-66 
famil










1673 


F-box 


F-box domain 


0.00071 


26.3 




87-134 




Lamp 


Lysosome-associated membrane
glycoprotein 


S 7 


-191 1 
-171.1 




351-653 


1675


FGGY


FGGY family of carbohydrate kinases,
N-termi 








20-282 


1675 


FGGY_C 


FGGY family of carbohydrate kinases, 
C-termi 


5.5e-23 


89.8 




285-491 


1676 


Keratin B 
2 


Keratin, high sulfur B2 protein 


7.4 


-81.9 


< 


24-201 


1678 


S_100 


S-100/ICaBP type calcium binding 
domain 


8.3 


-9.6 




909-941 


1680 


pyr_redox


Pyridine nucleotide-disulphide 
oxidoreducta 


7.4e-24 


92.7 


1 


12-369 


1681 


WD40 


WD domain, G-beta repeat 


9.3e-14


59.1 


6 


305- 
343:352- 

432:440- 
621 


1683 


Ribosoma 
1 L44 


Ribosomal protein L44


le-Jo 


l*fZ. i 


1 
1 


17 05 


1686 


WD40


WD domain, G-beta repeat 


o.ye-14 




4. 


o-*f j.JU-oO 


1688 


FliE 


Flagellar hook-basal body complex 
protein Fl 


5.1 


-27.5 


1 


673-763 


1690 


dCMP_cy 
t_deam 


Cytidine and deoxycytidylate
deaminase 


2e-13 


58.0 


1 


12-100 


1691 


G-gamma


GGL domain 


5.1 


-8.6 


1 


712-760 


1692 


cpn60_TC 
P1


TCP-1/cpn60 chaperonin family


0.012 


-260.5 


1 


32-187 


1693 


Glycopho 
rin A 


Glycophorin A 


4.4 


-43.1 


1 


16-149 


1696 


zf-C2H2 


Zinc finger, C2H2 type 


4.5e-15 


63.5 


6 


92- 

115:120- 

198:210- 
233:329- 
353:363- 

Ooxj 


1698 


LRR 


Leucine Rich Repeat 


0.44 


17.0 


4 


37-58:59- 
80:81- 
102:103- 
125 


1699 


VHS 


VHS domain 


9.5e-60 


211.9 


1 


5-146 


1700 


rrm 


RNA recognition motif. 


3.6e-23 


90.4 


3 


128- 

203:332- 
402:413- 
480 


1701 


ank 


Ankyrin repeat 


3.8e- 
101 


349.4 


12 


12-44:45- 

77:79- 

111:112- 

144:145- 

177:179-

211:212- 

244:245- 

277:278- 

310:312- 
344:345-
374:375- 
407 


1701 


URO-D 


Uroporphyrinogen decarboxylase 
(URO-D) 


7.8 


-229.7 


1 


79-354 


1702 


pkinase 


Protein kinase domain 


0.00012 


-68.0 


1 


31-278 


1702 


HEAT 


HEAT repeat 


1.2 


15.6 


2 


342- 

380:499- 

537 


1705 


cyclin 


Cyclin, N-terminal domain 


3.8e-11


50.5 


1 


157-279 


1708 


WD40 


WD domain, G-beta repeat 


1.5e-11


51.8


5


278- 

313:371- 

407:413- 

447:493- 

529:535- 

569 


1709 


SH2 


SH2 domain 


0.002 


10.0 


1 


287-364 


1710 


abhydrola 
se 


alpha/beta hydrolase fold 


2.2e-20 


81.2 


1 


124-355 


1710 


abhydrola 
se 2 


Phospholipase/Carboxylesterase 


1.2 


-84.9 


1 


179-359 


1711 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


0.002 


23.7 


1 


280-320 


1711 


PHD 


PHD-finger 


1.9 


-12.4 


1 




1712 


WD40 


WD domain, G-beta repeat 


3.2e-18 


74.0 


5 


67-
101:123- 
157:164- 
199:209- 
246:253- 
290 


1714 


Acyltransf 
erase 


Acyltransferase 


0.0011 


14.0 


1 


83-217 


1719 


helicase_
C 


Helicase conserved C-terminal domain 


1.2e-19 


78.7 


1 


397-475 


1719 


DEAD 


DEAD/DEAH box helicase 


4.6e-07 


10.4 


1 


2-210 


1719 


IPT 


Isopentenyl transferase 


7.9 


-160.7 


1 


17-172


1720 


Na_Ca_E
x


Sodium/calcium exchanger protein 


8e-76 


265.3 


2 


109- 

249:471- 
616 


1720 


DUF205 


Domain of unknown function DUF 


5.3 


-112.7 


1 


85-253 


1720 


Fumarate 
red D 


Fumarate reductase subunit D 


8.4 


-49.5 


1 


518-620 


1721 


vwa 1 


von Willebrand factor type A domain 


9.6e-29 


108.9 


1


34-205 


1721 


fn3


Fibronectin type III domain 


3e-15 


64.1 


2 


212- 

287:332- 
413 


1724 


pkinase 


Protein kinase domain 


1.6e-06 


-41.3 


1 


40-359 


1725


BRCT 


BRCA1 C Terminus (BRCT) domain 


1e-15


65.6 


3 


3-93:642- 
730:753- 
833 


1727 


BRCT 


BRCA1 C Terminus (BRCT) domain 


7.2 


1.2 


1 


45-130 


1728 


rrm 


RNA recognition motif. 


1.1e-05


32.4 


2 


545- 

612:880- 
942 


1728 


zf-CCCH 


Zinc finger C-x8-C-x5-C-x3-H type


0.012 


22.2 


1 


285-311 


1728 


PWI 


PWI domain 


0.047 


-0.6 


1 


6-78 


1730 


PX 


PX domain 


2.5e-31 


117.5 


1 


94-211 


1731 


PMP22_C 
laudin 


PMP-22/EMP/MP20/Claudin family 


1.5e-56 


201.3 


1 


1-157 
1733


homeobox 


Homeobox domain 




21.6


1 


268-339 


1733 


Vac_Fusi
on 


Chordopoxvirus fusion protein 


5.9


-21.5 


1 


286-364 


1733 


CUT 


CUT domain 


O.J 


-38.0




159-240 


1734 


homeobox 


Homeobox domain 


U.WUJ / 


21.6




268-339 


1734 


Vac_Fusi 
on 


Chordopoxvirus fusion protein 


5.9


-21.5 


1 


286-364 


1734 


CUT


CUT domain


R S 

O.J 


-38.0 


1 


159-240 


1738 


efhand


EF hand


1.5e-16


68.4 


2 


40-68:76- 
104 


1742 


trypsin 


Trypsin 


6.5 


-55.6 


1 


33-255 


1743 


Exonuclea 
se 


Exonuclease 




135.2 


1 


39-213 


1745 


Smr 


Smr domain 


0.0029 


13.0 


1 


1594-1672 


1745 


RyR 


RyR domain 


1.3 


-90 ft 


1 
1 




1745 


L27 


L27 domain 


3.6


5.1


1 
1 


1 S71-1696 


1746 


Smr 


Smr domain 


0.0029 


13.0 


1 


1611-1689 


1746 


RyR 


RyR domain 


1.5 




1 


XjX /"ItUj 


1746 


L27 


L27 domain 


3.6 


5.1 


1 




1746 


Smg4_UP
F3 


Smg-4/UPF3 family 


9.7 


-74.5 


1 


1379-1518 


1747 


mito_carr 


Mitochondrial carrier protein 


1.3e-81 


284.5 


3 


1-90:98- 
9RR 


1748 


OSCP 


ATP synthase delta (OSCP) subunit 


8.4 


-94.4 


1 


67-213 


1749 


Oxysterol 
BP 


Oxysterol-binding protein 


"i A ~ no 

3.4e-7o 






/too 07^ 


1749 


PH 


PH domain 


o Oo rvo 
z.ze-uy 







S9-146 

JA" Jl*T\J 


1750 


lactamase 
B 


Metallo-beta-lactamase superfamily 


4.1 


-28.9 




1-153 


1751 


lipoxygen 
ase 


Lipoxygenase 


9.5e- 
133 


454.4 


1 


126-703 


1751 


PLAT 


PLAT/LH2 domain 


i .oe-j j 






2-1 16 


1753 


Rho_GDI 


RHO protein GDP dissociation 
inhibitor 


3.2e- 
1 1 j 


389.5 


1 




1-187 


1756 


GRAM 


GRAM domain 


7 ^p-17 

/ .JC" 1 / 


60 4 





110-177 


1757 


homeobox 


Homeobox domain 


j.oe-zy 


1 1 v.«? 




21-77 


1758 


lipocalin


Lipocalin / cytosolic fatty-acid binding 
P r 




95 4 


-: 


38-168 


1759 


ig 


Immunoglobulin domain 


3.1e-13 


57.4 


2 


35- 

112:160- 
234 


1761 


oxidored 


NADH-Ubiquinone/plastoquinone
(complex I)


U.U / o 


-128.7 


1 


16-259 


1761 


DUF250 


Domain of unknown function, DUF250


0 4S 


-85.1 


1 


162-337 


1761 


DUF6 


Integral membrane protein DUF6 


2.5 


-14.8 


1 


192-328 


1761 


secY 


eubacterial secY protein 


7 9 


-z*»o. u 


1 
1 


36-243 


1761 


FecCD 


FecCD transport family 


9.8 


-221.2 


1 


120-328 


1762


Keratin_B
2


Keratin, high sulfur B2 protein


3.3e-07 


32.9 


2 


11- 

113:114- 
204 


1763 


sugar_tr 


Sugar (and other) transporter 


0.018 


-119.2 


1 


43-449 


1763 


oxidored 
ql 


NADH-Ubiquinone/plastoquinone 
(complex I) 


4.7 


-165.1 


1 


146-442 


1765 


VPS9 


Vacuolar sorting protein 9 (VPS9) 
domain 


4.5e-33 


123.3 


1 


380-484 


1765 


RA 


Ras association (RalGDS/AF-6) 
domain 


0.73 


-5.9 


1 


518-607 


1766 


SSF


Sodium:solute symporter family


5.9e- 
206 


697.6 




106-535 


1766 


oxidored_
q1


NADH-Ubiquinone/plastoquinone
(complex I) 


4.8 


-165.2 


1 


216-521 


1766 


MIP


Major intrinsic protein 


5.8 


-121.9 




11W64 


1766 




Cytochrome C biogenesis protein 
transmemb 


8.6 


-97.3 


1 


195-425 


1767


Peptidase 
M3 


Peptidase family M3


1.3e- 
203 


689.8 




251-701 


1768


RasGAP


GTPase-activator protein for Ras-like


1.3e-26 


101.9 




449-621 


1768 


PH 


PH domain 


0.012 


22.2 




112-236 


1768 


C2 


C2 domain 


2.6 


-10.9 




249-332 


1768 


Tymo 45 
kd 70kd 


Tymovirus 45/70Kd protein 


4.2 


-293.9 




561-979 


1768 


RNA pol 


DNA-dependent RNA polymerase 


5.1 


-234.7 




381-1225 


1768


PHD 


PHD-finger 


6.9 


-17.6 




214-273 


1770 


rrm 


RNA recognition motif. 


0.48 


5.6 


2 


238- 

323:352- 
422 


1772 


zf-C2H2 


Zinc finger, C2H2 type 


1.6e-87 


304.2 


13 


49-71:197- 

219:225- 

247:253- 

275:281- 

303:309- 

331:337-

359:365- 

387:393- 

415:421- 

443:449- 

471:477- 

499:505- 

527 


1772




BED zinc finger


8.9 


-6.9 


1 


490-528 


1773


ATP-
synt_C


ATP synthase subunit C 


5.4e-08 


40.0 


1 


62-127 


1774 


zf-B_box


B-box zinc finger


7.2e-14 


59.5 


1 


90-331 


1774 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger)


1.6e-10 


48.4 


1 


15-58 


1774 


PHD 




1.1 


-10.3 


1 


14-61 


1774 


zf-AN1


AN1-like Zinc finger


1.4 


-3.2 




15-53 


1774 


zf-UBR1


Putative zinc finger in N-recognin 


7.6 


-23.6 


1


93-146


1775 


FH2 


Formin Homology 2 Domain 


5.4e-50 


179.5 




21-455 


1775 


WH2 


WH2 motif


0.0058 


23.3 




441-456 


1777 


PHD 


PHD-finger


8.7 


-18.6 


1


493-522 


1779 


Branch 


Core-2/I-Branching enzyme


3.9e-05


-52.8


1


175-516 


1780 


TFIIS 


Transcription factor S-II (TFIIS) 


3.9 


-3.7 




295-344 


1781 


qui 
on j 


ono aomain 


1.3e-13 


58.6 




454-508 


1782


filament


Intermediate filament protein 


3.9 


-200.0 




134-406 


1783 


SET 


SET domain 


2e-53 


190.9 




3878-4006 


1783 


PHD 


PHD-finger


4.7e-28 






20-71:72- 
118:147- 
200 


1783 


HMG_bo

X 


HMG (high mobility group) box 


0.0047 


12.3 


1 


710-773 


1783 


DC1 


DC1 domain 


8.5 


1.7 


3 


19-50:69- 
98:146-176 


1784 


zf-DHHC 


DHHC zinc finger domain 


2.4e-25 


97.7 


1 


90-154 


1785 


zf-C2H2 


Zinc finger, C2H2 type 


9.3e-14 


59.1 


4 


18-40:64- 
86:92- 














11 J. I/O- 

200 


1788


ion trans 


Ion transport protein 


o.^e-io 




i 
i 


4^R-fi79 


1788 


ank 


Ankyrin repeat 


1.9e-06 


34.8 


2 


n- 

108:163- 
195 


1788 


Srg 


C. elegans Srg family integral
membrane prot 


O 1 

8.1 




i 
l 


H 1 6"00? 


1791 


ubiquitin 


Ubiquitin family 


0.0036 


24. U 


i 




1792 


Gag_p10


Retroviral GAG p10 protein


4e-23 


90.2 


1 


1-89 


1798 


Ribosoma 
I S12 


Ribosomal protein S12 


0.003 


-14.2 


1 


7-66 


1799 


efhand 


EF hand


1.1e-07


39.0 


3 


281- 
309:318- 
346:353- 
381 


1799 


Acyltransf 
erase 


Acyltransferase


0.0001 


26.8 


1 


18-203 


1801 


zf-C2H2 


Zinc finger, C2H2 type 


3.2e-79 


276.6 


12 


160- 

182:188- 
210:216- 
238:244- 
266:272- 
294:300- 
322:355- 
377:431- 

AG/1 ./t CO 

481:487- 
509:515- 
537:543- 
565 


1801 


LIM


LIM domain


4.7 


-17.4 


1 


433-487 


1802 


lectin c 


Lectin C-type domain 


1.4e-25


98.4


1


143-250


1802 


lectin c 


Lectin C-type domain 


1.4e-25


98.4 


1 


143-250


1804 


efhand 


EF hand


2.5e-08 


41.1 


2 


16-44:56-
82 


1806 


LRR 


Leucine Rich Repeat 


1.5e-07 


38.5 


5 


iiv- 

151:152- 

1 /4- 

195:196- 

917-971- 


1807 


COX3 



Cytochrome c oxidase subunit III 


j.j 


-9^ 9 


i 
i 


51-219 


1807 


CbiM 


CbiM 


9.2 


-93.3 


1 


114-249 


1807 


oxidored_ 
q5_N 


NADH-ubiquinone oxidoreductase 
chain 4 


9.8 


-59.5 


1 


89-169 


1808 


Sulfotrans 
fer 


Sulfotransferase protein 


6.6 


-108.5 


1 


39-282 




Metallophos


Calcineurin-like phosphoesterase


0.23 


14.3 


1 


36-241 


1815 


rrm


RNA recognition motif. 


0.67 


4.1 


1 


201-269 


1816 


filament 


Intermediate filament protein 


1.5 


-188.9 


1 


1-253 


1817 


MAGE 


MAGE family 


4.2e-17 


70.3 


2 


14- 

613:655- 
867 


1817 


Atrophin- 
1 


Atrophin-1 family 


9.5 


-684.8 


1 


4-783 


1818 


Tropomod 


Tropomodulin 


8.1 


-168.0 


1 


362-653 
ulin












1819 


Hydrolase 


haloacid dehalogenase-like hydrolase 


3.4e-21 


818 


1 


28-282 


1820 


Fe hyd 1 

gc 


Iron only hydrogenase large subunit, C- 
te 


1.3e-92 


321.1 


1


204-516 


1820 


Fe hyd S 
SU 


Iron hydrogenase small subunit 


8.3 


-13.1 


1 


529-576 


1821 


filament 


Intermediate filament protein 


1e-143


490.8 


1 


55-366 


1822 


Keratin_B2


Keratin, high sulfur B2 protein 


0.082 


-51.3 


1 


32-174 


1824 


metalthio 


Metallothionein 


5.7e-17 


69.8 


1 


48-108 


1824 


AFP 


Insect antifreeze protein 


2.6 


14.5 


1 


61-72 


1825 


C2 


C2 domain 


4.7e-53 


189.7 


2 


373- 

462:528- 

617 


1825 


RPFDAj; 
ffector 


Rabphilin-3A effector domain 


0.035 


-127.1 


1 


1-237 


1825 


zf-MIZ 


MIZ zinc finger 


0.4 


-10.5 


1 


80-111 


1825 


PHD 


PHD-finger 


0.97 


-9.7 


1 


62-108 


1825 


LIM 


LIM domain 


2.2 


-14.6 


1 


80-128 


1825 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


3.1 


-4.0 


1 


63-105 


1828 


efhand 


EF hand 


8.4e-08 


39.4 


3 


106- 
134:142- 
170:407- 
435 


1829 


60s_ribos 
omal 


60s Acidic ribosomal protein 


7.4 


-38.2 


1 


163-291 


1831 


Tektin 


Tektin family 


1.6 


-227.3 


1 


1-150 


1833 


EGF 


EGF-like domain 


1.1e-32


122.0 


5 


26-54:57- 

85:92- 

124:156- 

187:194- 

226 


1833 


laminin_E 
GF 


Laminin EGF-like (Domains III and V)


1.6 


-5.9 


1 


29-63 


1833 


Bowman-
Birk_leg


Bowman-Birk serine protease inhibitor 


4.8 


-16.7 


~j 


37-85 


1833 


metalthio 


Metallothionein 


5.5 


-10.1 




59-117 


1834 


7tm_1


7 transmembrane receptor (rhodopsin 
family) 


4.8e-05 


-0.8 


-j 


31-273 


1834 


7tm_2 


7 transmembrane receptor (Secretin 
family) 


5.5 


-151.6 




9-248 


1835 


rrm 


RNA recognition motif. 


0.05 


15.7 


- 


75-141 


1835 


Met 10 


Met-10+ like-protein 


0.065 


-100.4 




333-583 


1835 


UPF0020 


Putative RNA methylase family 
UPF0020 


0.073 


-53.8 




410-566 


1835 


PCMT 


Protein-L-isoaspartate(D-aspartate) O- 


0.2 


-114.1 


J 


391-543 


1835 


Methyltra 
nsf 4 


Putative methyltransferase 


2.1 


-82.0 




424-566 


i C5<: 

XOJJ 


notch 


Notch (DSL) domain 


6.1 


-10.5 


1 


233-271 


1836 


Kelch


Kelch motif 


3e-33 


123.9 


4 


242- 

288:290- 

331:333- 

378:380- 

435 


1836 


BTB 


BTB/POZ domain 


2.3 


-15.2 


1 


1-129 


1837 


tsp_1


Thrombospondin type 1 domain 


0.0014 


23.3 


5 


139- 

196:203- 

256:257- 
313:320-
370:374- 
432 


1837 


EB 


EB module 


6.4 


-10.5 


1 


330-399 


1838 


adh zinc 


Zinc-binding dehydrogenase 


7.3e-55 


195.7 


1 


44-369 


1838 


Lipase G 
DSL 


Lipase/Acylhydrolase with GDSL-like
motif 


5.5 


-20.4 


1 


103-189 


1839 


pkinase 


Protein kinase domain 


2.4e-05 


-58.0 


1 


81-333 


1841 


zf-C2H2 


Zinc finger, C2H2 type 


4.4e-30 


113.4 


7 


141- 

163:169- 
191:197- 
219:225- 
247:350- 
372:378- 
400:406- 
434 


1841 


KRAB 


KRAB box 


5.1e-23 


89.9 


1 


44-84 


1841 


TFIIS 


Transcription factor S-II (TFIIS) 


2.4 


-1.9 


1 


144-179 


1841 


Lentiviral 
_Tat


Lentiviral Tat protein 


4.9 


-23.0 




35-125 


1841 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger)


7.2 


-7.1 


1 


352-411 


1841 


rubredoxi 
n 


Rubredoxin 


7.4 


-11.1 


1 


403-433 


1842 


E2F_TDP


Transcription factor E2F/dimerisation 
partne 


1.5 


-95.1 


1 


18-229 


1843 


V_ATPas 
e sub a 


V-type ATPase 116kDa subunit family


0 


1258.7 


1 


26-831 


1843 


UCR 14k 
D 


Ubiquinol-cytochrome C reductase 
compl 


3.2 


-40.4 


1 


6-112 


1843 


FliP 


FliP family 


4.8 


-148.9 


1 


533-661 


1843 


PsbH 


Photosystem II 10 kDa phosphoprotein 


7.1 


5.1 


1 


751-803 


1844 


aa_perme 
ases 


Amino acid permease 


5.6e-08 


-125.3 


1 


28-529 


1844 


Aa_trans 


Transmembrane amino acid transporter 
pro 


0.71 


-178.9 


i 


30-408 


1844 


BPD_tran 
sp_2 


Branched-chain amino acid transport 
syst 


3.3 


11/ A 

-116.9 




1 75-440 


1844 


MVIN 


Virulence factor MVIN 


3.5 


-240.9 


1


87-515 


1844 


FecCD 


FecCD transport family 


5.3 


-216.0 




311-521 


1844 


ion trans 


Ion transport protein 


6.2 


-11.7 




288-495 


1844 


7tm 5 


7TM chemoreceptor 


7.1 


-168.6 


-j 


238-498 


1848 


sugar_tr 


Sugar (and other) transporter 


0.0072 


-107.9 




12-407 


1848 


xan_ur_pe 
rmease 


Permease family 


5.6 


-196.7 




38-359 


1848 


FecCD 


FecCD transport family 


6.3 


-217.4 




56-287 


1848 


Nucleosid
e tran 


Nucleoside transporter 


8.1 


-162.4 


"* — | 


82-365 


1848 


PUCC 


PUCC protein 


9.8 


-283.2 




51-403 


1849 


Filamin 


Filamin/ABP280 repeat 


2.3e-20 


81.1 




396-494 


1849 


zf-B_box


B-box zinc finger 


9.3e-11


49.2 




130- 

176:186- 
227 


1849 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


0.0016 


24.4 




29-62 


1853 


ank 


Ankyrin repeat 


1.1e-98


341.3 


10 


485- 

517:518- 

550:551- 

583:584- 

616:617- 
650:651-
683:684- 
716:717- 
749:750- 
780:782- 
814 


1853 


pkinase 


Protein kinase domain 


1.6e-50


181.3 


1 


22-286 


1854


ras 


Ras family 


1.4e-13


17.8 


1 


5-194 


1855


Acyltransf
erase 


Acyltransferase 


3e-34 


127.2 


1 


92-289 


1856 


wap 


WAP-type (Whey Acidic Protein) 
'four-disulfi


0.0025 


18.7 


1 


31-72 


1857 


arf 


ADP-ribosylation factor family 


2.5e-11


24.6 


1 


7-193 


1857 


GTP EFT 
U 


Elongation factor Tu GTP binding 
domain 


1.6 


-82.5 


1 


19-198 


1858 


zf-C2H2 


Zinc finger, C2H2 type 


2.1e-09


44.7 


5 


82- 

104:110- 
132:138- 
161:364- 
386:392- 
416 


1858 


PHD 


PHD-finger


7.7 


-18.1 


1 


365-40A 


1859 


zf-C2H2 


Zinc finger, C2H2 type 


7.2e-20 


79.4 


4 


119- 

141:147- 
169:175- 
198:214- 
237 


1860 


BTB 


BTB/POZ domain 


1.7e-29 


111.4 


1 


22-126 


1860 


zf-C2H2 


Zinc finger, C2H2 type 


5.1e-14


60.0 


3 


373- 

395:401- 
423:429- 
452 


1860


K_tetra


K+ channel tetramerisation domain 


0.44 


-30.2 


1 


34-124 


1860


zf-BED


BED zinc finger 


8.9 


-6.9 


1 


363-396 


1861


SNF2_N


SNF2 and others N-terminal domain 


2.1e-72 


254.0 


1 


137-473 


1861


helicase 
C 


Helicase conserved C-terminal domain 


1.4e-24 


95.1 


1 


559-632 


1861


DEAD


DEAD/DEAH box helicase 


0.26 


-59.6 


1 


124-337 


1861 


SEC-C 


SEC-C motif 


0.43 


13.8 


1 


426-446 


1862


ras 


Ras family


2.4e-69 


243.8 


1 


19-217 


1862 


arf 


ADP-ribosylation factor family 


4.1e-05 


-53.5 


1 


1-182 


1864


rrm 


RNA recognition motif.


0.053 


15.4 


1 


129-191 


1867


T-box


T-box 


4.3e-09


29.9 


1 


1-92 


1870


F-box


F-box domain 


1.8 


15.0 


1 


124-171 


1871 


PH 


PH domain 


1.1e-13


58.9 


1 


170-273 


1872 


isodh 


Isocitrate/isopropylmalate 
dehydrogenase 


1.4e- 
123 


423.9 


1 


10-383 


1873 


ank 


Ankyrin repeat 


1.8e-08 


41.5


2 


39-71:72- 
104 


1874 


ank


Ankyrin repeat 


1.8e-08 


41.5 


2 


67-99:100- 
132 


1875 


PAP2 


PAP2 superfamily 


0.0034 


13.0 


1 


1-107 


1876 


DUF51 


Protein of unknown function DUF51 


5.7e-13 


52.5 


1 


107-261 


1877 


zf-C2H2 


Zinc finger, C2H2 type


1.5e-72 


254.5 


9 


285- 

307:313- 

335:341- 

363:369- 

391:397- 
447:451-
473:479- 
501:507- 
529 


1877 


KRAB 


KRAB box 


2.7e-21 


84.2 


1 


49-89 


1877 


zf-BED 


BED zinc finger 


n 7 a 


9 7 
Z./ 


i 
i 


144-109 


1877 


TFIIS 


Transcription factor S-II (TFIIS) 


1.5 


-0.0 


1 


369-407 


1877 


GATA 


GATA zinc finger 


Z.I 


T 1 

-/.I 


l 


477 K1A 


1877 


LIM 


LIM domain 


2.5 


1 c 1 


i 


171 


1877 


E6 


Early Protein (E6) 


5.7 


-66.7 


1 


315-402 


1878 


AAA 


ATPase family associated with various 


1.3e-62 


OO 1 A 

221 A 


1 


Of7 /I AO ! 

Zl /-4Uy 


1878 


Sigma54_ 
activat 


Sigma-54 interaction domain 


4.6 


-165.4 


1 


184-365 


1879 


SSF 


Sodium:solute symporter family


4.8e-
170 


578.3


1 


58-460 


1880 


SSF 


Sodium:solute symporter family


1.3e- 
198 


673.2 


1 


58-487 


1881 


perilipin 


Perilipin family 


6.7e-53 


189.2 


1 


12-369 


1882 


aminotran 
1 2 


Aminotransferase class I and II 


7.5e-07 


25.0 


1 


165-515 


1883 


DSPc 


Dual specificity phosphatase, catalytic 
doma 


2.5e-30 


114.2 


1 


54-199 


1884 


ig 


Immunoglobulin domain 


7.2e-14


59.5 


2 


34- 

111:155- 
229 


1885 


MCM 


MCM2/3/5 family 


4.6e-12 


-67.3 


1 


106-391 


1886 


Kelch 


Kelch motif 


1.8e-63 


224.3 


6 


262- 

306:309- 

JDO.jDo- 

49V497- 

t7J.*t7 / — 

544 


1886 




BTB 


BTB/POZ domain




119 7 

117./ 




26-134 


1887 


UBA 


UBA/TS-N domain 


1.9e-05 


31.5 




726-765 


1887 


rrm 


RNA recognition motif.


n aaa9£ 


97 R 




1549-1407 


1887 


MARCKS 


MARCKS family 


4.3 


-90.8 


i 


270-571 


1888 


UBA 


UBA/TS-N domain


1 Op AC 






796-765 


1888 


rrm 


RNA recognition motif. 


0.00026 


27.8 




1303-1368 


1888 


MARCKS 


MARCKS family




on ft 


— 


970-^71 

Lit U-J / 1 


1889 


UBA 


UBA/TS-N domain 


l Oo nc 


ji.j 


— 


79 fi_7A c 


1889 


rrm 


RNA recognition motif. 


a aaaoa 


97 ft 
Z/.o 






1889 


MARCKS 


MARCKS family 


4.3 


-90.8 


1 


270-571 


1889 


Atrophin- 
1 


Atrophin-1 family 


6.9 


-O/O.l 




917 QRA 
ZJ /-700 


1890 


Na_H_Ex
changer 


Sodium/hydrogen exchanger family 


2.4e-18 


74.4 


1 


119-520 


i con 


X 


Sodium/calcium exchanger protein


3.1 


-43.6 


-j 


270-384 


1890 


7tm 5 


7TM chemoreceptor 


3.8 


-163.7 




259-503 


1890 


Abi 


CAAX amino terminal protease family 


5.9 


-25.8 




309-401 


1890 


oxidored 

qi 


NADH-Ubiquinone/plastoquinone 


7.9 


-169.7 




251-502 


1891 


Acyl- 

CoAJiydr 

o 


Cytosolic long-chain acyl-CoA thioeste 


8.9e-72 


251.9 


2 


26- 

168:200- 
336 


1891 


tRNA ant 


OB-fold nucleic acid binding domain 


3.5 


5.8 


1 


230-307 












1892 


zf-DHHC 


DHHC zinc finger domain 


8.6e-18 


72.5 


1 


145-207 


1894 


SPRY 


SPRY domain


4.5e-11


50.2 


1 


353-474 


1894 


fn3


Fibronectin type III domain


0.054 


17.6 


1 


165-258 


1895 


dsrm 


Double-stranded RNA binding motif 


1.5e-12 


55.1 


2 


229- 

293:337- 

401 


1895 


WW 


WW domain 


0.00016 


28.5 


1 


20-49 


1896 


SPRY 


SPRY domain 


9.9e-26 


98.9 


1 


186-301 
PDB annotation


SIGNALING PROTEIN CALCIUM 
BINDING, SIGNALING DOMAIN, NPF 
BINDING, FW BINDING, 2 EF-HAND, 
EH DOMAIN, SIGNALING PROTEIN 


SIGNALING PROTEIN CALCIUM 
BINDING, SIGNALING DOMAIN, NPF 
BINDING, FW BINDING, 2 EF-HAND, 
EH DOMAIN, SIGNALING PROTEIN 




CALCIUM BINDING EH2, 
EPIDERMAL GROWTH FACTOR 
RECEPTOR SUBSTRATE CALCIUM 
BINDING, SIGNALING DOMAIN, NPF 
BINDING, EF-HAND, EH 2 DOMAIN 


CALCIUM BINDING EH2, 
EPIDERMAL GROWTH FACTOR 
RECEPTOR SUBSTRATE CALCIUM 
BINDING, SIGNALING DOMAIN, NPF 
BINDING, EF-HAND, EH 2 DOMAIN 






CALMODULIN, CALCIUM BINDING, 
HELIX-LOOP-HELIX, SIGNALLING, 2 
COMPLEX(CALCIUM-BINDING 
PROTEIN/PEPTIDE) 


Compound


EPIDERMAL GROWTH FACTOR 
RECEPTOR PATHWAY CHAIN: 
A; 


EPIDERMAL GROWTH FACTOR 
RECEPTOR PATHWAY CHAIN: 
A; 


CALCIUM-BINDING PROTEIN 
CALMODULIN (VERTEBRATE) 
1CLL 3


EPS15; CHAIN: NULL;


EPS15; CHAIN: NULL;


CALCIUM-BINDING PROTEIN 
RAT ONCOMODULIN 1RRO 3


CALCIUM BINDING PROTEIN
CALMODULIN (TR2C
FRAGMENT COMPRISING
RESIDUES 78 - 148 1TRC 3 OF
THE INTACT MOLECULE) 1TRC
4


CALMODULIN; CHAIN: A; 
RS20; CHAIN: B; 
PDB annotation


METAL BINDING PROTEIN 
CALCIUM BINDING PROTEIN 






CALCIUM BINDING PROTEIN 
CALCIUM BINDING PROTEIN, 
MUSCLE PROTEIN


MUSCLE PROTEIN CTNC; CARDIAC,
MUSCLE PROTEIN, REGULATORY, 
CALCIUM BINDING 


HYDROLASE CALCINEURIN; 
HYDROLASE, PHOSPHATASE,
IMMUNOSUPPRESSION 


MUSCLE PROTEIN MDE; MUSCLE
PROTEIN 


CALCIUM BINDING CALCIUM 
BINDING 


SIGNALING PROTEIN CALCIUM 
BINDING, SIGNALING DOMAIN, NPF 
BINDING, FW BINDING, 2 EF-HAND, 
EH DOMAIN, SIGNALING PROTEIN


SIGNALING PROTEIN CALCIUM 
BINDING, SIGNALING DOMAIN, NPF 
BINDING, FW BINDING, 2 EF-HAND, 
EH DOMAIN, SIGNALING PROTEIN




STRUCTURAL PROTEIN HELIX- 
TURN-HELIX 


CALCIUM BINDING EH2, 
EPIDERMAL GROWTH FACTOR 


Compound


PARVALBUMIN; CHAIN: A; 


BINDING PROTEIN 
SARCOPLASMIC CALCIUM- 
BINDING PROTEIN 2SCP 3 




PARVALBUMIN; CHAIN: A, B 


TROPONIN C; CHAIN: NULL; 


SERINE/THREONINE 
PHOSPHATASE 2B; CHAIN: A, 
B; 


MYOSIN; CHAIN: A, B, C, D, E, 
F, G, H; 


CALCIUM-BINDING PROTEIN; 
CHAIN: NULL; 


EPIDERMAL GROWTH FACTOR 
RECEPTOR PATHWAY CHAIN: 
A; 


EPIDERMAL GROWTH FACTOR 
RECEPTOR PATHWAY CHAIN: 
A; 


CALCIUM-BINDING PROTEIN 
CALMODULIN (VERTEBRATE) 
1CLL 3


CARDIAC TROPONIN C; 
CHAIN: A; 


EPS15; CHAIN: NULL;


SEQFOL 
D score 
RECEPTOR SUBSTRATE CALCIUM
BINDING, SIGNALING DOMAIN, NPF 
BINDING, EF-HAND, EH 2 DOMAIN 


CALCIUM BINDING EH2, 
EPIDERMAL GROWTH FACTOR 
RECEPTOR SUBSTRATE CALCIUM 
BINDING, SIGNALING DOMAIN, NPF 
BINDING, EF-HAND, EH 2 DOMAIN 


GROWTH FACTOR EH1; EH
DOMAIN, EPS15, EF-HAND,
SOLUTION STRUCTURE, S100
PROTEIN 




< 

• 


CALMODULIN, CALCIUM BINDING, 
HELIX-LOOP-HELIX, SIGNALLING, 2 
COMPLEX(CALCIUM-BINDING 
PROTEIN/PEPTIDE) 


CALMODULIN, CALCIUM BINDING, 
HELIX-LOOP-HELIX, SIGNALLING, 2 
COMPLEX(CALCIUM-BINDING 
PROTEIN/PEPTIDE) 


METAL BINDING PROTEIN 
CALCIUM BINDING PROTEIN 






Compound




EPS15; CHAIN: NULL; 


EPIDERMAL GROWTH FACTOR 
RECEPTOR SUBSTRATE 
CHAIN: A; 


CALCIUM-BINDING PROTEIN 
RAT ONCOMODULIN 1RRO 3


CALCIUM BINDING PROTEIN
CALMODULIN (TR2C
FRAGMENT COMPRISING
RESIDUES 78 - 148 1TRC 3 OF
THE INTACT MOLECULE) 1TRC
4


CALMODULIN; CHAIN: A; 
RS20; CHAIN: B; 


CALMODULIN; CHAIN: A; 
RS20; CHAIN: B; 


PARVALBUMIN; CHAIN: A;


BINDING PROTEIN 
SARCOPLASMIC CALCIUM- 
BINDING PROTEIN 2SCP 3 
D score
PDB annotation


COMPLEX 

(TRANSFERASE/PEPTIDE)
COMPLEX 

(TRANSFERASE/PEPTIDE) 


COMPLEX 

(TRANSFERASE/PEPTIDE) ITAM 
PEPTIDE; COMPLEX 
(TRANSFERASE/PEPTIDE), SYK, 




COMPLEX (PROTO-
ONCOGENE/EARLY PROTEIN) SRC
HOMOLOGY 2 DOMAIN; SH2
DOMAIN, SIGNAL TRANSDUCTION, 
PEPTIDE COMPLEX, 2 COMPLEX 
(PROTO-ONCOGENE/EARLY 


> 


SH2 DOMAIN 

PHOSPHATIDYLINOSITOL 3-KINASE
REGULATORY ALPHA SH2 


Compound


C-SRC TYROSINE KINASE; 
CHAIN: A, B; ACE-FORMYL 
PHOSPHOTYR-GLU-(N,N- 
DIPENTYL AMINE); CHAIN: C, 
D;


SYK KINASE; CHAIN: A, C, E, 
G, I, K; T-CELL SURFACE 
GLYCOPROTEIN CD3 EPSILON 
CHAIN; CHAIN: B, D, F, H, J, L; 


TRANSFERASE(PHOSPHOTRA 
NSFERASE) PROTO- 
ONCOGENE TYROSINE 
KINASE (E.C.2.7.1.112) 1AB2 3
(SRC HOMOLOGY 2 DOMAIN)
(ABELSON, SH2 ABL) 1AB2 4
(NMR, 20 STRUCTURES) 1AB2 5


FYN PROTEIN-TYROSINE
KINASE; CHAIN: F; 
PHOSPHOTYROSYL PEPTIDE; 
CHAIN: P 


HYDROLASE(SH2 DOMAIN) 
TYROSINE PHOSPHATASE SYP 
(N-TERMINAL SH2 DOMAIN) 
1AYA 3 (PTP1D, SHPTP2)
(E.C.3.1.3.48) COMPLEXED
WITH THE PEPTIDE 1AYA 4
PDGFR-1009 1AYA 5


P85 ALPHA; CHAIN: NULL; 


SEQFOL
D score 
PDB annotation






COMPLEX 

(TRANSFERASE/PEPTIDE) 
COMPLEX 

(TRANSFERASE/PEPTIDE) 


COMPLEX 

(TRANSFERASE/PEPTIDE) ITAM
PEPTIDE; COMPLEX
(TRANSFERASE/PEPTIDE), SYK,
KINASE, SH2 DOMAIN, ITAM




COMPLEX (PROTO-
ONCOGENE/EARLY PROTEIN) SRC
HOMOLOGY 2 DOMAIN; SH2
DOMAIN, SIGNAL TRANSDUCTION,
PEPTIDE COMPLEX, 2 COMPLEX
(PROTO-ONCOGENE/EARLY
PROTEIN)


> 


Compound


PRO-LEU-PRO-ASP-PRO-LYS)
(NMR, MINIMIZED AVERAGE 
STRUCTURE) 2PLD 7 




C-SRC TYROSINE KINASE; 
CHAIN: A, B; ACE-FORMYL 
PHOSPHOTYR-GLU-(N,N- 
DIPENTYL AMINE); CHAIN: C, 
D; 


SYK KINASE; CHAIN: A, C, E, 
G, I, K; T-CELL SURFACE 
GLYCOPROTEIN CD3 EPSILON 
CHAIN; CHAIN: B, D, F, H, J, L; 


TRANSFERASE(PHOSPHOTRA 
NSFERASE) PROTO- 
ONCOGENE TYROSINE 
KINASE (E.C.2.7.1.112) 1AB2 3
(SRC HOMOLOGY 2 DOMAIN) 
(ABELSON, SH2 ABL) 1AB2 4 
(NMR, 20 STRUCTURES) 1AB2 5 


FYN PROTEIN-TYROSINE 
KINASE; CHAIN: F; 
PHOSPHOTYROSYL PEPTIDE;
CHAIN: P 


HYDROLASE(SH2 DOMAIN) 
TYROSINE PHOSPHATASE SYP 
(N-TERMINAL SH2 DOMAIN) 
1AYA 3 (PTP1D, SHPTP2)
(E.C.3.1.3.48) COMPLEXED
WITH THE PEPTIDE 1AYA 4
PDB annotation


SH2 DOMAIN 

PHOSPHATIDYLINOSITOL 3-KINASE 
REGULATORY ALPHA SH2 
DOMAIN, P85ALPHA, PI 3-KINASE, 
NMR, C TERMINAL SH2 2 DOMAIN 


COMPLEX 

(PHOSPHOTRANSFERASE/PEPTIDE) 
PROTEIN-TYROSINE KINASE SH2 
DOMAIN, COMPLEX 2 
(PHOSPHOTRANSFERASE/PEPTIDE) 





Compound


P85 ALPHA; CHAIN: NULL; 


SYK PROTEIN TYROSINE 
KINASE; CHAIN: A; ACETYL- 
THR-PTR-GLU-THR-LEU-NH2; 
CHAIN: B; 


SAP SH2 DOMAIN; CHAIN: A, B, 
C, D;


T CELL SIGNAL 
TRANSDUCTION MOLECULE 
SAP; CHAIN: A; SIGNALING 
LYMPHOCYTIC ACTIVATION 
MOLECULE; CHAIN: B; 


PHOSPHORIC DIESTER 
HYDROLASE PHOSPHOLIPASE 
C-GAMMA-1 (E.C.3.1.4.11) (C-
TERMINAL SH2 2PLD 3 
DOMAIN COMPRISING 
RESIDUES 663-759) 
COMPLEXED WITH A 2PLD 4 
PHOSPHOPEPTIDE FROM THE 
PLATELET-DERIVED GROWTH 
FACTOR 2PLD 5 RECEPTOR 
(RESIDUES 1018 - 1029: ASP- 
ASN-ASP-PTYR-ILE-ILE- 2PLD 6 
PRO-LEU-PRO-ASP-PRO-LYS) 
(NMR, MINIMIZED AVERAGE
STRUCTURE) 2PLD 7 


SEQFOL 
D score 
PDB annotation


GUANINE NUCLEOTIDE EXCHANGE 
FACTOR RCC1; GUANINE
NUCLEOTIDE EXCHANGE FACTOR, 
GEF, RAN, 2 RAS-LIKE NUCLEAR 
GTP BINDING PROTEIN HEADER 
TER 


GUANINE NUCLEOTIDE EXCHANGE 
FACTOR RCC1; GUANINE
NUCLEOTIDE EXCHANGE FACTOR, 
GEF, RAN, 2 RAS-LIKE NUCLEAR
GTP BINDING PROTEIN HEADER 
TER 


AMINOPEPTIDASE
AMINOPEPTIDASE, PROLINE 
IMINOPEPTIDASE, SERINE 
PROTEASE, 2 XANTHOMONAS 


HYDROLASE TRIACYLGLYCEROL-
HYDROLASE, X-RAY
CRYSTALLOGRAPHY, 2
PSEUDOMONADACEAE,
OXYANION, CIS-PEPTIDE,
HYDROLASE


HYDROLASE ALPHA/BETA 


LIPASE LIPASE; LIPASE, 
HYDROLASE, 

PSEUDOMONADACEAE, COVALENT 
INTERMEDIATE, 2 TRIGLYCERIDE
ANALOGUE, ENANTIOSELECTIVITY


UBIQUITIN CONJUGATION UBC2; 
UBIQUITIN CONJUGATION,


Compound


REGULATOR OF 
CHROMOSOME 
CONDENSATION 1; CHAIN: A, 
B, C;


REGULATOR OF 
CHROMOSOME 
CONDENSATION 1; CHAIN: A, 
B, C;

... 


PROLINE IMINOPEPTIDASE; 
CHAIN: A, B; 


TRIACYLGLYCEROL
HYDROLASE; CHAIN: NULL; 


PALMITOYL PROTEIN 
THIOESTERASE 1; CHAIN: A;


TRIACYL-GLYCEROL-
HYDROLASE; CHAIN: D, E; 


UBIQUITIN-CONJUGATING 
ENZYME RAD6; CHAIN: A, B, C; 


SEQFOL 
D score 


150.38 
PDB annotation


UBIQUITIN-CONJUGATING
ENZYME


UBIQUITIN CONJUGATION UBC2;
UBIQUITIN CONJUGATION, 
UBIQUITIN-CONJUGATING 
ENZYME 
UBIQUITIN CONJUGATION UBC2;
UBIQUITIN CONJUGATION, 
UBIQUITIN-CONJUGATING 
ENZYME 




UBIQUITIN CONJUUA11UM UB^t; 
UBIQUITIN CONJUGATION, 
UBIQUITIN-CONJUGATING 
ENZYME 


Compound




UBIQUITIN-CONJUGATING 
ENZYME RAD6; CHAIN: A, B, C; 


UBIQUITIN-CONJUGATING 
ENZYME RAD6; CHAIN: A, B, C; 


UBIQUITIN CONJUGATING 
ENZYME; CHAIN: A; 


UBIQUITIN CONJUGATING 
ENZYME; CHAIN: A; 


UBIQUITIN CONJUGATING 
ENZYME; CHAIN: NULL;


UBIQUITIN CONJUGATING 
ENZYME; CHAIN: NULL; 


UBIQUITIN-CONJUGATING 
ENZYME RAD6; CHAIN: A, B, C; 


UBIQUITIN-CONJUGATING 
ENZYME RAD6; CHAIN: A, B, C; 


UBIQUITIN-CONJUGATING 
ENZYME RAD6; CHAIN: A, B, C; 


UBIQUITIN CONJUGATING 
ENZYME; CHAIN: A; 
UBIQUITIN CONJUGATING 


SEQFOL 
D score 




124.83 






70.02 




141.95 
CONJUGATING ENZYME, YEAST
UBIQUITIN CONJUGATION UBC1;
UBIQUITIN CONJUGATION, LIGASE


UBIQUITIN CONJUGATION UBC1;
UBIQUITIN CONJUGATION, LIGASE 


MOTOR PROTEIN MOTOR PROTEIN,
ATPASE, MICROTUBULE 
ASSOCIATED 


MOTOR PROTEIN MOTOR PROTEIN,
ATPASE, MICROTUBULE 
ASSOCIATED 


CONTRACTILE PROTEIN NCD, 
CLARET SEGREGATIONAL 
PROTEIN; NCD CRYSTAL 
STRUCTURE, MICROTUBULE 
MOTORS, KINESIN 2 SUPERFAMILY, 

CONTRACTILE PROTEIN


CONTRACTILE PROTEIN NCD, 
CLARET SEGREGATIONAL
PROTEIN; NCD CRYSTAL 
STRUCTURE, MICROTUBULE 
MOTORS, KINESIN 2 SUPERFAMILY, 
CONTRACTILE PROTEIN


CONTRACTILE PROTEIN NCD, 
CLARET SEGREGATIONAL 
PROTEIN; NCD CRYSTAL 
STRUCTURE, MICROTUBULE 
MOTORS, KINESIN 2 SUPERFAMILY, 
CONTRACTILE PROTEIN


ENDOCYTOSIS/EXOCYTOSIS 
NSEC1; PROTEIN-PROTEIN
COMPLEX, MULTI-SUBUNIT
TRANSCRIPTION INHIBITOR BETA- 


Compound




UBIQUITIN CONJUGATING 
ENZYME; CHAIN: NULL; 


KINESIN; CHAIN: NULL; 


KINESIN; CHAIN: NULL; 


MICROTUBULE MOTOR 
PROTEIN NCD; CHAIN: A, B, C, 
D; 


MICROTUBULE MOTOR
PROTEIN NCD; CHAIN: A, B, C, 
D; 


MICROTUBULE MOTOR 
PROTEIN NCD; CHAIN: A, B, C, 
D; 


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A; 
CHAIN: B; 

TRANSCRIPTIONAL 


SEQFOL 
D score 




179.88 


207.35 






139.50 
PDB annotation


* 

s 

fx, 


COMPLEX (GTP-
BINDING/TRANSDUCER) BETA1,
TRANSDUCIN BETA SUBUNIT;
GAMMA1, TRANSDUCIN GAMMA
SUBUNIT; COMPLEX (GTP- 
BINDING/TRANSDUCER), G 
PROTEIN, HETEROTRIMER 2 
SIGNAL TRANSDUCTION 


COMPLEX (GTP-
BINDING/TRANSDUCER) BETA1,
TRANSDUCIN BETA SUBUNIT;
GAMMA1, TRANSDUCIN GAMMA
SUBUNIT; COMPLEX (GTP- 
BINDING/TRANSDUCER), G 
PROTEIN, HETEROTRIMER 2 
SIGNAL TRANSDUCTION 


COMPLEX (GTP-
BINDING/TRANSDUCER) BETA1,
TRANSDUCIN BETA SUBUNIT;
GAMMA1, TRANSDUCIN GAMMA
SUBUNIT; COMPLEX (GTP- 
BINDING/TRANSDUCER), G 
PROTEIN, HETEROTRIMER 2 
SIGNAL TRANSDUCTION 




Compound


REPRESSOR TUPl; CHAIN: A, 
B, C;


GT-ALPHA/GI-ALPHA 
CHIMERA; CHAIN: A; GT- 
BETA; CHAIN: B; GT-GAMMA; 
CHAIN: G; 


GT-ALPHA/GI-ALPHA 
CHIMERA; CHAIN: A; GT- 
BETA; CHAIN: B; GT-GAMMA; 
CHAIN: G; 


GT-ALPHA/GI-ALPHA 
CHIMERA; CHAIN: A; GT- 
BETA; CHAIN: B; GT-GAMMA; 
CHAIN: G; 


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN MOTOR NCD; CHAIN: 
A;


SEQFOL 
D score 




115.67 
PMF
PDB annotation


NCD, CONTRACTILE PROTEIN 
CONTRACTILE PROTEIN KINESIN,
MICROTUBULE-BASED MOTOR,
NCD, CONTRACTILE PROTEIN


CONTRACTILE PROTEIN 
CONTRACTILE PROTEIN, KAR3,
KINESIN-RELATED PROTEIN,
MOTOR 2 PROTEIN, ATPASE, P- 
LOOP, MICROTUBULE BINDING 
PROTEIN 


CONTRACTILE PROTEIN 
CONTRACTILE PROTEIN, KAR3, 
KINESIN-RELATED PROTEIN, 
MOTOR 2 PROTEIN, ATPASE, P- 
LOOP, MICROTUBULE BINDING 
PROTEIN 


CONTRACTILE PROTEIN 
CONTRACTILE PROTEIN, KAR3, 
KINESIN-RELATED PROTEIN, 
MOTOR 2 PROTEIN, ATPASE, P- 
LOOP, MICROTUBULE BINDING 
PROTEIN 


MOTOR PROTEIN MOTOR PROTEIN,
CYTOSKELETON 


MOTOR PROTEIN MOTOR PROTEIN, 
CYTOSKELETON 


GENE REGULATION POZ DOMAIN; 
PROTEIN-PROTEIN INTERACTION
DOMAIN, TRANSCRIPTIONAL 2 
REPRESSOR, ZINC-FINGER 
PROTEIN, X-RAY 
CRYSTALLOGRAPHY, 3 PROTEIN 
STRUCTURE, PROMYELOCYTIC 


Compound


KINESIN MOTOR NCD; CHAIN: 
A; 


KINESIN-LIKE PROTEIN KAR3; 




KINESIN-LIKE PROTEIN KAR3; 
CHAIN: NULL; 


KINESIN-LIKE PROTEIN KAR3; 
CHAIN: NULL; 


KINESIN HEAVY CHAIN; 
CHAIN: A, B, C, D; 


KINESIN HEAVY CHAIN; 
CHAIN: A, B, C, D;


PROMYELOCYTIC LEUKEMIA 
ZINC FINGER PROTEIN PLZF; 
CHAIN: A; 


SEQFOL 
D score 


132.55 


165.99 
PDB annotation


MOTORS, KINESIN 2 SUPERFAMILY,
CONTRACTILE PROTEIN


CONTRACTILE PROTEIN NCD, 
CLARET SEGREGATIONAL 
PROTEIN; NCD CRYSTAL 
STRUCTURE, MICROTUBULE 
MOTORS, KINESIN 2 SUPERFAMILY, 
CONTRACTILE PROTEIN


CO , 

tea 

J Gog 

) O v 

*<vd 

co H 5 

! 85E 

P W < 


MEMBRANE PROTEIN FOUR HELIX 
BUNDLE, ALPHA HELIX


MOTOR PROTEIN MOTOR PROTEIN, 
CYTOSKELETON


MOTOR PROTEIN MOTOR PROTEIN, 
CYTOSKELETON


MOTOR PROTEIN MOTOR PROTEIN, 


MOTOR PROTEIN MOTOR PROTEIN,
CYTOSKELETON


MOTOR PROTEIN MOTOR PROTEIN, 
CYTOSKELETON


s" 

1 
g 

o 

s 

O PL 


CONTRACTILE PROTEIN KINESIN, 
MICROTUBULE-BASED MOTOR, 


I 

i|3I 

rlgg 


CONTRACTILE PROTEIN 
CONTRACTILE PROTEIN, KAR3, 
KINESIN-RELATED PROTEIN,


Compound




MICROTUBULE MOTOR 
PROTEIN NCD; CHAIN: A, B, C, 
D; 


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A; 
CHAIN: B;


SSO1 PROTEIN; CHAIN: A;


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN MOTOR NCD; CHAIN: 
A; 


KINESIN MOTOR NCD; CHAIN: 
A; 


KINESIN-LIKE PROTEIN KAR3; 
CHAIN: NULL; 


SEQFOL 
D score 










151.15 


151.15










157.70 
CONTRACTILE PROTEIN, KAR3,
KINESIN-RELATED PROTEIN,
MOTOR 2 PROTEIN, ATPASE, P- 
LOOP, MICROTUBULE BINDING 
PROTEIN 




COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR)


COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 


g * a s 

s ~ *i 2 
5 G ca 

5 < £S a 

30-59 

7) tVD OQ W 


Compound




z 

[ 

i 

PL, 

S3 


KINESIN HEAVY CHAIN; 
CHAIN: A, B, C, D;


KINESIN HEAVY CHAIN; 
CHAIN: A, B, C, D; 


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO-MAI; 
CHAIN: P; 


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO-MAI;
CHAIN: P; 


MEROZOITE SURFACE 
PROTEIN 1; CHAIN: A; 


SEQFOL 
D score 
PDB annotation


PROTEIN, SURFACE 2 ANTIGEN,
MALARIA VACCINE COMPONENT, 


GLYCOPROTEIN MEMBRANE 
COFACTOR PROTEIN (MCP); VIRUS
RECEPTOR, COMPLEMENT 
COFACTOR, SHORT CONSENSUS 
REPEAT, 2 SCR, MEASLES VIRUS, 


APOPTOSIS TRAIL; DR5; LIGAND-
RECEPTOR COMPLEX, TRIMERIC 
JELLY-ROLL, TNF-R 2 

SUPERFAMILY, APOPTOSIS


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


HYDROLASE/HYDROLASE 
INHIBITOR PROTEIN-PEPTIDE 
COMPLEX 


Compound




CD46; CHAIN: A, B, C, D, E, F; 


TNF-RELATED APOPTOSIS 
INDUCING LIGAND; CHAIN: B; 
DEATH RECEPTOR 5; CHAIN: 
A; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C;


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H;
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C;


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C;


DES-GLA FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H, I; 
DES-GLA FACTOR VIIA (LIGHT 


SEQFOL
D score 
PDB annotation


HYDROLASE/HYDROLASE 
INHIBITOR PROTEIN-PEPTIDE 
COMPLEX 


m 

H ^ CO 

S S co < f 
| 5 S S { 

s £ g u i 

j a 2 > i 


MA 1 K1A rKAJl r,ux 

EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE 
DOMAIN, HUMAN FIBRILLIN-1
FRAGMENT, MATRIX PROTEIN 


EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE 
DOMAIN, HUMAN FIBRILLIN-1
FRAGMENT, MATRIX PROTEIN


PROTEIN, CYTOKINE, SIGNALLING 
PROTEIN 


BLOOD COAGULATION, EGF-LIKE


Compound


CHAIN); CHAIN: L, M; (DPN)-
PHE-ARG; CHAIN: C, D;
PEPTIDE E-76; CHAIN: X, Y;


DES-GLA FACTOR VIIA
(HEAVY CHAIN); CHAIN: H, I; 
DES-GLA FACTOR VIIA (LIGHT 
CHAIN); CHAIN: L, M; (DPN)- 
PHE-ARG; CHAIN: C, D; 
PEPTIDE E-76; CHAIN: X, Y;


COMPLEMENT CONTROL 
PROTEIN; CHAIN: A; 


FIBRILLIN; CHAIN: NULL; 


FIBRILLIN; CHAIN: NULL; 


TUMOR NECROSIS FACTOR 
RECEPTOR; CHAIN: A, B; 


BLOOD COAGULATION 
FACTOR VII; CHAIN: A; 


SEQFOL 
D score 
















PMF 
score 
DOMAIN, BLOOD 2 CLOTTING


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF,
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 
PROTEASE 


COMPLEMENT INHIBITOR SP35,
VCP, VACCINIA, VIRUS SP35; 
COMPLEMENT INHIBITOR,
COMPLEMENT MODULE, SCR, 


* S W 5 

; 53 o 

3 g g 1 ^ 

1 ^ S < 


BLOOD COAGULATION FACTOR
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN 


Compound


GLYCOPROTEIN FACTOR H, 
15TH AND 16TH C-MODULE 
PAIR (NMR, MINIMIZED IHFHA 
1 AVERAGED STRUCTURE) 
IHFH 4 IHFHA 5 


FACTOR IXA; CHAIN: C, L; D-
PHE-PRO-ARG; CHAIN: I; 


COAGULATION FACTOR VIIA
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


VACCINIA VIRUS 
COMPLEMENT CONTROL 
PROTEIN; CHAIN: NULL; 


COAGULATION FACTOR X; 
CHAIN: NULL; 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


It. O 
PDB annotation


TRANSCRIPTION REGULATION 
PROTO-ONCOGENE, NUCLEAR 
BODIES (PODS), LEUKEMIA, 2 
TRANSCRIPTION REGULATION


i 

i 

1 




LIGASE CBL, UBCH7, ZAP-70, E2, 
UBIQUITIN, E3, 

PHOSPHORYLATION, 2 TYROSINE 
KINASE, UBIQUITINATION,
PROTEIN DEGRADATION, 


LIGASE CBL, UBCH7, ZAP-70, E2, 
UBIQUITIN, E3, 

PHOSPHORYLATION, 2 TYROSINE 
KINASE, UBIQUITINATION,
PROTEIN DEGRADATION, 


ZINC-BINDING PROTEIN ZINC- 
BINDING PROTEIN, XNF7, BBOX, 
DEVELOPMENT, 3 MID-BLASTULA- 
TRANSITION


METAL BINDING PROTEIN RING 
FINGER PROTEIN MAT1; RING
FINGER (C3HC4) 
DNA-BINDING PROTEIN V(D)J


Compound

RNA POLYMERASE; CHAIN: E;


TRANSCRIPTION FACTOR 
PML; CHAIN: NULL; 


VIRUS EQUINE HERPES VIRUS-
1 (C3HC4, OR RING DOMAIN)
1CHC 3 (NMR, 1 STRUCTURE)
1CHC 4


VIRUS EQUINE HERPES VIRUS-
1 (C3HC4, OR RING DOMAIN)
1CHC 3 (NMR, 1 STRUCTURE)
1CHC 4


SIGNAL TRANSDUCTION 
PROTEIN CBL; CHAIN: A; ZAP- 
70 PEPTIDE; CHAIN: B; 
UBIQUITIN-CONJUGATING 
ENZYME E2-18 KDA UBCH7;
CHAIN: C; 


SIGNAL TRANSDUCTION 
PROTEIN CBL; CHAIN: A; ZAP- 
70 PEPTIDE; CHAIN: B; 
UBIQUITIN-CONJUGATING 
ENZYME E2-18 KDA UBCH7;
CHAIN: C; 


NUCLEAR FACTOR XNF7; 
CHAIN: NULL; 


CDK-ACTIVATING KINASE
ASSEMBLY FACTOR MAT1;
CHAIN: A; 

RAG1; CHAIN: NULL;


SEQFOL 
D score 
PROTEIN 1; RAG1, V(D)J
RECOMBINATION, ANTIBODY, 
MAD, RING FINGER, 2 ZINC 
BINUCLEAR CLUSTER, ZINC 
FINGER, DNA-BINDING PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) SREBP-1A; 
STEROL REGULATORY ELEMENT 
BINDING PROTEIN, 2 BASIC-HELIX- 
LOOP-HELIX-LEUCINE ZIPPER, 
SREBP, TRANSCRIPTION 3 FACTOR, 
COMPLEX (TRANSCRIPTION 
REGULATION/DNA) 




FACTOR MAX/DNA) 
TRANSCRIPTIONAL REGULATION, 
DNA BINDING, COMPLEX 2 
(TRANSCRIPTION FACTOR 
MAX/DNA) 


Compound




STEROL REGULATORY 
ELEMENT BINDING PROTEIN 
1A; CHAIN: A, B, C, D; DNA;
CHAIN: E, F, G, H; 




TRANSCRIPTION FACTOR 
MAX; CHAIN: A, B; DNA (5'-
D(*CP*AP*CP*CP*AP*CP*GP*T
P*GP*GP*T)-3'); CHAIN: C, D;


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PDJ J annotation 


TRANSPORT PROTEIN TC4; GTPASE, 
NUCLEAR TRANSPORT, 
TRANSPORT PROTEIN


TRANSPORT PROTEIN TC4; GTPASE, 
NUCLEAR TRANSPORT, 
TRANSPORT PROTEIN


SIGNALING PROTEIN GTP-BINDING
PROTEIN RHOA, GTPASE RHOA;
RHO GDI 1; RHO GTPASE, G-
PROTEIN, SIGNALING PROTEIN




ENDOCYTOSIS/EXOCYTOSIS G- 
PROTEIN, GTPASE, RAB6, 
VESICULAR TRAFFICKING


SIGNALING PROTEIN P21-RAC2;
RHO GDI 2, RHO-GDI BETA, LY-GDI; 
BETA SANDWHICH, PROTEIN- 
PROTEIN COMPLEX, G-DOMAIN, 2 
IMMUNOGLOBULIN FOLD, 
WALKER FOLD, GTP-BINDING 
PROTEIN




GTP-BINDING GTP-BINDING, 
GTPASE, SMALL G-PROTEIN, RHO 
FAMILY, RAS SUPER 2 FAMILY


GTP-BINDING GTP-BINDING, 
GTPASE, SMALL G-PROTEIN, RHO 
FAMILY, RAS SUPER 2 FAMILY 
COMPLEX CSMALL 


Compound


GTP-BINDING PROTEIN RAN; 
CHAIN: A, B; 


GTP-BINDING PROTEIN RAN; 
CHAIN: A, B; 


TRANSFORMING PROTEIN 
RHOA; CHAIN: A, C; RHO GDP 
DISSOCIATION INHIBITOR 
ALPHA; CHAIN: E, F: 


HIS-TAGGED TRANSFORMING 
PROTEIN RHOA(0- 181); CHAIN: 
A; PKN; CHAIN: B: 


RAB6 GTPASE; CHAIN: A; 


RAS-RELATED C3 BOTULINUM
TOXIN SUBSTRATE 2; CHAIN: 
A; RHO GDP-DISSOCIATION 
INHIBITOR 2; CHAIN: B; 


RAN; CHAIN: A, C; IMPORTIN
BETA SUBUNIT; CHAIN: B, D; 


RAC1; CHAIN: NULL;


RAC1; CHAIN: NULL;

RAN; CHAIN: A, C; NUCLEAR 


SEQFOLD score
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PDB annotation 




INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPHILIC ADHESION 




GROWTH FACTOR/GROWTH
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 
CELL ADHESION NCAM: NCAM, 


Compound




TITIN; CHAIN: A, B;


TITIN; CHAIN: A, B; 


HEMOLIN; CHAIN: A, B; 


AXONIN-1; CHAIN: A; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 

NEURAL CELL ADHESION 


SEQFOLD score
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PD]) annotation 




CELL ADHESION NCAM; NCAM, 
IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN


IMMUNE SYSTEM FC-EPSILON RI- 
ALPHA; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 


IMMUNE SYSTEM FC-EPSILON RI- 
ALPHA; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE-
BINDING 1 PROTEIN


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 
ANTIBODY, IGE-FC


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 

ANTIBODY, IGE-FC


IMMUNE SYSTEM, MEMBRANE 
PROTEIN CD32; FC RECEPTOR, 
IMMUNOGLOULIN, LEUKOCYTE, 

CD32 


IMMUNE SYSTEM, MEMBRANE
PROTEIN CD32; FC RECEPTOR, 
IMMUNOGLOULIN, LEUKOCYTE, 
CD32 

IMMUNE SYSTEM RECEPTOR BETA 




MOLECULE; CHAIN: A, B, C, D; 


NEURAL CELL ADHESION 
MOLECULE; CHAIN: A, B, C, D; 














Compound


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON
RECEPTOR CHAIN: A; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON
RECEPTOR CHAIN: A; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON
RECEPTOR CHAIN: A; IG 
EPSILON CHAIN C REGION; 
CHAIN: B, D; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON
RECEPTOR CHAIN: A; IG
EPSILON CHAIN C REGION;
CHAIN: B, D; 


FC RECEPTOR 
FC(GAMMA)RIIA; CHAIN: A;


FC RECEPTOR 
FC(GAMMA)RIIA; CHAIN: A;

LOW AFFINITY 


SEQFOLD score
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PDB annotation 




INHIBITORY RECEPTOR KILLER
CELL INHIBITORY RECEPTOR; 
INHIBITORY RECEPTOR, NATURAL 
KILLER CELLS, IMMUNOLOGICAL 2 
RECEPTORS, IMMUNOGLOBULIN 
FOLD 


CELL INHIBITORY RECEPTOR; 
INHIBITORY RECEPTOR, NATURAL 
KILLER CELLS, IMMUNOLOGICAL 2 
RECEPTORS, IMMUNOGLOBULIN 
FOLD 


INHIBITORY RECEPTOR KILLER
CELL INHIBITORY RECEPTOR; 
INHIBITORY RECEPTOR, NATURAL 
KILLER CELLS, IMMUNOLOGICAL 2 
RECEPTORS, IMMUNOGLOBULIN 
FOLD 


INHIBITORY RECEPTOR KILLER
CELL INHIBITORY RECEPTOR; 


Compound


IMMUNOGLOBULIN GAMMA 
FC REGION CHAIN: A; 


LOW AFFINITY 
IMMUNOGLOBULIN GAMMA 
FC REGION CHAIN: A; 


TWITCHIN; CHAIN: NULL; 


TITIN; CHAIN: NULL; 


P58-CL42 KIR; CHAIN: NULL; 


P58-CL42 KIR; CHAIN: NULL; 


P58-CL42 KIR; CHAIN: NULL; 


P58-CL42 KIR; CHAIN: NULL; 


SEQFOLD score
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INHIBITORY RECEPTOR, NATURAL 
KILLER CELLS, IMMUNOLOGICAL 2
RECEPTORS, IMMUNOGLOBULIN 
FOLD 




IMMUNOGLOBULIN ANTI-
NITROPHENOL, LAMBDA LIGHT
CHAIN, IMMUNOGLOBULIN


IMMUNE SYSTEM P58 NATURAL 
KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 
IMMUNOGLOBULIN


IMMUNE SYSTEM P58 NATURAL 
KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 




IMMUNE SYSTEM CD32; RECEPTOR, 
FC, CD32, IMMUNE SYSTEM 


HYDROLASE C2 DOMAIN, 
PHOSPHOTIDYLINOSITOL,
PHOSPHOTASE, HYDROLASE


HYDROLASE C2 DOMAIN, 
PHOSPHOTIDYLINOSITOL,
PHOSPHOTASE, HYDROLASE 


Compound




MUSCLE PROTEIN TITIN 
MODULE M5 (CONNECTIN) 
1TNM 3 (NMR, MINIMIZED
AVERAGE STRUCTURE) 1TNM
4 1TNM 58


FAB FRAGMENT; CHAIN: 
NULL; 


MHC CLASS I NK CELL 
RECEPTOR PRECURSOR; 
CHAIN: A; 


MHC CLASS I NK CELL 
RECEPTOR PRECURSOR; 
CHAIN: A; 


FC GAMMA RIIB; CHAIN: A; 


FC GAMMA RIIB; CHAIN: A; 


PHOSPHOINOSITIDE 
PHOSPHOTASE PTEN; CHAIN: 
A; 


PHOSPHOINOSITIDE 
PHOSPHOTASE PTEN; CHAIN: 
A; 


SEQFOLD score
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HYDROLASE C2 DOMAIN, 
PHOSPHOTIDYLINOSITOL, 
PHOSPHOTASE, HYDROLASE 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


ENDOCYTOSIS/EXOCYTOSIS
NSEC1; PROTEIN-PROTEIN
COMPLEX MULTI-SUBUNIT 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) TUS; TER; DNA- 
BINDING, DNA REPLICATION, 2 
COMPLEX (DNA-BINDING 
PROTEIN/DNA) 




MEMBRANE PROTEIN FOUR HELIX
BUNDLE, ALPHA HELIX 
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I . 
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D ~ 
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38 

3 X 

§3 
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Coumponnd 


PHOSPHOINOSITIDE 
PHOSPHOTASE PTEN; CHAIN: 
A; 


ALPHA SPECTRIN; CHAIN: A, 
B,C; 


ALPHA SPECTRIN; CHAIN: A, 
B,C; 

' 


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A; 
CHAIN: B; 


REPLICATION TERMINATOR 
PROTEIN; CHAIN: A; DNA 
16MER DUPLEX REPLICATION 
TERMINATOR; CHAIN: B, C; 


SSO1 PROTEIN; CHAIN: A;


SSO1 PROTEIN; CHAIN: A;


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


SEQFOLD score
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B. 


CONTRACTILE PROTEIN 1 










TRANSMEMBRANE PROTEIN 
COLICIN, BACTERIOCIN, ION
CHANNEL FORMATION, 
TRANSMEMBRANE 2 PROTEIN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN


STRUCTURAL PROTEIN TWO
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS,
STRUCTURAL PROTEIN 
STRUCTURAL PROTEIN TWO 


Compound






DEFENSIN DEFENSIN /HNPS-3 
1DFN 3


DEFENSIN DEFENSIN /HNPS-3
1DFN 3


DEFENSIN DEFENSIN /HNPS-3 
1DFN 3


COLICIN IA; CHAIN: NULL; 


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


ALPHA SPECTRIN; CHAIN: A, 


SEQFOLD score






59.97 






148.75 










PMF 
score 








1.00 


1.00 




-0.09 


0.17 


0.04 


-0.02 


0.30 


Verify 
score 








CO 

d 


-0.35 




0.12 


0.06 


0.11 


0.44 


-0.09 


Psi Blast 






1.1e-12


1.1e-12




START AA






On 
o 


O 


o 




8 


00 

t— 1 




On 


O 


CHAIN ID








< 


< 




< 


< 


< 


< 


< 


§£ 

Ph m 






T3 




a 

■a 




c 

3 

pi 


1 


1cun


1cun


1cun


So C 2 






3 
— < 


m 
S 


m 
S 

(-4 


1014 


«* 


3 


1014 




1014 






WO 02/070539 



PCT/US02/05095 
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b 1 KUU 1 Ul\AL» ri\\J 1 IiXl>l ivyu 

REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


SIGNALING PROTEIN GUANINE
NUCLEOTIDE-BINDING PROTEIN 1;
GBP, GTP HYDROLYSIS, GDP, GMP, 
INTERFERON INDUCED, DYNAMIN 
2 RELATED, LARGE GTPASE 
FAMILY, SIGNALING PROTEIN 


NSEC1; PROTEIN-PROTEIN
COMPLEX, MULTI-SUBUNIT 


NSEC1; PROTEIN-PROTEIN
COMPLEX, MULTI-SUBUNIT


NSEC1; PROTEIN-PROTEIN
COMPLEX MULTI-SUBUNIT 
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<ri pq 
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INDING 

synta: 


INDING 
SYNTA] 


BINDING 

synta: 


IN; CHA 


CHAIN: 
CHAIN: 
CHAIN: 






a 


B,C; 


ALPHA SPEC 
B, C; 


INTERFERON 
GUANYLATE 
PROTEIN I; C 


SYNTAXINB 
1; CHAIN: A; 
CHAIN: B; 


SYNTAXINB 
1; CHAIN: A; 
CHAIN: B; 


SYNTAXINE 
1; CHAIN: A; 
CHAIN: B; 


SSOl PROTE 


PREFOLDIN; 
PREFOLDIN; 
PREFOLDIN; 


PREFOLDIN; 
PREFOLDIN; 
PREFOLDIN; 


PREFOLDIN; 
PREFOLDIN; 


SEQFOLD score
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PDB annotation 


CHAPERONE ARCHAEAL PROTEIN


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING
PROTEIN


COMPLEX (ZINC FINGER/DNA)
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA),




MUSCLE 
:HAIN: A; 


V1USCLE 
)HAIN: A; 


VIUSCLE 
!HAIN: A; 


> EPTIDE; 
BINDING 


*EPTIDE; 
BINDING 


•EPTIDE; 
BINDING 


EPTIDE; 


Compound


i CHAIN: 
CHAIN: 
CHAIN: 


w E 


I 

W E 

^ S 


Si 


LETAL I 
ININ 2; C 


w S 
>j £ 


QGSR ZINC FINGER I 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE 
SITE; CHAIN: B. C: 


QGSR ZINC FINGER I 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE 
SITE; CHAIN: B, C: 


QGSR ZINC FINGER F 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE ] 
SITE; CHAIN: B, C: 


QGSR ZINC FINGER P 
CHAIN: A; DUPLEX 




PREFOLDIN; 
PREFOLDIN; 
PREFOLDIN: 


HUMAN SKE 
ALPHA-ACT 


HUMAN SKE 
ALPHA-ACT 


HUMAN SKE 
ALPHA-ACT 


HUMAN SKE 
ALPHA-ACT] 


HUMAN SKE 
ALPHA-ACT] 


SEQFOLD score
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PDB annotation 




COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA)
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 




COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 


Compound


OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


DNA-BINDING PROTEIN 
HUMAN ENHANCER-BINDING 
PROTEIN MBP-1 MUTANT 
WITH CYS 11 1BBO 3
REPLACED BY ABU (C11ABU)
(NMR, 60 STRUCTURES) 1BBO
4


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C,F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C,F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 


SEQFOLD score
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PD)i annotation 


INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA
INTERACTION, PROTEIN DESIGN, 2
CRYSTAL STRUCTURE, COMPLEX 




COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 

(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)
COMPLEX (ZINC FINGER/DNA) ZINC


Compound


PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 

DNA; CHAIN: A, B, D, E;


SEQFOLD score
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PDB annotation 


FINGER, PROTEIN-DNA
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)




Compound


CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G;


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D,E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C,F,G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C,F,G; 


TRANSCRIPTION REGULATION 
YEAST TRANSCRIPTION 
FACTOR ADR1 (RESIDUES 130 -
159) 1PAA 3 (PAPA - CARBOXY
TERMINAL ZINC FINGER
DOMAIN) MUTANT WITH 1PAA
4 PRO 131 REPLACED BY ALA,
PRO 133 REPLACED BY ALA,
CYS 140 1PAA 5 REPLACED BY
ALA (P131A,P133A,C140A) 


SEQFOLD score
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PDfi annotation 




ZINC FINGER TRANSCRIPTION 
FACTOR SP1; ZINC FINGER,
TRANSCRIPTION ACTIVATION, SP1


ZINC FINGER TRANSCRIPTION 
FACTOR SP1; ZINC FINGER,
TRANSCRIPTION ACTIVATION, SP1


ZINC FINGER TRANSCRIPTION 
FACTOR SP1; ZINC FINGER,
TRANSCRIPTION ACTIVATION, SP1


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) TFIIIA; 5S 
GENE; NMR, TFIIIA, PROTEIN, DNA, 
TRANSCRIPTION FACTOR, 5S RNA 2 
GENE, DNA BINDING PROTEIN, 
ZINC FINGER, COMPLEX 3 
(TRANSCRIPTION 
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) TFIIIA; 5S 
GENE; NMR, TFIIIA, PROTEIN, DNA, 
TRANSCRIPTION FACTOR, 5S RNA 2
GENE, DNA BINDING PROTEIN, 
ZINC FINGER, COMPLEX 3 
(TRANSCRIPTION 
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) TFIIIA; 5S 
GENE; NMR, TFIIIA, PROTEIN, DNA, 
TRANSCRIPTION FACTOR, 5S RNA 2 
GENE, DNA BINDING PROTEIN, 
ZINC FINGER, COMPLEX 3
(TRANSCRIPTION 
REGULATION/DNA) 


Compound


(NMR, 10 STRUCTURES) 1PAA 6


SP1F3; CHAIN: NULL; 


SP1F2; CHAIN: NULL;




SP1F2; CHAIN: NULL;


TRANSCRIPTION FACTOR IIIA;
CHAIN: A; 5S RNA GENE; 
CHAIN: E, F; 


TRANSCRIPTION FACTOR IIIA;
CHAIN: A; 5S RNA GENE; 
CHAIN: E, F; 


TRANSCRIPTION FACTOR IIIA; 
CHAIN: A; 5S RNA GENE; 
CHAIN: E,F; 


SEQFOLD score
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COMPLEX (TRANSCRIPTION 
REGULATION/DNA) TFIIIA; 5S 
GENE; NMR, TFIIIA, PROTEIN, DNA, 
TRANSCRIPTION FACTOR, 5S RNA 2 
GENE, DNA BINDING PROTEIN, 
ZINC FINGER, COMPLEX 3 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, ZINC
FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2


Compound


TRANSCRIPTION FACTOR IIIA; 
CHAIN: A; 5S RNA GENE; 
CHAIN: E,F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


SEQFOLD score
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score 
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PDB annotation 


(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ADR1, ZINC FINGER, NMR


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ADR1, ZINC FINGER, NMR


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 


Compound




YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


ADR1; CHAIN: NULL;




ADR1; CHAIN: NULL;


ADR1; CHAIN: NULL;


SEQFOLD score
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0.00 
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PDB annotation 




TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ADR1, ZINC FINGER, NMR






COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA)


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA)


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA)


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA)


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI;


Compound




ADR1; CHAIN: NULL;


COMPLEX(TRANSCRIPTION 
REGULATION/DNA) 
TRAMTRACK PROTEIN (TWO 
ZINC-FINGER PEPTIDE) 
COMPLEXED WITH 2DRP 3 
DNA 2DRP 4


COMPLEX(TRANSCRIPTION 
REGULATION/DNA) 
TRAMTRACK PROTEIN (TWO 
ZINC-FINGER PEPTIDE) 
COMPLEXED WITH 2DRP 3 
DNA 2DRP 4 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


SEQFOLD score
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score 




0.80 


0.11 


o 
d 


0.21 


0.10 


0.71 


0.28 


0.15 


Verify 
score 




-0.01 


-0.30 


-0.47 


VO 
CO 

9 


-0.44 


CO 

9 


-0.29 


d 
i 


Psi Blast 






H < 




1015 


00 


VO 

m 


00 




VO 

*n 
in 


CO 
00 

m 


CO 

C\ 
in 


START AA






CM 
CO 


»— < 
o 

wo 


CO 


00 

»o 

CO 




»— < 




CHAIN ID






< 


< 


< 


< 


< 


< 




Is 




2adr 


2drp 




CN 


% 

CN 






-a 

CN 


in 




1015 


1015 


tn 
i— « 
o 


m 

3 

<-* 


1015 


m 

r-l 

2 


m 
o 

F* 


tn 

3 






PDB annotation 


GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 






OXIDOREDUCTASE 
FERROCYTOCHROME CVOXYGEN 


OXIDOREDUCTASE; 
OXIDOREDUCTASE, 


CYTOCHROME(C)-OXYGEN, 
CYTOCHROME C 2 OXIDASE 




PEPTIDE SYNTHETASE GRSA; 
PEPTIDE SYNTHETASE, GRSA, 
ADENYLATE FORMING 


OXIDOREDUCTASE 
OXIDOREDUCTASE, 
MONOOXYGENASE, 
PHOTOPROTEIN, LUMINESCENCE 




COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 


Compound




ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER DNA BINDING 
DOMAIN ZINC-FINGER (ZFY- 
SWAP) (NMR, 12 STRUCTURES) 
7ZNF 3




CYTOCHROME C OXIDASE; 
CHAIN: A, B, C, D, E, F, G, H, I, J, 
K, L, M, N, O, P, Q,




GRAMICIDIN SYNTHETASE 1; 
CHAIN: A, B; 

PHENYLALANINE; CHAIN: C, 
D; 


LUCIFERASE; CHAIN: NULL; 




QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


SEQFOLD score
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PDB annotation 


GLYCOPROTEIN, RECEPTOR, 2 
SIGNAL, COMPLEX 
(IMMUNOGLOBULIN/RECEPTOR) 


COMPLEX 

(IMMUNOGLOBULIN/RECEPTOR) 
TCR VAPLHA VBETA DOMAIN; T- 
CELL RECEPTOR, STRAND SWITCH, 
FAB, ANTICLONOTYPIC, 2 
(IMMUNOGLOBULIN/RECEPTOR) 






TRANSFERASE FPS; TRANSFERASE, 
ISOPRENE BIOSYNTHESIS, 
CHOLESTEROL BIOSYNTHESIS 


TRANSFERASE FPS; TRANSFERASE, 
ISOPRENE BIOSYNTHESIS, 
CHOLESTEROL BIOSYNTHESIS 












COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


1 COMPLEX (NUCLEAR I 


Compound




KB5-C20 T-CELL ANTIGEN 
RECEPTOR; CHAIN: A, B; 
ANTIBODY DESIRE-1; CHAIN: 
L, H; 


IMMUNOGLOBULIN ANTIGEN- 
BINDING FRAGMENT (FAB) 
(IGG2B, KAPPA) IMAM 3 




FARNESYL DIPHOSPHATE 
SYNTHASE; CHAIN: NULL; 


FARNESYL DIPHOSPHATE 
SYNTHASE; CHAIN: NULL; 




DEFENSIN DEFENSIN /HNPS-3 
1DFN 3


DEFENSIN DEFENSIN /HNPS-3 
1DFN 3


DEFENSIN DEFENSIN /HNPS-3
1DFN 3




RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


U2 RNA HAIRPIN IV; CHAIN: Q, 


SEQFOLD score
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PD)i annotation 


IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS. B-TREFOIL FOLD 


RNA BINDING PROTEIN TAP (NFX1);
RIBONUCLEOPROTEIN (RNP,RBD
OR RRM) AND LEUCINE-RICH-
REPEAT 2 (LRR)




NERVE GROWTH FACTOR/TRKA 
COMPLEX BETA-NGF; COMPLEX, 
TRKA RECEPTOR, NERVE GROWTH 
FACTOR, CYSTEINE KNOT, 2 
IMMUNOGLOBULIN LIKE DOMAIN, 
NERVE GROWTH FACTOR/TRKA 
COMPLEX 


TRANSCRIPTION RNA1P; RANGAP;
GTPASE-ACTIVATING PROTEIN
FOR SPI1, GTPASE-ACTIVATING
PROTEIN, GAP, RNA1P, RANGAP,
LRR, LEUCINE- 2 RICH REPEAT
PROTEIN, TWINNING,


Compound


FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


NUCLEAR RNA EXPORT 
FACTOR 1; CHAIN: A, B; 


NUCLEAR RNA EXPORT 
FACTOR 1; CHAIN: A, B; 


TITIN, I27; CHAIN: NULL;


MUSCLE PROTEIN TITIN 
MODULE M5 (CONNECTIN)
1TNM 3 (NMR, MINIMIZED
AVERAGE STRUCTURE) 1TNM
4 1TNM 58


NERVE GROWTH FACTOR; 
CHAIN: V, W; TRKA 
RECEPTOR; CHAIN: X, Y;


GTPASE-ACTIVATING 
PROTEIN RNA1_SCHPO;
CHAIN: A, B; 


S s 
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PDB annotation 


HEMIHEDRAL TWINNING, 3 
MEROHEDRAL TWINNING, 
MEROHEDRY 


CELL ADHESION PROTEIN NCAM 
MODULE 2; CELL ADHESION, 
GLYCOPROTEIN, HEPARIN- 
BINDING, GPI-ANCHOR, 2 NEURAL 
ADHESION MOLECULE, 
IMMUNOGLOBULIN FOLD, 
HOMOPHILIC 3 BINDING, CELL 
ADHESION PROTEIN 




TRANSFERASE CK, MSSA; 
NUCLEOTIDE MONOPHOSPHATE 
KINASE, TRANSFERASE


TRANSFERASE TRANSFERASE 


TRANSFERASE APS KINASE; APS 
KINASE, ADENYLYLSULFATE 
KINASE, SULFATE, NUCLEOTIDE 2 
KINASE, TRANSFERASE




COMPLEX 

(HYDROLASE/COFACTOR) 
TRIACYLGLYCEROL LIPASE; 
COMPLEX 

(HYDROLASE/COFACTOR), LIPID 
DEGRADATION 


SERINE ESTERASE RELATED 
PROTEIN 2 LIPASE; SERINE 
ESTERASE, HYDROLASE, LIPID 
DEGRADATION, PANCREAS, 2 
GLYCOPROTEIN, CHIMERIC 


Compound




NEURAL CELL ADHESION
MOLECULE, LARGE ISOFORM; 
CHAIN: A; 




CYTIDINE MONOPHOSPHATE 
KINASE; CHAIN: A; 


GLYCEROL-3-PHOSPHATE 
CYTIDYLYLTRANSFERASE; 
CHAIN: A, B; 


ADENOSINE- 

5'PHOSPHOSULFATE KINASE; 
CHAIN: A, B; 




TRIACYLGLYCEROL ACYL- 
HYDROLASE; CHAIN: A, C; 
COLIPASE; CHAIN: B,D 


RP2 LIPASE; CHAIN: NULL; 


SEQFOLD score
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COMPLEX (DNA-BINDING 
PROTEIN/DNA) GHF-1; COMPLEX 
(DNA-BINDING PROTEIN/DNA), 
PITUITARY, CPHD, 2 POU DOMAIN, 
TRANSCRIPTION FACTOR 


PROTEIN/DNA HOMEODOMAIN, 
DNA, COMPLEX, DNA-BINDING 
PROTEIN. PROTEIN/DNA 


TRANSCRIPTION/DNA 
ULTRABITHORAX; PBX PROTEIN; 
DNA BINDING, HOMEODOMAIN, 


Compound


HYDROLASE(CARBOXYLIC 
ESTERASE) LIPASE (E.C.3.1.1.3) 
(TRIACYLGLYCEROL 
HYDROLASE) 1HPL 3 


HYDROLASE(CARBOXYLIC 
ESTERASE) LIPASE (E.C.3.1.1.3) 
COMPLEXED WITH COLIPASE 
AND INHIBITED 1LPB 3 BY
UNDECANE PHOSPHONATE 
METHYL ESTER (TWO 
CONFORMATIONS) 1LPB 4


PANCREATIC LIPASE 
RELATED PROTEIN 1; CHAIN: 
NULL; 


DNA-BINDING PROTEIN 
ANTENNAPEDIA PROTEIN 
(HOMEODOMAIN) MUTANT 
WITH CYS 39 1AHD 3
REPLACED BY SER (C39S) 
COMPLEX WITH DNA (NMR, 
1AHD 4 16 STRUCTURES) 1AHD
5 


PIT-1; CHAIN: A, B; DNA; 
CHAIN: C, D; 


HOMEOBOX PROTEIN HOX-B1;
CHAIN: A; PBX1; CHAIN: B; 
DNA CHAIN: D; DNA CHAIN: E;


ULTRABITHORAX HOMEOTIC 
PROTEIN IV; CHAIN: A; 
HOMEOBOX PROTEIN 


SEQFOLD score


107.77 
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PDB annotation 




TRANSFERASE CK; NUCLEOTIDE 
MONOPHOSPHATE KINASE, 
TRANSFERASE 






COMPLEX (HORMONE/RECEPTOR)
HGH; HGHBP; COMPLEX
(HORMONE/RECEPTOR) 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPHILIC ADHESION 


RECEPTOR RECEPTOR, SIGNAL 
TRANSDUCER OF IL-6 TYPE 
CYTOKINES, THIRD 2 N-TERMINAL 
DOMAIN, TRANSMEMBRANE, 
GLYCOPROTEIN 


RECEPTOR RECEPTOR, SIGNAL 
TRANSDUCER OF IL-6 TYPE 
CYTOKINES, THIRD 2 N-TERMINAL 
DOMAIN, TRANSMEMBRANE, 
GLYCOPROTEIN 


RECEPTOR RECEPTOR, SIGNAL 
TRANSDUCER OF IL-6 TYPE 
CYTOKINES, THIRD 2 N-TERMINAL 
DOMAIN, TRANSMEMBRANE, 
GLYCOPROTEIN 


RECEPTOR RECEPTOR, SIGNAL 
TRANSDUCER OF IL-6 TYPE 
CYTOKINES, THIRD 2 N-TERMINAL 
DOMAIN, TRANSMEMBRANE, 


Compound


COMPLEXED WITH ADP AND 
AMP 1UKZ 3


CYTIDINE MONOPHOSPHATE 
KINASE; CHAIN: A; 


TRANSFERASE(PHOSPHOTRA 
NSFERASE) ADENYLATE 
KINASE (E.C.2.7.4.3) 3ADK 4




GROWTH HORMONE; CHAIN: 
A; GROWTH HORMONE 
RECEPTOR; CHAIN: B; 


HEMOLIN; CHAIN: A, B; 


GP130; CHAIN: NULL; 


GP130; CHAIN: NULL;


GP130; CHAIN: NULL; 
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SEQFOL 
D score 
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PDB annotation 


GLYCOPROTEIN 3 


HORMONE/GROWTH FACTOR 
HORMONE, RECEPTOR, 
HORMONE/GROWTH FACTOR 


CONNECTIN A71, CONNECTIN; 
TITIN, CONNECTIN, FIBRONECTIN 
TYPE III


CONNECTIN A71, CONNECTIN; 
TITIN, CONNECTIN, FIBRONECTIN 
TYPE III


CONNECTIN A71, CONNECTIN; 
TITIN, CONNECTIN, FIBRONECTIN 
TYPE III


CONNECTIN A71, CONNECTIN;
TITIN, CONNECTIN, FIBRONECTIN
TYPE III


CONNECTIN A71, CONNECTIN;
TITIN, CONNECTIN, FIBRONECTIN
TYPE III


SIGNALING PROTEIN CYTOKINE 
RECEPTOR, GLYCOPROTEIN 130, 
GP130, INTERLEUKINE 6 2 
RECEPTOR BETA SUBUNIT, 
SIGNALING PROTEIN 


SIGNALING PROTEIN CYTOKINE 
RECEPTOR, GLYCOPROTEIN 130, 
GP130, INTERLEUKINE 6 2 
RECEPTOR BETA SUBUNIT, 
SIGNALING PROTEIN 


SIGNALING PROTEIN CYTOKINE 
RECEPTOR, GLYCOPROTEIN 130, 
GP130, INTERLEUKINE 6 2 
RECEPTOR BETA SUBUNIT, 
SIGNALING PROTEIN 


Compound




GROWTH HORMONE; CHAIN: 
A; PROLACTIN RECEPTOR; 
CHAIN: B; 


TITIN; CHAIN: NULL; 


TITIN; CHAIN: NULL; 


TITIN; CHAIN: NULL; 


TITIN; CHAIN: NULL; 


TITIN; CHAIN: NULL; 


GP130; CHAIN: A, B; 


GP130; CHAIN: A, B; 


GP130; CHAIN: A, B; 


SEQFOLD score
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PBB annotation ~~1 








CELL ADHESION NEURAL CELL 


CELL ADHESION NEURAL CELL 


CELL ADHESION NEURAL CELL 


CELL ADHESION NEURAL CELL 
ADHESION 


Compound


MOLECULE DROSOPHILA 
NEUROGLIAN 

(CHYMOTRYPTIC FRAGMENT 
CONTAINING THE 1CFB 3 TWO
AMINO PROXIMAL
FIBRONECTIN TYPE III
REPEATS 1CFB 4 (RESIDUES
610 - 814)) 1CFB 5


NEURAL ADHESION 
MOLECULE DROSOPHILA 
NEUROGLIAN 

(CHYMOTRYPTIC FRAGMENT 
CONTAINING THE 1CFB 3 TWO
AMINO PROXIMAL
FIBRONECTIN TYPE III
REPEATS 1CFB 4 (RESIDUES
610 - 814)) 1CFB 5


NEURAL ADHESION 
MOLECULE DROSOPHILA 
NEUROGLIAN 

(CHYMOTRYPTIC FRAGMENT 
CONTAINING THE 1CFB 3 TWO
AMINO PROXIMAL
FIBRONECTIN TYPE III
REPEATS 1CFB 4 (RESIDUES
610 - 814)) 1CFB 5




< 


< 


< 


AXONIN-1; CHAIN 


AXONIN-1; CHAIN: 


AXONIN-1; CHAIN: 


AXONIN-1; CHAIN: 


SEQFOLD score
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PDB annotation 


BINDING PROTEIN BINDING
PROTEIN, CYTOKINE RECEPTOR 


BINDING PROTEIN BINDING
PROTEIN, CYTOKINE RECEPTOR 


BINDING PROTEIN BINDING
PROTEIN, CYTOKINE RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


COMPLEX (CYTOKINE/RECEPTOR)


Compound


GRANULOCYTE COLONY- 
STIMULATING FACTOR 
RECEPTOR; CHAIN: NULL; 


GRANULOCYTE COLONY- 
STIMULATING FACTOR 
RECEPTOR; CHAIN: NULL; 


GRANULOCYTE COLONY- 
STIMULATING FACTOR 
RECEPTOR; CHAIN: NULL; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN:
C, D;


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


ERYTHROPOIETIN; CHAIN: A;


SEQFOLD score
PDB annotation


EPOBP; ERYTHROPOIETIN,
ERYTHROPOIETIN RECEPTOR,
SIGNAL 2 TRANSDUCTION,
HEMATOPOIETIC CYTOKINE,
CYTOKINE RECEPTOR 3 CLASS 1,
COMPLEX (CYTOKINE/RECEPTOR)


CELL ADHESION NCAM; NCAM, 
IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN 


CELL ADHESION NCAM; NCAM, 
IMMUNOGLOBULIN FOLD,
GLYCOPROTEIN 


GROWTH FACTOR/GROWTH
FACTOR RECEPTOR FGF2; FGFR2;
IMMUNOGLOBULIN (IG)LIKE
DOMAINS BELONGING TO THE I-
SET 2 SUBGROUP WITHIN IG-LIKE
DOMAINS, B-TREFOIL FOLD


GROWTH FACTOR/GROWTH
FACTOR RECEPTOR FGF2; FGFR2;
IMMUNOGLOBULIN (IG)LIKE
DOMAINS BELONGING TO THE I-
SET 2 SUBGROUP WITHIN IG-LIKE
DOMAINS, B-TREFOIL FOLD


GROWTH FACTOR/GROWTH
FACTOR RECEPTOR FGF1; FGFR1;
IMMUNOGLOBULIN (IG) LIKE
DOMAINS BELONGING TO THE I-
SET 2 SUBGROUP WITHIN IG-LIKE
DOMAINS, B-TREFOIL FOLD


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF1; FGFR1;
IMMUNOGLOBULIN (IG) LIKE 
DOMAINS BELONGING TO THE I- 


Compound


ERYTHROPOIETIN RECEPTOR; 
CHAIN: B,C; 


NEURAL CELL ADHESION 
MOLECULE; CHAIN: A, B, C, D; 


NEURAL CELL ADHESION 
MOLECULE; CHAIN: A, B, C, D;


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H;


FIBROBLAST GROWTH 
FACTOR 1; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN:
C,D; 


FIBROBLAST GROWTH 
FACTOR 1; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 


SEQFOLD score
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PDB annotation 




HORMONE/GROWTH 
FACTOR/HORMONE RECEPTOR 4- 
HELICAL BUNDLE, ALPHA 
HELICAL BUNDLE, TERNARY 
COMPLEX, FN 2 III DOMAINS, BETA
SHEET DOMAINS, CYTOKINE- 
RECEPTOR COMPLEX 


HORMONE/GROWTH 
FACTOR/HORMONE RECEPTOR 4- 
HELICAL BUNDLE, ALPHA 
HELICAL BUNDLE, TERNARY 
COMPLEX, FN 2 III DOMAINS, BETA 
SHEET DOMAINS, CYTOKINE- 
RECEPTOR COMPLEX 


HORMONE/GROWTH 
FACTOR/HORMONE RECEPTOR 4-
HELICAL BUNDLE, ALPHA 
HELICAL BUNDLE, TERNARY 
COMPLEX, FN 2 III DOMAINS, BETA 
SHEET DOMAINS, CYTOKINE- 


>< 



CONTRACTILE PROTEIN 
IMMUNOGLOBULIN FOLD, BETA 
BARREL 






Compound




PLACENTAL LACTOGEN; 
CHAIN: A; PROLACTIN 
RECEPTOR; CHAIN: B, C; 


PLACENTAL LACTOGEN; 
CHAIN: A; PROLACTIN 
RECEPTOR; CHAIN: B, C; 


PLACENTAL LACTOGEN; 
CHAIN: A; PROLACTIN 
RECEPTOR; CHAIN: B, C; 


TELOKIN; CHAIN: A 


CELL ADHESION PROTEIN 
FIBRONECTIN CELL- 
ADHESION MODULE TYPE III- 
10 1FNA 3


CELL ADHESION PROTEIN 
FIBRONECTIN CELL- 
ADHESION MODULE TYPE III- 
10 1FNA 3


SEQFOLD score
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PDB annotation 


CELL ADHESION PROTEIN RGD, 

P.YTRArPT 1 r Tlf ad \jta tdtv i i?xm »o 


CELL ADHESION PROTEIN RGD, 

F/JTTR A CUT T I TV AD \/T A tdtv ittxtt? 10 


CELL ADHESION PROTEIN RGD, 

FYTRArCT T TTt AD X/fATDTV 1T7XTP 10 


CELL ADHESION PROTEIN RGD, 

RVTRAr'PT T X Tit AD A/fATDTV 1 TTTVTT? 1 o 




HEPARIN AND INTEGRIN BINDING 

HEPARIN AND INTEGRIN BINDING


HEPARIN AND INTEGRIN BINDING


HEPARIN AND INTEGRIN BINDING 
HEPARIN AND INTEGRIN BINDING 




HEPARIN AND INTEGRIN BINDING




CELL ADHESION PROTEIN CELL
ADHESION PROTEIN, RGD, 
EXTRACELLULAR MATRIX, 2 
HEPARIN-BINDING, 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN, RGD,


Compound


FIBRONECTIN; 1FNF 6 CHAIN:
NULL; 1FNF 7


FIBRONECTIN; 1FNF 6 CHAIN:
NULL; 1FNF 7


FIBRONECTIN; 1FNF 6 CHAIN:
NULL; 1FNF 7


FIBRONECTIN; 1FNF 6 CHAIN:
NULL; 1FNF 7


FIBRONECTIN; 1FNF 6 CHAIN:
NULL; 1FNF 7


FIBRONECTIN; CHAIN: A; 


FIBRONECTIN; CHAIN: A; 


FIBRONECTIN; CHAIN: A; 
FIBRONECTIN; CHAIN: A;


FIBRONECTIN; CHAIN: A; 


FIBRONECTIN; CHAIN: A; 


FIBRONECTIN; CHAIN: NULL; 


FIBRONECTIN; CHAIN: NULL; 


FIBRONECTIN; CHAIN: NULL; 
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166.16 
























PMF 
score 


0.98 




© 


0.76 


-0.20 


m 
d 


0.07 


-0.07 
-0.19 


0.98 


-0.20 


1.00 


1.00 


0.49 


Verify 
score 


0.25 




0.29 


0.24 


0.08 


zro 


0.26 


CS rH 

— o 

d d 


0.29 


0.05 


0.21 


0.03 


-0.04 


Psi Blast 



1.1e-20






Start AA




Chain ID










< 


< « 


< « 


< < 


< < 











lfhh 




! j 


a 


\ 1 


3 


gall 



1049 


1049 





WO 02/070539 



PCT/US02/05095



PDB annotation


STRUCTURAL PROTEIN INTEGRIN, 
HEMIDESMOSOME, FIBRONECTIN, 
CARCINOMA, STRUCTURAL 2 
PROTEIN 


STRUCTURAL PROTEIN INTEGRIN, 
HEMIDESMOSOME, FIBRONECTIN, 
CARCINOMA, STRUCTURAL 2 

PROTEIN 


STRUCTURAL PROTEIN INTEGRIN, 
HEMIDESMOSOME, FIBRONECTIN, 
CARCINOMA, STRUCTURAL 2 
PROTEIN 


STRUCTURAL PROTEIN INTEGRIN, 
HEMIDESMOSOME, FIBRONECTIN, 
CARCINOMA, STRUCTURAL 2 

PROTEIN


STRUCTURAL PROTEIN TENASCIN,
FIBRONECTIN TYPE-III, HEPARIN,
EXTRACELLULAR 2 MATRIX,
ADHESION, FUSION PROTEIN,
STRUCTURAL PROTEIN


STRUCTURAL PROTEIN TENASCIN,
FIBRONECTIN TYPE-III, HEPARIN,
EXTRACELLULAR 2 MATRIX,
ADHESION, FUSION PROTEIN,
STRUCTURAL PROTEIN


STRUCTURAL PROTEIN TENASCIN,
FIBRONECTIN TYPE-III, HEPARIN,
EXTRACELLULAR 2 MATRIX,
ADHESION, FUSION PROTEIN,
STRUCTURAL PROTEIN


IMMUNOGLOBULIN-LIKE DOMAIN
CONNECTIN I27, TITIN IG REPEAT
27; MUSCLE PROTEIN.


Compound


INTEGRIN BETA-4 SUBUNIT; 
CHAIN: A, B; 


INTEGRIN BETA-4 SUBUNIT; 
CHAIN: A, B; 


INTEGRIN BETA-4 SUBUNIT;
CHAIN: A, B; 


INTEGRIN BETA-4 SUBUNIT;
CHAIN: A, B; 


TENASCIN; CHAIN: A, B; 


TENASCIN; CHAIN: A, B; 


TENASCIN; CHAIN: A, B; 


TITIN, I27; CHAIN: NULL;


SEQFOLD score
PDB annotation


PROTEIN BINDING ED-B, 
FIBRONECTIN, TYPE III DOMAIN,
ANGIOGENESIS, PROTEIN 2 
BINDING 






CELL ADHESION PROTEIN NCAM 
MODULE 2; CELL ADHESION, 
GLYCOPROTEIN, HEPARIN- 
BINDING, GPI-ANCHOR, 2 NEURAL 
ADHESION MOLECULE, 
IMMUNOGLOBULIN FOLD, 
HOMOPHILIC 3 BINDING, CELL 
ADHESION PROTEIN 




CELL ADHESION NEURAL CELL 
ADHESION 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL
TRANSDUCTION, 2 DIMERIZATION,


Compound


FIBRONECTIN; CHAIN: A; 


HORMONE/RECEPTOR HUMAN 
GROWTH HORMONE 
COMPLEXED WITH ITS 
RECEPTOR 3HHR 3 
(EXTRACELLULAR DOMAIN) 
3HHR4 


HORMONE/RECEPTOR HUMAN 
GROWTH HORMONE 
COMPLEXED WITH ITS 
RECEPTOR 3HHR 3 
(EXTRACELLULAR DOMAIN) 
3HHR4 


NEURAL CELL ADHESION 
MOLECULE, LARGE ISOFORM; 
CHAIN: A; 


HEMOLIN; CHAIN: A, B; 


AXONIN-1; CHAIN: A; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 


SEQFOLD score
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PDB annotation 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


CELL ADHESION NCAM; NCAM, 
IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF1; FGFR1;
IMMUNOGLOBULIN (IG) LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


IMMUNE SYSTEM FC-EPSILON RI- 
ALPHA; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN 


IMMUNE SYSTEM, MEMBRANE


Compound




FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


NEURAL CELL ADHESION 
MOLECULE; CHAIN: A, B, C, D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E,F,G,H; 


FIBROBLAST GROWTH 
FACTOR 1; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; 


FC RECEPTOR


SEQFOLD score
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PROTEIN CD32 S * FC RECEPTOR, 
IMMUNOGLOBULIN, LEUKOCYTE,
CD32 


CONTRACTILE PROTEIN 
IMMUNOGLOBULIN FOLD, BETA 
BARREL 


CONTRACTILE PROTEIN 
IMMUNOGLOBULIN FOLD, BETA
BARREL 


IMMUNE SYSTEM RECEPTOR BETA 
SANDWICH, IMMUNOGLOBULIN-
LIKE, RECEPTOR




MUSCLE PROTEIN CONNECTIN, 
NEXTM5; CELL ADHESION, 
GLYCOPROTEIN, 
TRANSMEMBRANE, REPEAT, 
BRAIN, 2 IMMUNOGLOBULIN FOLD, 
ALTERNATIVE SPLICING, SIGNAL, 3 
MUSCLE PROTEIN


1 




Compound


< 

i 
I 

! 


TELOKIN; CHAIN: A 


TELOKIN; CHAIN: A 


LOW AFFINITY 
IMMUNOGLOBULIN GAMMA 
FC REGION CHAIN: A;


TITIN; CHAIN: NULL; 


TITIN; CHAIN: NULL; 


MUSCLE PROTEIN TITIN
MODULE M5 (CONNECTIN)
1TNM 3 (NMR, MINIMIZED
AVERAGE STRUCTURE) 1TNM
4 1TNM 58


MUSCLE PROTEIN TITIN
MODULE M5 (CONNECTIN)
1TNM 3 (NMR, MINIMIZED


SEQFOLD score
PDB annotation




IMMUNE SYSTEM P58 NATURAL 
KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 
IMMUNOGLOBULIN 


IMMUNE SYSTEM CD32; RECEPTOR, 
FC, CD32, IMMUNE SYSTEM 




COMPLEX (ANTIGEN/PEPTIDE) B35;
MAJOR HISTOCOMPATIBILITY 
ANTIGEN, MHC, HLA, HLA-B3501, 
HIV, 2 NEF, COMPLEX 
(ANTIGEN/PEPTIDE) 


HISTOCOMPATIBILITY COMPLEX 
B8; B2M; PEPTIDE HLA B8, HIV, 
MHC CLASS I, 

HISTOCOMPATIBILITY COMPLEX 


IMMUNE SYSTEM 
IMMUNOGLOBULIN FOLD 


IMMUNE SYSTEM MHC, HLA, 
CLASS I, KIR, NK CELL RECEPTOR, 
IMMUNOGLOBULIN 2 FOLD, 
RECEPTOR/MHC COMPLEX 




Compound


AVERAGE STRUCTURE) 1TNM
4 1TNM 58


MHC CLASS I NK CELL
RECEPTOR PRECURSOR; 
CHAIN: A; 


FC GAMMA RIIB; CHAIN: A;




B*3501; CHAIN: A, B; PEPTIDE 
VPLRPMTY; CHAIN: C; 


B*0801; CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
HIV-1 GAG PEPTIDE 
(GGKKKYKL- INDEX 
PEPTIDE); CHAIN: C; 


HLA-A*0201; CHAIN: A, D; 
BETA-2 MICROGLOBULIN; 
CHAIN: B, E; HTLV-1
OCTAMERIC TAX PEPTIDE; 
CHAIN: C,F; 


HLA-CW3 (HEAVY CHAIN); 
CHAIN: A; BETA-2- 
MICROGLOBULIN; CHAIN: B; 
PEPTIDE FROM IMPORTIN 
ALPHA-2; CHAIN: C; NATURAL 
KILLER CELL RECEPTOR 
KIR2DL2; CHAIN: D,E; 


HISTOCOMPATIBILITY 
ANTIGEN MURINE CLASS I 


SEQFOLD score
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PDB annotation 




CONTRACTILE PROTEIN WH1
DOMAIN; MOLECULAR 
RECOGNITION, ACTIN DYNAMICS, 
CONTRACTILE PROTEIN 


CELL MOTILITY AN INCOMPLETE
SEVEN STRANDED ANTI-PARALLEL
BETA BARREL 2 CLOSED BY AN
ALPHA HELIX, EVH1 DOMAIN,
ACTIN-BASED CELL 3 MOTILITY, 
INTERACTION MODULE 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (TRANSCRIPTION
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN 


COMPLEX (TRANSCRIPTION
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2


Compound




MENA EVH1 DOMAIN; CHAIN:
A; PEPTIDE ACTA; CHAIN: B; 


EVH1 DOMAIN FROM
ENA/VASP-LIKE PROTEIN;
CHAIN: A; PHE-GLU-PHE-
PRO-PRO-PRO-PRO-THR-ASP- 
GLU-GLU; CHAIN: C, D; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER
PROTEIN; CHAIN: C, F, G;


TFIIIA; CHAIN: A, D; 5S
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


SEQFOLD score
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62.71 . 
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA)


CELL ADHESION PROTEIN 
EPITHELIAL CADHERIN DOMAINS 1
AND 2, ECAD12; CADHERIN, CELL
ADHESION PROTEIN, CALCIUM
BINDING PROTEIN


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN 


1 

MUSCLE PROTEIN CTNC; CARDIAC, 
MUSCLE PROTEIN, REGULATORY, 
CALCIUM BINDING 


HYDROLASE CALCINEURIN; 
HYDROLASE, PHOSPHATASE, 
IMMUNOSUPPRESSION


CALCIUM-BINDING CALCIUM- 
BINDING, MYRISTOYLATION, 
NEURONAL SPECIFIC GUANYLATE 

2 CYCLASE ACTIVATOR




Compound




ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


E-CADHERIN; CHAIN: A, B; 


N-CADHERIN; CHAIN: A; 


TROPONIN C; CHAIN: NULL; 


SERINE/THREONINE
PHOSPHATASE 2B; CHAIN: A, 
B; 


NEUROCALCIN DELTA; 
CHAIN: A, B; 


CALCIUM-BINDING PROTEIN 
CALMODULIN COMPLEXED 
WITH CALMODULIN-BINDING 
DOMAIN OF 1CDM 3
CALMODULIN-DEPENDENT
PROTEIN KINASE II 1CDM 4
CALCIUM-BINDING PROTEIN 


SEQFOLD score




105.76 


122.19 


122.52 


73.76 


92.24 


184.52 


57.84 
67.88 


PMF 
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Verify 
score 


















Psi Blast 








Start AA






Chain ID




< 


< 


< 


I 




< 


< 


Is 


( 




1edh


3 


'3 


1 ; 


1 : 


E 

a — 


M 


i 
c 
c 

r 


o 
jv 

-« 


1011 


3 

H 

-4 


1105 


3 i 


n i 

3 C 


1105 






PDB annotation 
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COMPLEX (GTPASE- 
ACTIVATING/GTP-BINDING) 
COMPLEX (GTPASE- 
ACTIVATING/GTP-BINDING), 

GTPASE ACTIVATION 

TRANSPORT PROTEIN TC4; GTPASE, 


Compound


CALMODULIN (VERTEBRATE) 
1CLL 3


RECOVERIN; CHAIN: NULL;


CALCIUM-BINDING PROTEIN
RECOVERIN (CALCIUM
SENSOR IN VISION) 1REC 3


TROPONIN C; CHAIN: NULL; 


TROPONIN C; 1TNX 4 CHAIN:
NULL; 1TNX 5


CONTRACTILE SYSTEM
PROTEIN TROPONIN C 1TOP 3 


CALMODULIN; CHAIN: A; 
RS20; CHAIN: B; 


BINDING PROTEIN 
SARCOPLASMIC CALCIUM- 
BINDING PROTEIN 2SCP 3 


P50-RHOGAP; CHAIN: A, B, C; 
CDC42HS; CHAIN: D, E, F; 

GTP-BINDING PROTEIN RAN; 


SEQFOLD score




160.57 


143.82 


73.38 
PDB annotation




TRANSPORT PROTEIN TC4; GTPASE, 
NUCLEAR TRANSPORT, 
TRANSPORT PROTEIN


SIGNALING PROTEIN GTP-BINDING 
PROTEINS, PROTEIN-PROTEIN 
COMPLEX, EFFECTORS


SIGNALING PROTEIN G PROTEIN, 
GTP HYDROLYSIS, KINETIC 
CRYSTALLOGRAPHY, 2 SIGNALING
PROTEIN 




SMALL GTPASE KARYOPHERIN 
BETA, P95 SMALL GTPASE, 

NUCLEAR TRANSPORT RECEPTOR




GTP-BINDING GTP-BINDING, 
GTPASE, SMALL G-PROTEIN, RHO 


COMPLEX (SMALL
GTPASE/NUCLEAR PROTEIN)
COMPLEX (SMALL
GTPASE/NUCLEAR PROTEIN),
SMALL GTPASE, 2 NUCLEAR




CHAIN: A, B; 


GTP-BINDING PROTEIN RAN; 
CHAIN: A, B; 


RAS-RELATED PROTEIN RAP-
1A; CHAIN: A; PROTO-
ONKOGENE 

SERINE/THREONINE PROTEIN 
KINASE CHAIN: B; 


TRANSFORMING PROTEIN 
P21/H-RAS-1; CHAIN: A; 


HIS-TAGGED TRANSFORMING 
PROTEIN RHOA(0-181); CHAIN:
A; PKN; CHAIN: B;


HUMAN ADP-RIBOSYLATION 
FACTOR 1; 1HUR 5 CHAIN: A,
B; 1HUR 7


RAN; CHAIN: A, C; IMPORTIN 
BETA SUBUNIT; CHAIN: B, D; 


RAP2A; CHAIN: NULL; 


RAC1; CHAIN: NULL;


RAN; CHAIN: A, C; NUCLEAR
PORE COMPLEX PROTEIN 
NUP358; CHAIN: B, D; 


SEQFOLD score




69.65 


67.97 


75.99 
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71.40 
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77.42 
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PDB annotation 


LIPID TRANSPORT APO A-I; 
LIPOPROTEIN, LIPID TRANSPORT, 
CHOLESTEROL METABOLISM, 2 
ATHEROSCLEROSIS, HDL, LCAT- 
ACTIVATION 


SERINE PROTEINASE TRYPSIN-LIKE 
SERINE PROTEINASE, TETRAMER, 

HEPARIN, ALLERGY, 2 ASTHMA




COMPLEX (SERINE 
PROTEINASE/INHIBITOR) 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR)


HYDROLASE/HYDROLASE 
INHIBITOR ENTEROKINASE, 
HEAVY CHAIN; ENTEROKINASE, 
LIGHT CHAIN; ENTEROPEPTIDASE,
TRYPSINOGEN ACTIVATION, 2 
HYDROLASE/HYDROLASE 
INHIBITOR 


Compound


APOLIPOPROTEIN A-I; CHAIN: 
A,B,C,D; 


BETA-TRYPTASE; CHAIN: A, B, 
C,D; 


PLASMINOGEN ACTIVATOR; 
CHAIN: A; GLU-GLY-ARG 
CHLOROMETHYL KETONE; 
CHAIN: I; 


ALPHA-THROMBIN; 1AHT 4
CHAIN: L, H; 1AHT 5 HIRUGEN;
1AHT 8 CHAIN: I; 1AHT 9


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO-MAI; 
CHAIN: P; 


ENTEROPEPTIDASE; CHAIN: A;
ENTEROPEPTIDASE; CHAIN: B; 
VAL-ASP-ASP-ASP-ASP-LYS 
PEPTIDE; CHAIN: C; 


SEQFOLD score


51.65 


100.75 


106.52 


91.53 


99.33 


89.82 


PMF 
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Verify 
score 
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PDB annotation 




COMPLEX (PROTEASE/INHIBITOR)
TRYPSIN, COAGULATION FACTOR 
XA, CHIMERA, PROTEASE, PPACK, 2 
CHLOROMETHYLKETONE, 
COMPLEX (PROTEASE/INHIBITOR) 


COMPLEX (PROTEASE/INHIBITOR)
RTAP; GLYCOPROTEIN, SERINE 
PROTEASE, PLASMA, BLOOD 
COAGULATION, 2 COMPLEX 
(PROTEASE/INHIBITOR) 


COMPLEX (BLOOD 
COAGULATION/PROENZYME) 
COMPLEX (BLOOD 
COAGULATION/PROENZYME), 
THROMBIN, 2 PRETHROMBIN-2, 
PLASMA, SERINE PROTEASE 


TERNARY COMPLEX (ZYMOGEN)
TC, PCPA-TC; TERNARY COMPLEX 
(ZYMOGEN), SERINE PROTEINASE, 
C-TERMINAL 2 PEPTIDASE 


SERINE PROTEASE (TC)-T-PA;
SERINE PROTEASE, FIBRINOLYTIC 
ENZYMES 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 


Compound


HYDROLASE(SERINE 
PROTEINASE) EPSILON- 
THROMBIN (E.C.3.4.21.5) NON- 
COVALENT COMPLEX WITH 
1ETR 3 MQPA 1ETR 4


COAGULATION FACTOR XA- 
TRYPSIN CHIMERA; CHAIN: A; 
D-PHE-PRO-ARG- 
CHLOROMETHYLKETONE 
(PPACK) WITH CHAIN: I; 


FACTOR XA; CHAIN: H, L; 
ANTICOAGULANT PEPTIDE; 
CHAIN: I; 


ALPHA-THROMBIN; CHAIN: L, 
H; PRETHROMBIN-2; CHAIN: K; 


PROCARBOXYPEPTIDASE A; 
CHAIN: A, B; PROPROTEINASE 
E; CHAIN: C; 

CHYMOTRYPSINOGEN C; 
CHAIN: D; 


TWO CHAIN TISSUE 
PLASMINOGEN ACTIVATOR; 
CHAIN: A, B; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


SEQFOLD score


92.09 


89.40 


98.96 


93.49


95.69 


102.89 


71.69 


PMF 
score 
PDB annotation


(ZINC FINGER/DNA)
COMPLEX (TRANSCRIPTION
REGULATION/DNA) TFIIIA; 5S
GENE; NMR, TFIIIA, PROTEIN, DNA,
TRANSCRIPTION FACTOR, 5S RNA 2
GENE, DNA BINDING PROTEIN,
ZINC FINGER, COMPLEX 3
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX
(TRANSCRIPTION
REGULATION/DNA), RNA
POLYMERASE III, 2
TRANSCRIPTION INITIATION, ZINC
FINGER PROTEIN


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


HYDROLASE ATP SYNTHASE, FOF1- 
ATPASE, OXIDATIVE 
PHOSPHORYLATION, 2 
MITOCHONDRIA, HYDROLASE 


Compound


TRANSCRIPTION FACTOR IIIA;
CHAIN: A; 5S RNA GENE; 
CHAIN: E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


F1-ATPASE ALPHA CHAIN;
CHAIN: A; F1-ATPASE BETA
CHAIN; CHAIN: B; F1-ATPASE
GAMMA CHAIN; CHAIN: G; 


SEQFOLD score


67.88 


105.09 


80.44 


202.77 


64.40 


PMF 

score 












Verify 
score 
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PDB annotation 


HYDROLASE ATP SYNTHASE, FOF1-
ATPASE, OXIDATIVE
PHOSPHORYLATION, 2
MITOCHONDRIA, HYDROLASE


TOXIN BINDING PROTEIN TWO 
DOMAINS: BETA PROPELLER AND 
ALPHA/BETA FOLD 




COMPLEX (GTP-
BINDING/TRANSDUCER) BETA1,
TRANSDUCIN BETA SUBUNIT;
GAMMA1, TRANSDUCIN GAMMA
SUBUNIT; COMPLEX (GTP- 
BINDING/TRANSDUCER), G 
PROTEIN, HETEROTRIMER 2 
SIGNAL TRANSDUCTION 


MEMBRANE PROTEIN CRYSTAL 
STRUCTURE, MEMBRANE, FERM 
DOMAIN, TAIL DOMAIN 


KINASE KINASE, SIGNAL 

TRANSDUCTION, 

CALCIUM/CALMODULIN 




Compound


F1-ATPASE ALPHA CHAIN;
CHAIN: A; F1-ATPASE BETA
CHAIN; CHAIN: B; F1-ATPASE
GAMMA CHAIN; CHAIN: G; 


TOLB PROTEIN; CHAIN: A; 


TRANSCRIPTIONAL
REPRESSOR TUP1; CHAIN: A,
B, C; 


TRANSCRIPTIONAL 
REPRESSOR TUP1; CHAIN: A,
B, C;


GT-ALPHA/GI-ALPHA
CHIMERA; CHAIN: A; GT- 
BETA; CHAIN: B; GT-GAMMA; 
CHAIN: G; 


MOESIN; CHAIN: A, B; MOESIN; 
CHAIN: C, D; 


CALCIUM/CALMODULIN- 
DEPENDENT PROTEIN KINASE; 
CHAIN: NULL; 


CALCIUM/CALMODULIN- 
DEPENDENT PROTEIN KINASE; 
CHAIN: NULL; 


SEQFOLD score


130.11 
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CM 

cn 




PMF 
score 




0.17 


0.94 


0.18 


0.40 


1.00 




1.00 


Verify 
score 
PDB annotation


PATHOGENESIS-RELATED PROTEIN 
PATHOGENESIS-RELATED LEAF 
PROTEIN 6, ETHYLENE 
PATHOGENESIS-RELATED 
PROTEIN, PR-1 PROTEINS, 2 PLANT 
DEFENSE 


PATHOGENESIS-RELATED PROTEIN 
PATHOGENESIS-RELATED LEAF 
PROTEIN 6, ETHYLENE 
PATHOGENESIS-RELATED 
PROTEIN, PR-1 PROTEINS, 2 PLANT
DEFENSE 


SIGNALING PROTEIN P21-RAC2; 
RHO GDI 2, RHO-GDI BETA, LY-GDI; 
BETA SANDWICH, PROTEIN-
PROTEIN COMPLEX, G-DOMAIN, 2
IMMUNOGLOBULIN FOLD,
WALKER FOLD, GTP-BINDING 
PROTEIN 


KINASE UMP/CMP KINASE; 
NUCLEOSIDE MONOPHOSPHATE 
KINASE, NMP KINASE, 
PHOSPHORYL 2 TRANSFER, 
TRANSITION STATE ANALOG 
COMPLEX, TRANSFERASE


TRANSFERASE TRANSFERASE 
(PHOSPHORYL)


TRANSFERASE TRANSFERASE 
(PHOSPHORYL)




Compound


PATHOGENESIS-RELATED 
PROTEIN P14A; CHAIN: NULL; 


PATHOGENESIS-RELATED 
PROTEIN P14A; CHAIN: NULL; 


RAS-RELATED C3 BOTULINUM 
TOXIN SUBSTRATE 2; CHAIN: 
A; RHO GDP-DISSOCIATION 
INHIBITOR 2; CHAIN: B; 


URIDYLMONOPHOSPHATE/CY 
TIDYLMONOPHOSPHATE 
KINASE; CHAIN: A; 


PHOSPHOGLYCERATE 
MUTASE; CHAIN: A, B;


PHOSPHOGLYCERATE 
MUTASE; CHAIN: A, B;


TRANSFERASE(PHOSPHOTRA 
NSFERASE) ADENYLATE 
KINASE (E.C.2.7.4.3) 3ADK 4 


SEQFOLD score


82.01 
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VO 




PMF 
score 
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1.00 
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1.00 




0.68 


Verify 
score 
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PDB annotation 






KINASE KINASE, 
PHOSPHOTRANSFERASE 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (TRANSCRIPTION
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN 


COMPLEX (TRANSCRIPTION
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (DNA-BINDING
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 


Compound


TRANSFERASE (PHOSPHORYL) 
PHOSPHOGLYCERATE 
MUTASE (E.C.2.7.5.3) DE- 
PHOSPHO ENZYME 3PGM 4 


TRANSFERASE (PHOSPHORYL) 
PHOSPHOGLYCERATE
MUTASE (E.C.2.7.5.3) DE- 
PHOSPHO ENZYME 3PGM 4 


THYMIDYLATE KINASE; 
CHAIN: A, B, C, D, E, F, G, H; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 




YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


SEQFOLD score




58.96 




100.83 


108.88 



86.63 


98.31 


PMF 
score 


1.00 




0.19 










Verify 
score 
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PDB annotation 
BINDING PROTEIN/DNA)


KINASE KINASE, SIGNAL 

TRANSDUCTION, 

CALCIUM/CALMODULIN


TRANSFERASE TRANSFERASE, 
SERINE/THREONINE-PROTEIN 
KINASE, CASEIN KINASE, 2 




PROTEIN KINASE CDK2; PROTEIN 
KINASE, CELL CYCLE, 
PHOSPHORYLATION, 
STAUROSPORINE, 2 CELL DIVISION, 
MITOSIS, INHIBITION


COMPLEX (KINASE/INHIBITOR) 
CDK6; P19INK4D; CYCLIN
DEPENDENT KINASE, CYCLIN 
DEPENDENT KINASE INHIBITORY 2 
PROTEIN, CDK, INK4, CELL CYCLE, 
COMPLEX (KINASE/INHIBITOR)
HEADER HEI JJt 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR


Compound


CALCIUM/CALMODULIN- 
DEPENDENT PROTEIN KINASE; 
CHAIN: NULL; 


PROTEIN KINASE CK2/ALPHA- 
SUBUNIT; CHAIN: NULL; 


TRANSFERASE(PHOSPHOTRA
NSFERASE) $C-/AMP$-
DEPENDENT PROTEIN KINASE
(E.C.2.7.1.37)($C/APK$) 1APM 3
(CATALYTIC SUBUNIT) ALPHA
ISOENZYME MUTANT WITH
SER 139 1APM 4 REPLACED BY
ALA (/S139A$) COMPLEX WITH
THE PEPTIDE 1APM 5
INHIBITOR PKI(5-24) AND THE
DETERGENT MEGA-8 1APM 6


CYCLIN-DEPENDENT PROTEIN
KINASE 2; CHAIN: NULL; 


CYCLIN-DEPENDENT KINASE
6; CHAIN: A, C; CYCLIN- 
DEPENDENT KINASE 
INHIBITOR; CHAIN: B, D; 


CYCLIN-DEPENDENT KINASE
6; CHAIN: A; P19INK4D; CHAIN: 


SEQFOLD score


130.42 


108.45 


168.88 



111.26 


93.00 


113.75 


PMF 
score 














Verify 
score 
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PDB annotation 

COAGULATION, PLASMA, 
PLATELET, FIBRINOGEN, FIBRIN 


CYTOKINE MACROPHAGE, 
INFLAMMATORY RESPONSE, 
CYTOKINE 




TAUTOMERASE; CYTOKINE, 
MACROPHAGE, INFLAMMATORY 
RESPONSE, TAUTOMERASE 


HEXAMERIZATION DOMAIN 
HEXAMERIZATION DOMAIN, 
ATPASE, TRANSPORT


CALCIUM-BINDING PROTEIN 
CALSEQUESTRIN, CALCIUM- 
BINDING PROTEIN, SARCOPLASMIC 
2 RETICULUM, RABBIT SKELETAL 
MUSCLE 


UBIQUITIN CONJUGATION UBC2; 
UBIQUITIN CONJUGATION, 
UBIQUITIN-CONJUGATING 
ENZYME 




Compound

D,E,F, S,T,M,N; 


MACROPHAGE MIGRATION 
INHIBITORY FACTOR; CHAIN: 
NULL; 


GLYCOSYLATION-INHIBITING 
FACTOR; CHAIN: A, B, C; 


MACROPHAGE MIGRATION 
INHIBITORY FACTOR; CHAIN: 
A,B,C; 


N-ETHYLMALEIMIDE- 
SENSITIVE FUSION PROTEIN; 
CHAIN: A;


CALSEQUESTRIN; CHAIN:
NULL;


UBIQUITIN-CONJUGATING 
ENZYME RAD6; CHAIN: A, B, C; 


UBIQUITIN-PROTEIN LIGASE 
E3A; CHAIN: A, B, C; 
UBIQUITIN CONJUGATING 
ENZYME E2; CHAIN: D; 


UBIQUITIN CONJUGATING 
ENZYME; CHAIN: A; 


SEQFOLD score


145.02 


180.54 
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PMF 
score 
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§ 





TRANSFERASE; TRANSFERASE, 
TYROSINE KINASE, SH3, SH2, 
ONCOPROTEIN 


TRANSFERASE HCK; SH3, PROTEIN 
TYROSINE KINASE, SIGNAL
TRANSDUCTION, 2 TRANSFERASE


1 


ELECTRON TRANSPORT HELIX,
BETA-STRAND, ELECTRON 
TRANSPORT 








CALCIUM-BINDING PROTEIN 
CALMODULIN CERIUM TR1C- 
DOMAIN, RESIDUES 1 - 75; CERIUM- 
LOADED, CALCIUM-BINDING 1 
PROTEIN 


METAL BINDING PROTEIN CAVP: 1 


Compound


PROTO-ONCOGENE TYROSINE 
KINASE (E.C.2.7.1.112) 1SHF 3
(SH3 DOMAIN) 1SHF 4


ABL TYROSINE KINASE; 
CHAIN: NULL; 


HEMATOPOIETIC CELL 
KINASE; CHAIN: NULL; 




CYTOCHROME B5; CHAIN: A; 




COMPLEX(SERINE 
PROTEINASE-INHIBITOR) 
ALPHA-CHYMOTRYPSIN 
(E.C.3.4.21.1) COMPLEX WITH
TURKEY 1CHO 4 OVOMUCOID
THIRD DOMAIN (/OMTKY3$)
1CHO 5


TRYPSIN; CHAIN: T; TRYPTASE 
INHIBITOR; CHAIN: L; 




CALMODULIN; CHAIN: NULL; 


CALCIUM VECTOR PROTEIN; 


SEQFOLD score
























PMF 
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0.19 




00 

m 
d 


0.86 




0.05 


0.23
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PDB annotation 




ANTIFREEZE PROTEIN INSECT
ANTIFREEZE PROTEIN, THERMAL 
HYSTERESIS, TENEBRIO 2 
MOLITOR, IODINATION, RIGHT- 
HANDED BETA-HELIX, TMAFP 


TRANSPORT PROTEIN CALCIUM
BINDING, EF HAND, FOUR-HELIX 
BUNDLE 


BLOOD CLOTTING
COMPLEX(SERINE 
PROTEASE/COFACTOR/LIGAND), 
BLOOD COAGULATION, 2 SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 
RECEPTOR ENZYME, 3 INHIBITOR, 
GLA, EGF, COMPLEX (SERINE 4 


Compound


CHAIN: A; 


CALCIUM-BINDING PROTEIN 
CALMODULIN COMPLEXED 
WITH CALMODULIN-BINDING 
DOMAIN OF 1CDM 3
CALMODULIN-DEPENDENT
PROTEIN KINASE II 1CDM 4


COMPLEX(SERINE 
PROTEINASE-INHIBITOR) 
ALPHA-CHYMOTRYPSIN 
(E.C.3.4.21.1) COMPLEX WITH 
TURKEY 1CHO 4 OVOMUCOID
THIRD DOMAIN (/OMTKY3$)
1CHO 5


CARDIAC TROPONIN C; 
CHAIN: A; 


CALMODULIN; CHAIN: A; 


THERMAL HYSTERESIS 
PROTEIN ISOFORM YL-1; 
CHAIN: A, B; 


CALMODULIN; CHAIN: A; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L;
BLOOD COAGULATION 
FACTOR VIIA; CHAIN: H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T; 5L15; CHAIN: I; 


SEQFOLD score
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Q 



EXTRACELLULAR MODULE 
OSTEONECTIN, SPARC, SECRETED
PROTEIN ACIDIC AND 
EXTRACELLULAR MODULE, 
GLYCOPROTEIN, ANTI-ADHESIVE 
PROTEIN, 2 COLLAGEN BINDING, 
SITE-DIRECTED MUTAGENESIS, 
GLYCOSYLATED 3 PROTEIN 


GROWTH FACTOR EH1; EH
DOMAIN, EPS15, EF-HAND,
SOLUTION STRUCTURE, S100




COMPLEX (SERINE
PROTEASE/INHIBITOR) SGPB; 1SGP
8 ALA18-OMTKY3; 1SGP 14 SERINE
PROTEINASE, PROTEIN INHIBITOR 


CALCIUM-BINDING PROTEIN BM- 
40, OSTEONECTIN; 1SRA 7
EXTRACELLULAR MATRIX 


PROTEIN 1SRA 8

CALCIUM-BINDING PROTEIN BM- 
40, OSTEONECTIN; 1SRA 7
EXTRACELLULAR MATRIX 
PROTEIN 1SRA 8


BLOOD COAGULATION FACTOR
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 


Compound




BASEMENT MEMBRANE 
PROTEIN BM-40; CHAIN: A, B; 


EPIDERMAL GROWTH FACTOR 
RECEPTOR SUBSTRATE 
CHAIN: A; 


CALCIUM-BINDING PROTEIN 
RAT ONCOMODULIN 1RRO 3


STREPTOMYCES GRISEUS 
PROTEINASE B; 1SGP 6 CHAIN:
E; 1SGP 7 TURKEY
OVOMUCOID INHIBITOR; 1SGP
11 CHAIN: I; 1SGP 12


SPARC; 1SRA 4 CHAIN: NULL;
1SRA 5


SPARC; 1SRA 4 CHAIN: NULL;
1SRA 5


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


SEQFOLD score
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GROWTH FACTOR LIKE DOMAIN 








PHOSPHOLIPID ANALOG 
PLACENTAL ANTICOAGULANT 
PROTEIN; PHOSPHOLIPID ANALOG, 
CALCIUM BINDING PROTEIN, 
MEMBRANE 2 BINDING PROTEIN 


PLACENTAL ANTICOAGULANT 
PROTEIN; PHOSPHOLIPID ANALOG, 
CALCIUM BINDING PROTEIN, 
MEMBRANE 2 BINDING PROTEIN 












Compound


PROTEINASE INHIBITOR 
(KAZAL) OVOMUCOID THIRD 
DOMAIN CLEAVED RDI 30VO 


CALCIUM-BINDING PROTEIN 
PARVALBUMIN (ALPHA 
LINEAGE) 5PAL 3 


LECTIN (AGGLUTININ) WHEAT 
GERM AGGLUTININ 
(ISOLECTIN 2) 9WGA 3


ANNEXIN V; CHAIN: NULL; 


ANNEXIN V; CHAIN: NULL; 


CALCIUM/PHOSPHOLIPID
BINDING ANNEXIN I 1AIN 3


CALCIUM/PHOSPHOLIPID
BINDING ANNEXIN I 1AIN 3


CALCIUM/PHOSPHOLIPID
BINDING ANNEXIN I 1AIN 3


CALCIUM/PHOSPHOLIPID-
BINDING PROTEIN ANNEXIN V
1ALA 3


> 



SEQFOLD score










75.48
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CALCIUM/PHOSPHOLIPID-BINDING 
PROTEIN 32.5 KD CALELECTRIN,
ENDONEXIN I; 1ANN 7 32.5KD
CALELECTRIN, ENDONEXIN I,
LIPOCORTIN IV, 1ANN 12 2
CHROMOBINDIN IV, PROTEIN II
1ANN 13


CALCIUM/PHOSPHOLIPID-BINDING
PROTEIN 32.5 KD CALELECTRIN,
ENDONEXIN I; 1ANN 7 32.5KD
CALELECTRIN, ENDONEXIN I,
LIPOCORTIN IV, 1ANN 12 2
CHROMOBINDIN IV, PROTEIN II
1ANN 13


CALCIUM/PHOSPHOLIPID-BINDING
PROTEIN 32.5 KD CALELECTRIN,
ENDONEXIN I; 1ANN 7 32.5KD
CALELECTRIN, ENDONEXIN I,
LIPOCORTIN IV, 1ANN 12 2
CHROMOBINDIN IV, PROTEIN II
1ANN 13


CALCIUM/PHOSPHOLIPID-BINDING
PROTEIN P68, PROTEIN III, 67-KDA-
CALCIMEDIN, LIPOCORTIN
ANNEXIN, CALCIUM-BINDING,
MEMBRANE-BINDING, 2
CALCIUM/PHOSPHOLIPID-BINDING
PROTEIN


CALCIUM/PHOSPHOLIPID-BINDING 
PROTEIN P68, PROTEIN III, 67-KDA- 
CALCIMEDIN, LIPOCORTIN 
ANNEXIN, CALCIUM-BINDING, 
MEMBRANE-BINDING, 2 
CALCIUM/PHOSPHOLIPID-BINDING 


Compound


ANNEXIN IV; 1ANN 5 CHAIN:
NULL; 1ANN 6


ANNEXIN IV; 1ANN 5 CHAIN:
NULL; 1ANN 6


ANNEXIN IV; 1ANN 5 CHAIN:
NULL; 1ANN 6


ANNEXIN VI; CHAIN: NULL; 


ANNEXIN VI; CHAIN: NULL; 


SEQFOLD score




68.51 








PMF 
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1.00 
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Verify 
score 
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PROTEIN 


CALCIUM/PHOSPHOLIPID-BINDING 
PROTEIN ANNEXIN FAMILY 1AXN
14 






PHOSPHOLIPID ANALOG 
PLACENTAL ANTICOAGULANT 
PROTEIN; PHOSPHOLIPID ANALOG, 
CALCIUM BINDING PROTEIN, 
MEMBRANE 2 BINDING PROTEIN 










Compound


ANNEXIN III; 1AXN 4 CHAIN:
NULL; 1AXN 5


ANNEXIN III; 1AXN 4 CHAIN:
NULL; 1AXN 5


ANNEXIN III; 1AXN 4 CHAIN:
NULL; 1AXN 5


CALCIUM/PHOSPHOLIPID 
BINDING ANNEXIN V 
(LIPOCORTIN V, ENDONEXIN 
II, PLACENTAL 1HVD 3
ANTICOAGULANT PROTEIN)
(CALCIUM IONS ARE VISIBLE)
MUTATION 1HVD 4 WITH GLU
17 REPLACED BY GLY (E17G)
1HVD 5


ANNEXIN V; CHAIN: NULL; 


CALCIUM/PHOSPHOLIPID
BINDING ANNEXIN I 1AIN 3


CALCIUM/PHOSPHOLIPID
BINDING ANNEXIN I 1AIN 3


CALCIUM/PHOSPHOLIPID
BINDING ANNEXIN I 1AIN 3


CALCIUM/PHOSPHOLIPID-
BINDING PROTEIN ANNEXIN V
1ALA 3












































84.90 




69.28 
























403.43 






PMF 
score 


1.00 
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Verify 
score 
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CALCIUM/PHOSPHOLIPID-BINDING 
PROTEIN 32.5 KD CALELECTRIN, 
ENDONEXIN I; 1ANN 7 32.5KD
CALELECTRIN, ENDONEXIN I,
LIPOCORTIN IV, 1ANN 12 2
CHROMOBINDIN IV, PROTEIN II 


CALCIUM/PHOSPHOLIPID-BINDING 
PROTEIN P68, PROTEIN III, 67-KDA-
CALCIMEDIN, LIPOCORTIN
ANNEXIN, CALCIUM-BINDING,
MEMBRANE-BINDING, 2 
CALCIUM/PHOSPHOLIPID-BINDING 


CALCIUM/PHOSPHOLIPID-BINDING 
PROTEIN ANNEXIN FAMILY 1AXN

14 




PHOSPHOLIPID ANALOG 
PLACENTAL ANTICOAGULANT 
PROTEIN; PHOSPHOLIPID ANALOG, 
CALCIUM BINDING PROTEIN, 


PHOSPHOLIPID ANALOG 
PLACENTAL ANTICOAGULANT 


Compound


ANNEXIN IV; 1ANN 5 CHAIN:
NULL; 1ANN 6


ANNEXIN VI; CHAIN: NULL; 


ANNEXIN III; 1AXN 4 CHAIN:
NULL; 1AXN 5


CALCIUM/PHOSPHOLIPID 
BINDING ANNEXIN V 
(LIPOCORTIN V, ENDONEXIN 
II, PLACENTAL 1HVD 3
ANTICOAGULANT PROTEIN)
(CALCIUM IONS ARE VISIBLE)
MUTATION 1HVD 4 WITH GLU
17 REPLACED BY GLY (E17G)
1HVD 5


ANNEXIN V; CHAIN: NULL; 


ANNEXIN V; CHAIN: NULL; 


SEQFOLD score










58.84 




PMF 
score 


1.00 


1.00 


1.00 


1.00 




1.00 


Verify 
score 


0.55 
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0.68 
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MEMBRANE-R17srnmn 9 I 


CALCIUM/PHOSPHOLIPID-BINDING 
PROTEIN


CALCIUM/PHOSPHOLIPID-BINDING 
PROTEIN ANNEXIN FAMILY 1AXN
14


CALCIUM/PHOSPHOLIPID-BINDING
PROTEIN ANNEXIN FAMILY 1AXN
14


METAL BINDING PROTEIN PLANT
ANNEXIN, CAPSICUM ANNUUM, 
BELL PEPPER, CALCIUM 2 BINDING 






SCAFFOLD PROTEIN SCAFFOLD 
PROTEIN, PP2A, 


Compound




ANNEXIN III; 1AXN 4 CHAIN:
NULL; 1AXN 5


ANNEXIN III; 1AXN 4 CHAIN:
NULL; 1AXN 5


ANNEXIN 24(CA32); CHAIN: A, 
B; 


CALCIUM/PHOSPHOLIPID 
BINDING ANNEXIN V 
(LIPOCORTIN V, ENDONEXIN
II, PLACENTAL 1HVD 3
ANTICOAGULANT PROTEIN)
(CALCIUM IONS ARE VISIBLE)
MUTATION 1HVD 4 WITH GLU
17 REPLACED BY GLY (E17G)
1HVD 5


CALCIUM/PHOSPHOLIPID 
BINDING ANNEXIN V 
(LIPOCORTIN V, ENDONEXIN 
II, PLACENTAL 1HVD 3
ANTICOAGULANT PROTEIN)
(CALCIUM IONS ARE VISIBLE)
MUTATION 1HVD 4 WITH GLU
17 REPLACED BY GLY (E17G)
1HVD 5


PROTEIN PHOSPHATASE PP2A; 
CHAIN: A, B; 


SEQFOLD score




85.90 






68.02 




120.56 


PMF 
score 






1.00


1.00 




1.00




Verify 
score 






0.70 
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PHOSPHORYLATION, HEAT REPEAT 


SCAFFOLD PROTEIN SCAFFOLD 
PROTEIN, PP2A, 

PHOSPHORYLATION, HEAT REPEAT 


PROTEIN, PP2A, 

PHOSPHORYLATION, HEAT REPEAT 


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5* 



KARYOPHERIN ALPHA; NUCLEAR 
IMPORT RECEPTOR, NUCLEAR 
LOCALIZATION SIGNAL, 2 
ARMADILLO REPEATS, 
AUTOINHIBITION, INTRASTERIC 
REGULATION 


NUCLbAK l KJi^nr i 
KARYOPHERIN ALPHA; NUCLEAR 
IMPORT RECEPTOR, NUCLEAR 
LOCALIZATION SIGNAL, 2 
ARMADILLO REPEATS, 
AUTOINHIBITION, INTRASTERIC 
REGULATION 




STRUCTURAL PROTEIN
ARMADILLO REPEAT, BETA- 


Compound


PROTEIN PHOSPHATASE PP2A; 
CHAIN: A, B; 


PROTEIN PHOSPHATASE PP2A; 
CHAIN: A, B; 


KARYOPHERIN ALPHA;
CHAIN: A, B; MYC PROTO-
ONCOGENE PROTEIN; CHAIN:
C, D, E, F;


KARYOPHERIN ALPHA;
CHAIN: A, B; MYC PROTO-
ONCOGENE PROTEIN; CHAIN:
C, D, E, F;


IMPORTIN ALPHA; CHAIN: A; 


IMPORTIN ALPHA; CHAIN: A; 


RAN; CHAIN: A, C; IMPORTIN 
BETA SUBUNIT; CHAIN: B, D; 


BETA-CATENIN; CHAIN: NULL;


SEQFOLD score










118.96 








PMF 

score 


0.23 


-0.07 
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0.86 
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CA1ENIN, STRUCTURAL PROTEIN 
ARMADILLO REPEAT ARMADILLO 
REPEAT, BETA-CATENIN, 
CYTOSKELETON 


OXIDOREDUCTASE FATTY ACID 
HYDROXYLASE; FATTY ACID 
MONOOXYGENASE, 
HEMOPROTEIN. P450 REMAP 1C 


OXIDOREDUCTASE (OXYGENASE) 


OXIDOREDUCTASE FATTY ACID 
HYDROXYLASE; FATTY ACID 
MONOOXYGENASE, 
HEMOPROTEIM P450 RFMAR *T 


OXIDOREDUCTASE FATTY ACID 
HYDROXYLASE; FATTY ACID
MONOOXYGENASE, 
HEMOPR DTETKJ P450 P F.MA P HT 


OXIDOREDUCTASE 
PROGESTERONE 21- 
HYDROXYLASE, CYPIIC5 P450 1, 
MEMBRANE PROTEIN, 
PROGESTERONE 21- 
HYDROXYLASE, BENZO(A) 2 
PYRENE HYDROXYLASE, 
ESTRADIOL 2-HYDROXYLASE,
P450, CYP2C5


OXIDOREDUCTASE NITRIC OXIDE 
REDUCTASE, CYTOCHROME 
P450NOR 


OXIDOREDUCTASE (OXYGENASE)


Compound


BETA-CATENIN; CHAIN: NULL; 


CYTOCHROME P450; CHAIN: A, 
B; 


CYTOCHROME P450 ERYF; 
1OXA 5 CHAIN: NULL 1OXA 6


CYTOCHROME P450; CHAIN: A, 
B; 


CYTOCHROME P450; CHAIN: A, 
B; 


CYTOCHROME P450 2C5; 
CHAIN: A; 


NITRIC OXIDE REDUCTASE; 
CHAIN: A; 


CYTOCHROME P450 ERYF; 
1OXA 5 CHAIN: NULL 1OXA 6
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TOXIN BINDING PROTEIN TWO 
DOMAINS: BETA PROPELLER AND 
ALPHA/BETA FOLD 




BINDING/TRANSDUCER) BETA1,
TRANSDUCIN BETA SUBUNIT;
GAMMA1, TRANSDUCIN GAMMA
SUBUNIT; COMPLEX (GTP- 
BINDING/TRANSDUCER), G 
PROTEIN, HETEROTRIMER 2 
SIGNAL TRANSDUCTION 


BINDING/TRANSDUCER) BETA1,
TRANSDUCIN BETA SUBUNIT;
GAMMA1, TRANSDUCIN GAMMA
SUBUNIT; COMPLEX (GTP- 
BINDING/TRANSDUCER), G 
PROTEIN, HETEROTRIMER 2 
SIGNAL TRANSDUCTION 




TOLB PROTEIN; CHAIN: A; 


TRANSCRIPTIONAL
REPRESSOR TUP1; CHAIN: A,
B, C;


TRANSCRIPTIONAL
REPRESSOR TUP1; CHAIN: A,
B, C;


TRANSCRIPTIONAL
REPRESSOR TUP1; CHAIN: A,
B, C;


GT-ALPHA/GI-ALPHA 
CHIMERA; CHAIN: A; GT- 
BETA; CHAIN: B; GT-GAMMA; 
CHAIN: G; 


GT-ALPHA/GI-ALPHA 
CHIMERA; CHAIN: A; GT- 
BETA; CHAIN: B; GT-GAMMA; 
CHAIN: G; 


GT-ALPHA/GI-ALPHA 
CHIMERA; CHAIN: A; GT- 
BETA; CHAIN: B; GT-GAMMA; 
CHAIN: G; 


SEQFOLD score










119.11 






PMF 
score 


0.04 


o 
p 


0.29 


§ 




0.53 


1.00 


Verify 
score 
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0.40 
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BINDING/TRANSDUCER), G 
PROTEIN, HETEROTRIMER 2
SIGNAL TRANSDUCTION 


IMMUNOGLOBULIN
IMMUNOGLOBULIN, C REGION, V 


CHAPERONE/STRUCTURAL 
PROTEIN CHAPERONE ADHESIN
DONOR STRAND
COMPLEMENTATION, 2
CHAPERONE/STRUCTURAL
PROTEIN


CHAPERONE/STRUCTURAL
PROTEIN CHAPERONE ADHESIN
DONOR STRAND
COMPLEMENTATION, 2
CHAPERONE/STRUCTURAL 
PROTEIN 


LIGASE CYCLIN A/CDK2-
ASSOCIATED P45; CYCLIN A/CDK2-
ASSOCIATED P19; SKP1, SKP2, F-
BOX, LRRS, LEUCINE-RICH
REPEATS, SCF, 2 UBIQUITIN, E3,
UBIQUITIN PROTEIN LIGASE


HELICASE DNA REPAIR, DNA 
REPLICATION, SOS RESPONSE, 
HELICASE, 2 ATP-BINDING, DNA-


COMPLEX (HELICASE/DNA) 
COMPLEX (HELICASE/DNA), 
HELICASE, DNA UNWINDING 
HYDROLASE/DNA ATP-DEPENDENT 


Compound




ANTI-IDIOTYPIC FAB 409.5.3 
(IGG2A) FAB; CHAIN: A, B, L, H 


PAPD-LIKE CHAPERONE FIMC; 
CHAIN: A, C, E, G, I, K, M, O; 
MANNOSE-SPECIFIC ADHESIN 
FIMH; CHAIN: B, D, F, H, J, L, N, 
P; 


PAPD-LIKE CHAPERONE FIMC; 
CHAIN: A, C, E, G, I, K, M, O; 
MANNOSE-SPECIFIC ADHESIN 
FIMH; CHAIN: B, D, F, H, J, L, N, 


SKP2; CHAIN: A, C; SKP1;
CHAIN: B, D; 


PCRA; CHAIN: NULL; 


ATP-DEPENDENT DNA 
HELICASE REP; CHAIN: A, B; 
DNA CHAIN: C; 
HELICASE PCRA; CHAIN: A, F; 


SEQFOLD score














90.62 


PMF 
score 
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Verify 
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PEPl IDE-COMPLEX, HELICAL 
REPEAT, HSP90, 2 PROTEIN 
BINDING 


CHAPERONE HOP, TPR-DOMAIN,
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSP90, 2 PROTEIN 
BINDING 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSP90, 2 PROTEIN 
BINDING 


CHAPERONE HOP, TPR-DOMAIN,
PEPTIDE-COMPLEX, HELICAL
REPEAT, HSP90, 2 PROTEIN
BINDING


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSP90, 2 PROTEIN 
BINDING 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSP90, 2 PROTEIN
BINDING 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSC70, 2 HSP70, PROTEIN
BINDING 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSC70, 2 HSP70, PROTEIN 
BINDING 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL
REPEAT, HSC70, 2 HSP70, PROTEIN


Compound




TPR2A-DOMAIN OF HOP; 
CHAIN: A; HSP90-PEPTIDE 
MEEVD; CHAIN: B; 


TPR2A-DOMAIN OF HOP; 
CHAIN: A; HSP90-PEPTIDE
MEEVD; CHAIN: B; 


TPR2A-DOMAIN OF HOP;
CHAIN: A; HSP90-PEPTIDE 
MEEVD; CHAIN: B; 


TPR2A-DOMAIN OF HOP; 
CHAIN: A; HSP90-PEPTIDE 
MEEVD; CHAIN: B; 


TPR2A-DOMAIN OF HOP; 
CHAIN: A; HSP90-PEPTIDE 
MEEVD; CHAIN: B; 


TPR1-DOMAIN OF HOP; CHAIN:
A, B; HSC70-PEPTIDE; CHAIN:
C, D;


TPR1-DOMAIN OF HOP; CHAIN:
A, B; HSC70-PEPTIDE; CHAIN:
C, D;


TPR1-DOMAIN OF HOP; CHAIN:
A, B; HSC70-PEPTIDE; CHAIN:
C, D;


SEQFOLD score




















PMF 
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0.99 


0.74 
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UTTMHTNG 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSC70, 2 HSP70, PROTEIN 
BINDING 




PEROXISMORE RECEPTOR 1, PTS1-
BP, PEROXIN-5, PTS1 PROTEIN-
PEPTIDE COMPLEX,
TETRATRICOPEPTIDE REPEAT, TPR, 
2 HELICAL REPEAT 


SIGNALING PROTEIN
PEROXISMORE RECEPTOR 1, PTS1-
BP, PEROXIN-5, PTS1 PROTEIN-


Compound


TPR1-DOMAIN OF HOP; CHAIN:
A, B; HSC70-PEPTIDE; CHAIN:
C, D;


TPR1-DOMAIN OF HOP; CHAIN:
A, B; HSC70-PEPTIDE; CHAIN:
C, D;


TPR1-DOMAIN OF HOP; CHAIN:
A, B; HSC70-PEPTIDE; CHAIN:
C, D;


TPR1-DOMAIN OF HOP; CHAIN:
A, B; HSC70-PEPTIDE; CHAIN:
C, D;




PEROXISOMAL TARGETING 
SIGNAL 1 RECEPTOR; CHAIN: 
A, B; PTS1-CONTAINING
PEPTIDE; CHAIN: C, D; 


PEROXISOMAL TARGETING
SIGNAL 1 RECEPTOR; CHAIN:
A, B; PTS1-CONTAINING


SEQFOLD score
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PEPTIDE COMPLEX, 
TETRATRICOPEPTIDE REPEAT, TPR, 
2 HELICAL REPEAT 


SIGNALING PROTEIN 
PEROXISMORE RECEPTOR 1, PTS1- 
BP, PEROXIN-5, PTS1 PROTEIN-
PEPTIDE COMPLEX,
TETRATRICOPEPTIDE REPEAT, TPR,
2 HELICAL REPEAT 


SIGNALING PROTEIN 
PEROXISMORE RECEPTOR 1, PTS1-
BP, PEROXIN-5, PTS1 PROTEIN-
PEPTIDE COMPLEX,
TETRATRICOPEPTIDE REPEAT, TPR, 
2 HELICAL REPEAT 


SIGNALING PROTEIN 
PEROXISMORE RECEPTOR 1, PTS1- 
BP, PEROXIN-5, PTS1 PROTEIN-
PEPTIDE COMPLEX,
TETRATRICOPEPTIDE REPEAT, TPR, 
2 HELICAL REPEAT 


CELL ADHESION PROTEIN 
EPITHELIAL CADHERIN DOMAINS 1 
AND 2, ECAD12; CADHERIN, CELL 
ADHESION PROTEIN, CALCIUM 
BINDING PROTEIN


CELL ADHESION PROTEIN 
EPITHELIAL CADHERIN DOMAINS 1
AND 2, ECAD12; CADHERIN, CELL
ADHESION PROTEIN, CALCIUM
BINDING PROTEIN


CELL ADHESION PROTEIN 
EPITHELIAL CADHERIN DOMAINS 1 








PEROXISOMAL TARGETING 
SIGNAL 1 RECEPTOR; CHAIN: 
A, B; PTS1-CONTAINING
PEPTIDE; CHAIN: C, D; 












Compound


PEPTIDE; CHAIN: C, D; 


PEROXISOMAL TARGETING
SIGNAL 1 RECEPTOR; CHAIN:
A, B; PTS1-CONTAINING
PEPTIDE; CHAIN: C, D;


PEROXISOMAL TARGETING
SIGNAL 1 RECEPTOR; CHAIN:
A, B; PTS1-CONTAINING
PEPTIDE; CHAIN: C, D;


E-CADHERIN; CHAIN: A, I 




E-CADHERIN; CHAIN: A, I 


E-CADHERIN; CHAIN: A, B 


SEQFOLD score
















PMF 
score 




1.00


1.00 
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.^t^. m r.oinl/1. /"I A TMJT7 D TXT /"'TJT T 




CADHERIN 1NCI 13


CELL ADHESION PROTEIN CELL
ADHESION PROTEIN 




ADHESION PROTEIN 


SANDWICH, CALCIUM-BINDING 
PROTEIN, METAL BINDING 2 
PROTEIN 




ANTI-ONCOGENE CELL CYCLE, 
ANTI-ONCOGENE, REPEAT, ANK 
REPEAT 




Compound




N-CADHERIN; 1NCI 3


N-CADHERIN; 1NCI 3


N-CADHERIN; CHAIN: A; 


N-CADHERIN; CHAIN: A; 


N-CADHERIN; CHAIN: A; 


LAMININ ALPHA2 CHAIN; 
CHAIN: A, B, C, D;


EPITHELIAL CADHERIN; 
CHAIN: NULL; 


TUMOR SUPPRESSOR 
P16INK4A; CHAIN: NULL; 


TUMOR SUPPRESSOR 
P16INK4A; CHAIN: NULL; 


TUMOR SUPPRESSOR 
P16INK4A; CHAIN: NULL; 


TUMOR SUPPRESSOR 
P16INK4A; CHAIN: NULL; 

TUMOR SUPPRESSOR 


SEQFOLD score
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(INHIBITOR PROTEIN/KINASE) 
COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 

(INHIBITOR PROTEIN/KINASE)


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 


HORMONE/GROWTH FACTOR P18-
INK4C; CELL CYCLE INHIBITOR,
P18INK4C, TUMOR, SUPPRESSOR,
CYCLIN- 2 DEPENDENT KINASE,

HORMONE/GROWTH FACTOR




HORMONE/GROWTH FACTOR P18-
INK4C; CELL CYCLE INHIBITOR,
P18INK4C, TUMOR, SUPPRESSOR,
CYCLIN- 2 DEPENDENT KINASE, 
HORMONE/GROWTH FACTOR 


Compound


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A; 


CYCLIN-DEPENDENT KINASE 

6 INHIBITOR; CHAIN: A;




CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A; 


SEQFOLD score














PMF 
score 


1.00


1.00 


1.00


1.00 
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1.00 


Verify 
score 
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nUKMUNE/uROWTH FACTOR 
CELL CYCLE INHIBITOR P18-
INK4C(INK6); CELL CYCLE
INHIBITOR, P18-INK4C(INK6),
ANKYRIN REPEAT, 2 CDK 4/6 
INHIBITOR 


CELL CYCLE INHIBITOR P18-
INK4C(INK6); CELL CYCLE
INHIBITOR, P18-INK4C(INK6),
ANKYRIN REPEAT, 2 CDK 4/6
INHIBITOR 


CELL CYCLE INHIBITOR P18-
INK4C(INK6); CELL CYCLE
INHIBITOR, P18-INK4C(INK6),
ANKYRIN REPEAT, 2 CDK 4/6 
INHIBITOR 


CELL CYCLE INHIBITOR P18-
INK4C(INK6); CELL CYCLE
INHIBITOR, P18-INK4C(INK6), 
ANKYRIN REPEAT, 2 CDK 4/6 
INHIBITOR 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR,
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 

TRANSCRIPTION FACTOR P65;


Compound


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A, B; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A, B; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A, B; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A, B; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D;


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D;


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 
NF-KAPPA-B P65 SUBUNIT; 
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PDB annotation 




BLOOD COAGULATION, SERINE
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II;
FETOMODULIN, TM, CD141
ANTIGEN; EGR-CMK SERINE
PROTEINASE, EGF-LIKE DOMAINS,
ANTICOAGULANT COMPLEX, 2 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141
ANTIGEN; EGR-CMK SERINE
PROTEINASE, EGF-LIKE DOMAINS, 
ANTICOAGULANT COMPLEX, 2 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141
ANTIGEN; EGR-CMK SERINE
PROTEINASE, EGF-LIKE DOMAINS,
ANTICOAGULANT COMPLEX, 2


Compound


TNF-RELATED APOPTOSIS 
INDUCING LIGAND; CHAIN: B; 
DEATH RECEPTOR 5; CHAIN: 
A; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C;


THROMBIN LIGHT CHAIN; 
CHAIN: A, B, C, D; THROMBIN 
HEAVY CHAIN; CHAIN: M, N, 
O, P; THROMBOMODULIN; 
CHAIN: I, J, K, L; THROMBIN
INHIBITOR L-GLU-L-GLY-L-
ARM; CHAIN: E, F, G, H;


THROMBIN LIGHT CHAIN; 
CHAIN: A, B, C, D; THROMBIN 
HEAVY CHAIN; CHAIN: M, N, 
O, P; THROMBOMODULIN; 
CHAIN: I, J, K, L; THROMBIN 
INHIBITOR L-GLU-L-GLY-L- 
ARM; CHAIN: E, F, G, H;


THROMBIN LIGHT CHAIN; 
CHAIN: A, B, C, D; THROMBIN 
HEAVY CHAIN; CHAIN: M, N, 
O, P; THROMBOMODULIN; 
CHAIN: I, J, K, L; THROMBIN 
INHIBITOR L-GLU-L-GLY-L- 
ARM; CHAIN: E, F, G, H;
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INHIBITOR, THROMBOSIS 
SERINE PROTEASE INHIBITOR 
FACTOR XA INHIBITOR;
ANTISTASIN, CRYSTAL
STRUCTURE, FACTOR XA
INHIBITOR, 2 SERINE PROTEASE


BLOOD COAGULATION FACTOR 
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN






LYASE 2-PHOSPHO-D-GLYCERATE 
HYHP HT A op. t va cn rrt vr*s\T vote 




PROTEIN KINASE INHIBITOR PKCI-
1, PROTEIN KINASE C INHIBITOR 1,
HINT PROTEIN KINASE INHIBITOR,
PKCM, HIT PROTEIN FAMILY, 2
HISTIDINE TRIAD PROTEIN
FAMILY, NUCLEOTIDYL 
HYDROLASE, 3 NUCLEOTIDYL 
TRANSFERASE 

PROTEIN KINASE INHIBITOR PKC.U 


Compound


ANTISTASIN; CHAIN: NULL; 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


METALLOTHIONEIN 
METALLOTHIONEIN ISOFORM 
II 4MT2 3 


LECTIN (AGGLUTININ) WHEAT 
GERM AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


ENOLASE; CHAIN: A, B; 


ENOLASE; 1PDZ 4 CHAIN:
NULL; 1PDZ 5


PROTEIN KINASE C 
INTERACTING PROTEIN; 
CHAIN: NULL; 

PROTEIN KINASE C 


SEQFOL 
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COMPLEX (TR ANSCRIPTION 
REGULATION/DNA) GABPALPHA; 
GABPBETA1; COMPLEX
(TRANSCRIPTION 

REGULATION/DNA), DNA-BINDING, 
2 NUCLEAR PROTEIN, ETS DOMAIN, 
ANKYRIN REPEATS, 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 


-r^nx-w. x ivlkj Air 

COMPLEX (KINASE/ANTI-
ONCOGENE) CDK6; P16INK4A,
MTS1; CYCLIN DEPENDENT
KINASE, CYCLIN DEPENDENT
KINASE INHIBITORY 2 PROTEIN,
CDK, INK4, CELL CYCLE, MULTIPLE
TUMOR SUPPRESSOR, 3 MTS1,
COMPLEX (KINASE/ANTI-


COMPLEX (KINASE/ANTI-
ONCOGENE) CDK6; P16INK4A,
MTS1; CYCLIN DEPENDENT
KINASE, CYCLIN DEPENDENT
KINASE INHIBITORY 2 PROTEIN,
CDK, INK4, CELL CYCLE, MULTIPLE
TUMOR SUPPRESSOR, 3 MTS1,
COMPLEX (KINASE/ANTI-
ONCOGENE) HEADER


COMPLEX (INHIBITOR
PROTEIN/KINASE) INHIBITOR
PROTEIN, CYCLIN-DEPENDENT
KINASE, CELL CYCLE 2 CONTROL,
ALPHA/BETA, COMPLEX


Compound


GA BINDING PROTEIN ALPHA; 
CHAIN: A; GA BINDING 
PROTEIN BETA 1; CHAIN: B; 
DNA; CHAIN: D, E; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; MULTIPLE 
TUMOR SUPPRESSOR; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; MULTIPLE 
TUMOR SUPPRESSOR; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 
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OOl 



COMPLEX (ANTI- 
ONCOGENE/ANKYRIN REPEATS) 
P53BP2; ANKYRIN REPEATS, SH3, 
P53, TUMOR SUPPRESSOR, 
MULTIGENE 2 FAMILY, NUCLEAR 
PROTEIN, PHOSPHORYLATION, 
DISEASE MUTATION, 3 
POLYMORPHISM, COMPLEX (ANTI- 
ONCOGENE/ANKYRIN REPEATS) 






BP; IX-BP; COAGULATION FACTOR
IX-BINDING, HETERODIMER,
VENOM, HABU 2 SNAKE, C-TYPE 
LECTIN SUPERFAMILY, COLLAGEN 
BINDING PROTEIN 




LECTIN-LIKE DOMAINS


Compound


P53; CHAIN: A; 53BP2; CHAIN: 
B; 


MANNOSE-BINDING PROTEIN- 
A; 1AFB 4 CHAIN: 1, 2, 3; 1AFB
5


COAGULATION FACTOR IX-
BINDING PROTEIN A; CHAIN: 
A; COAGULATION FACTOR IX- 
BINDING PROTEIN B; CHAIN: 
B; 


COAGULATION FACTOR IX-
BINDING PROTEIN A; CHAIN: 
A; COAGULATION FACTOR IX- 
BINDING PROTEIN B; CHAIN: 
B; 


FLAVOCETIN-A: ALPHA 
SUBUNIT; CHAIN: A; 
FLAVOCETIN-A: BETA 
SUBUNIT; CHAIN: B 


FLAVOCETIN-A: ALPHA 
SUBUNIT; CHAIN: A; 
FLAVOCETIN-A: BETA 
SUBUNIT; CHAIN: B 
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PDB annotation 


4, C-TYPE LECTIN, 2 
CARBOHYDRATE RECOGNITION 
DOMAIN 


ANTIFREEZE PROTEIN 
RECOMBINANT SEA RAVEN 
PROTEIN, SOLUTION BACKBONE 
FOLD, C- 2 TYPE LECTIN, 
ANTIFREEZE PROTEIN 






COMPLEX (PROTEASE 
INHIBITOR/PEPTIDE) PAI-1; SERINE
PROTEASE INHIBITOR, PAI-1, 
CARBOHYDRATE, INHIBITOR 2 


SERPIN ACT; SERPIN, SERINE
PROTEASE INHIBITOR


Compound




SEA RAVEN TYPE II 
ANTIFREEZE PROTEIN; CHAIN: 
A; 


DIAMINOPIMELIC ACID 
DEHYDROGENASE; CHAIN: A, 
B; 


OXIDOREDUCTASE (/NAD$(A)-
ALDEHYDE(D)) D-
GLYCERALDEHYDE-3-
PHOSPHATE
DEHYDROGENASE
(E.C.1.2.1.12) 3GPD 4


OXIDOREDUCTASE (/NAD$(A)-
ALDEHYDE(D)) D-
GLYCERALDEHYDE-3-
PHOSPHATE
DEHYDROGENASE
(E.C.1.2.1.12) 3GPD 4


PLASMINOGEN ACTIVATOR 
INHIBITOR TYPE 1; CHAIN: A; 
PENTAPEPTIDE; CHAIN: B, C 


ANTICHYMOTRYPSIN; CHAIN: 
A, B; 


SEQFOLD score
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PDB annotation 


HISTOCOMPATIBILITY COMPLEX 


IMMUNE SYSTEM 
IMMUNOGLOBULIN FOLD




IMMUNE SYSTEM MHC, HLA, 
CLASS I, KIR, NK CELL RECEPTOR, 
IMMUNOGLOBULIN 2 FOLD, 


RECEPTOR/MHC COMPLEX






Compound


(GGKKKYKL - INDEX 
PEPTIDE); CHAIN: C;


HLA-A*0201; CHAIN: A, D; 
BETA-2 MICROGLOBULIN; 
CHAIN: B, E; HTLV-1
OCTAMERIC TAX PEPTIDE;
CHAIN: C, F;


HLA-CW3 (HEAVY CHAIN); 
CHAIN: A; BETA-2- 
MICROGLOBULIN; CHAIN: B; 
PEPTIDE FROM IMPORTIN 
ALPHA-2; CHAIN: C; NATURAL 
KILLER CELL RECEPTOR 
KIR2DL2; CHAIN: D, E;


HLA-CW3 (HEAVY CHAIN); 
CHAIN: A; BETA-2- 
MICROGLOBULIN; CHAIN: B; 
PEPTIDE FROM IMPORTIN 
ALPHA-2; CHAIN: C; NATURAL 
KILLER CELL RECEPTOR 
KIR2DL2; CHAIN: D, E;


HISTOCOMPATIBILITY 
ANTIGEN MURINE CLASS I 
MAJOR HISTOCOMPATIBILITY 
COMPLEX CONSISTING 1HOC 3
OF H-2D=B=, B2-
MICROGLOBULIN, AND A 9-
RESIDUE PEPTIDE 1HOC 4


HISTOCOMPATIBILITY 
ANTIGEN HUMAN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN 1HSA 3 /HLA-
B(ASTERISK)2705$ 1HSA 4
HISTOCOMPATIBILITY 
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HISTOCOMPATIBILITY COMPLEX 
B8; B2M; PEPTIDE HLA B8, HIV, 
MHC CLASS I, 

HISTOCOMPATIBILITY COMPLEX 


HISTOCOMPATIBILITY COMPLEX 
B8; B2M; PEPTIDE HLA B8, HIV, 
MHC CLASS I, 

HISTOCOMPATIBILITY COMPLEX 


IMMUNE SYSTEM 
IMMUNOGLOBULIN FOLD 


IMMUNE SYSTEM MHC, HLA, 
CLASS I, KIR, NK CELL RECEPTOR,
IMMUNOGLOBULIN 2 FOLD,
RECEPTOR/MHC COMPLEX






Compound




B*3501; CHAIN: A, B; PEPTIDE 
VPLRPMTY; CHAIN: C; 


B*0801; CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
HIV-1 GAG PEPTIDE 
(GGKKKYKL - INDEX 
PEPTIDE); CHAIN: C;


B*0801; CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
HIV-1 GAG PEPTIDE
(GGKKKYKL - INDEX
PEPTIDE); CHAIN: C;


HLA-A*0201; CHAIN: A, D; 
BETA-2 MICROGLOBULIN; 
CHAIN: B, E; HTLV-1 
OCTAMERIC TAX PEPTIDE; 
CHAIN: C, F; 


HLA-CW3 (HEAVY CHAIN); 
CHAIN: A; BETA-2- 
MICROGLOBULIN; CHAIN: B; 
PEPTIDE FROM IMPORTIN 
ALPHA-2; CHAIN: C; NATURAL 
KILLER CELL RECEPTOR 
KIR2DL2; CHAIN: D, E;


HISTOCOMPATIBILITY 
ANTIGEN HUMAN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN 1HSA 3 /HLA-


SEQFOLD score
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PDB annotation 






<« 



IMMUNE SYSTEM
IMMUNOGLOBULIN (IG)-LIKE 
DOMAIN, ALPHA HELIX, BETA 
SHEET, 2 IMMUNE SYSTEM 






< 
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u 



HISTOCOMPATIBILITY 
ANTIGEN HUMAN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN 1HSA 3 /HLA-
B(ASTERISK)2705$ 1HSA 4


HISTOCOMPATIBILITY 
ANTIGEN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN AW68.1 
(LEUCOCYTE 1HSB 3
ANTIGEN) 1HSB 4


HISTOCOMPATIBILITY 
ANTIGEN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN AW68.1 
(LEUCOCYTE 1HSB 3
ANTIGEN) 1HSB 4


HISTOCOMPATIBILITY 
LEUKOCYTE ANTIGEN (HLA)- 
CW4 CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
HLA-CW4 SPECIFIC PEPTIDE; 
CHAIN: C; 


HISTOCOMPATIBILITY 
ANTIGEN TRUNCATED 
HUMAN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN HLA-AW68 1TMC 3
COMPLEXED WITH A
DECAMERIC PEPTIDE
(EVAPPEYHRK) 1TMC 4


HISTOCOMPATIBILITY 
ANTIGEN TRUNCATED 


SEQFOL 
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PDB annotation 




DISPLAY PEPTIDE 


IMMUNOGLOBULIN FOLD 










Compound


(GGKKKYKL - INDEX 
PEPTIDE); CHAIN: C;


IMMUNOGLOBULIN LAMBDA 
HEAVY CHAIN; CHAIN: A, B; 
ENGINEERED PEPTIDE; CHAIN: 
E, F;


HLA-A*0201; CHAIN: A, D; 
BETA-2 MICROGLOBULIN; 
CHAIN: B, E; HTLV-1
OCTAMERIC TAX PEPTIDE;
CHAIN: C, F;


HLA-CW3 (HEAVY CHAIN); 
CHAIN: A; BETA-2- 
MICROGLOBULIN; CHAIN: B; 
PEPTIDE FROM IMPORTIN 
ALPHA-2; CHAIN: C; NATURAL 
KILLER CELL RECEPTOR 
KIR2DL2; CHAIN: D, E; 


HISTOCOMPATIBILITY 
ANTIGEN HUMAN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN 1HSA 3 /HLA-
B(ASTERISK)2705$ 1HSA 4


HISTOCOMPATIBILITY 
ANTIGEN HUMAN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN 1HSA 3 /HLA-
B(ASTERISK)2705$ 1HSA 4


HISTOCOMPATIBILITY 
ANTIGEN CLASS I
HISTOCOMPATIBILITY
ANTIGEN AW68.1
(LEUCOCYTE 1HSB 3
ANTIGEN) 1HSB 4


SEQFOLD score
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PDB annotation 






MAJOR HISTOCOMPATIBILITY 
COMPLEX MHC NONCLASSICAL 
CHAIN, MHC-E, HLA-E, MHC CLASS 
HLA-E, HLA E, MAJOR 
HISTOCOMPATIBILITY COMPLEX, 
MHC, HLA, 2 BETA 2 
MICROGLOBULIN, PEPTIDE, 
LEADER PEPTIDE, 3 NON- 
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Coumpound 


HISTOCOMPATIBILITY 
ANTIGEN AW68.1 
(LEUCOCYTE 1HSB 3
ANTIGEN) 1HSB 4


HISTOCOMPATIBILITY 
ANTIGEN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN AW68.1 
(LEUCOCYTE 1HSB 3
ANTIGEN) 1HSB 4


HLA CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN HLA-E; CHAIN: A, C; 
BETA-2-MICROGLOBULIN; 
CHAIN: B, D; PEPTIDE 
(VMAPRTVLL); CHAIN: P, Q; 


HLA CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN HLA-E; CHAIN: A, C; 
BETA-2-MICROGLOBULIN; 
CHAIN: B, D; PEPTIDE 
(VMAPRTVLL); CHAIN: P, Q; 


HISTOCOMPATIBILITY 
LEUKOCYTE ANTIGEN (HLA)- 
CW4 CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
HLA-CW4 SPECIFIC PEPTIDE; 
CHAIN: C; 
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D score 
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PDB annotation 


MHC CLASS I, 

HISTOCOMPATIBILITY COMPLEX 


HISTOCOMPATIBILITY COMPLEX 
B8; B2M; PEPTIDE HLA B8, HIV, 
MHC CLASS I, 

HISTOCOMPATIBILITY COMPLEX 


IMMUNE SYSTEM 
IMMUNOGLOBULIN FOLD 


IMMUNE SYSTEM MHC, HLA, 
CLASS I, KIR, NK CELL RECEPTOR, 
IMMUNOGLOBULIN 2 FOLD, 
RECEPTOR/MHC COMPLEX 


IMMUNE SYSTEM MHC, HLA, 
CLASS I, KIR, NK CELL RECEPTOR, 
IMMUNOGLOBULIN 2 FOLD, 
RECEPTOR/MHC COMPLEX 






Compound


HIV-1 GAG PEPTIDE 
(GGKKKYKL - INDEX 
PEPTIDE); CHAIN: C; 


B*0801; CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
HIV-1 GAG PEPTIDE 
(GGKKKYKL - INDEX 
PEPTIDE); CHAIN: C; 


HLA-A*0201; CHAIN: A, D; 
BETA-2 MICROGLOBULIN; 
CHAIN: B, E; HTLV-1
OCTAMERIC TAX PEPTIDE;
CHAIN: C, F;


HLA-CW3 (HEAVY CHAIN); 
CHAIN: A; BETA-2- 
MICROGLOBULIN; CHAIN: B; 
PEPTIDE FROM IMPORTIN 
ALPHA-2; CHAIN: C; NATURAL 
KILLER CELL RECEPTOR 
KIR2DL2; CHAIN: D, E;


HLA-CW3 (HEAVY CHAIN); 
CHAIN: A; BETA-2- 
MICROGLOBULIN; CHAIN: B; 
PEPTIDE FROM IMPORTIN 
ALPHA-2; CHAIN: C; NATURAL 
KILLER CELL RECEPTOR 
KIR2DL2; CHAIN: D, E; 


HISTOCOMPATIBILITY 
ANTIGEN HUMAN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN 1HSA 3 /HLA-
B(ASTERISK)2705$ 1HSA 4


HISTOCOMPATIBILITY 
ANTIGEN HUMAN CLASS I 


SEQFOLD score
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PDB annotation 


IMMUNE SYSTEM 
IMMUNOGLOBULIN FOLD 


IMMUNE SYSTEM MHC, HLA, 
CLASS I, KIR, NK CELL RECEPTOR, 
IMMUNOGLOBULIN 2 FOLD, 
RECEPTOR/MHC COMPLEX 










Compound


HLA-A*0201; CHAIN: A, D;
BETA-2 MICROGLOBULIN; 
CHAIN: B, E; HTLV-1 
OCTAMERIC TAX PEPTIDE; 
CHAIN: C, F; 


HLA-CW3 (HEAVY CHAIN); 
CHAIN: A; BETA-2- 
MICROGLOBULIN; CHAIN: B; 
PEPTIDE FROM IMPORTIN 
ALPHA-2; CHAIN: C; NATURAL 
KILLER CELL RECEPTOR 
KIR2DL2; CHAIN: D,E; 


HISTOCOMPATIBILITY 
ANTIGEN HUMAN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN IHSA 3 /HLA- 
B(ASTERISK)2705$ IHSA 4 


HISTOCOMPATIBILITY 
ANTIGEN HUMAN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN IHSA 3 /HLA- 
B(ASTERISK)2705$ IHSA 4 


HISTOCOMPATIBILITY 
ANTIGEN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN AW68.1 
(LEUCOCYTE IHSB 3 
ANTIGEN) IHSB 4 


HISTOCOMPATIBILITY 
ANTIGEN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN AW68.1
(LEUCOCYTE IHSB 3 
ANTIGEN) IHSB 4 


SEQFOLD score
PMF
PDB annotation


HISTOCOMPATIBILITY COMPLEX
B8; B2M; PEPTIDE HLA B8, HIV, 
MHC CLASS I, 

HISTOCOMPATIBILITY COMPLEX 


HISTOCOMPATIBILITY COMPLEX 
B8; B2M; PEPTIDE HLA B8, HIV, 
MHC CLASS I, 

HISTOCOMPATIBILITY COMPLEX 


IMMUNE SYSTEM 
IMMUNOGLOBULIN FOLD 


IMMUNE SYSTEM MHC, HLA, 
CLASS I, KIR, NK CELL RECEPTOR,


Compound


HUMAN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN HLA-AW68 1TMC 3
COMPLEXED WITH A
DECAMERIC PEPTIDE
(EVAPPEYHRK) 1TMC 4




B*3501; CHAIN: A, B; PEPTIDE 
VPLRPMTY; CHAIN: C; 


B*3501; CHAIN: A, B; PEPTIDE 
VPLRPMTY; CHAIN: C; 


B*0801; CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
HIV-1 GAG PEPTIDE 
(GGKKKYKL - INDEX
PEPTIDE); CHAIN: C; 


B*0801; CHAIN: A; BETA-2
MICROGLOBULIN; CHAIN: B; 
HIV-1 GAG PEPTIDE 
(GGKKKYKL - INDEX 
PEPTIDE); CHAIN: C; 


HLA-A*0201; CHAIN: A, D; 
BETA-2 MICROGLOBULIN; 
CHAIN: B,E;HTLV-1 
OCTAMERIC TAX PEPTIDE; 
CHAIN: C,F; 


HLA-CW3 (HEAVY CHAIN); 
CHAIN: A; BETA-2- 


SEQFOLD score








00 




223.96 






PMF
IMMUNOGLOBULIN 2 FOLD,
RECEPTOR/MHC COMPLEX 










IMMUNE SYSTEM 
IMMUNOGLOBULIN (IG)-LIKE 
DOMAIN, ALPHA HELIX, BETA 
SHEET, 2 IMMUNE SYSTEM 


Compound


MICROGLOBULIN; CHAIN: B; 
PEPTIDE FROM IMPORTIN 
ALPHA-2; CHAIN: C; NATURAL 
KILLER CELL RECEPTOR 
KIR2DL2; CHAIN: D, E; 


HISTOCOMPATIBILITY 
ANTIGEN HUMAN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN IHSA 3 /HLA- 
B(ASTERISK)2705$ IHSA 4 


HISTOCOMPATIBILITY 
ANTIGEN HUMAN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN IHSA 3 /HLA- 
B(ASTERISK)2705$ IHSA 4 


HISTOCOMPATIBILITY 
ANTIGEN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN AW68.1 
(LEUCOCYTE IHSB 3 
ANTIGEN) IHSB 4 


HISTOCOMPATIBILITY 
ANTIGEN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN AW68.1 
(LEUCOCYTE IHSB 3 
ANTIGEN) IHSB 4 


HISTOCOMPATIBILITY 
LEUKOCYTE ANTIGEN (HLA)- 
CW4 CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
HLA-CW4 SPECIFIC PEPTIDE; 
CHAIN: C; 

HISTOCOMPATIBILITY 


SEQFOLD score
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PDB annotation 

CHAIN DIMER HEADER 


COMPLEX (ANTIBODY/ANTIGEN) 
FAB-12; VEGF; COMPLEX 
(ANTIBODY/ANTIGEN), 
ANGIOGENIC FACTOR 


IMMUNOGLOBULIN, IMMUNE 
SYSTEM 


IMMUNE SYSTEM
IMMUNOGLOBULIN 
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IMMUKfc bibltM AINlitJVjiJ i 
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Coumpound 


FAB FRAGMENT; CHAIN: L, H, 
J,K; VASCULAR 
ENDOTHELIAL GROWTH 
FACTOR; CHAIN: V, W; 


MONOCLONAL ANTIBODY 
MRK-16 (LIGHT CHAIN); 
CHAIN: A, C; MONOCLONAL 
ANTIBODY MRK-16 (HEAVY 
CHAIN); CHAIN: B, D; 


IMMUNOGLOBULIN OPG2 FAB, 
CONSTANT DOMAIN; CHAIN: 
L; IMMUNOGLOBULIN OPG2 
FAB, VARIABLE DOMAIN; 
CHAIN: H; 


COMPLEX 

(ANTIBODY/ANTIGEN) HYHEL- 
5 FAB COMPLEXED WITH 
BOB WHITE QUAIL LYSOZYME 
1BOL3 1BOL 95 


ANTIBODY R24 (LIGHT 
CHAIN); CHAIN: A; ANTIBODY 
R24 (HEAVY CHAIN); CHAIN: 
B;


CAMPATH- 1H:LIGHT CHAIN; 
CHAIN: L; CAMPATH- 
1H:HEAVY CHAIN; CHAIN: H; 
PEPTIDE ANTIGEN; CHAIN: P;


FAB ANTIBODY LIGHT CHAIN; 
CHAIN: L; FAB ANTIBODY 
HEAVY CHAIN; CHAIN: H; 


COMPLEX 

(ANTIBODY/ANTIGEN) FAB 
FRAGMENT OF THE 
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PDB annotation 




IMMUNOGLOBULIN 






CATALYTIC ANTIBODY 
CATALYTIC ANTIBODY 6D9 
CATALYTIC ANTIBODY, ESTER 
HYDROLYSIS, ESTEROLYTIC, FAB, 
2 IMMUNOGLOBULIN


COMPLEX (IMMUNOGLOBULIN 
IGG1/IGG2A) 






Compound


MONOCLONAL ANTIBODY 
F9.13.7 (IGG1) 1FBI 3
COMPLEXED WITH
LYSOZYME (E.C.3.2.1.17) 1FBI 4


4-4-20 (IG*G2A-KAPPA=) FAB
FRAGMENT; 1FLR 5 CHAIN: L,
H; 1FLR 6


IMMUNOGLOBULIN FAB 
FRAGMENT OF HUMANIZED 
ANTIBODY 4D5, VERSION 4 
1FVD3 


IMMUNOGLOBULIN FAB 
FRAGMENT OF HUMANIZED 
ANTIBODY 4D5, VERSION 4 
1FVD3 


IMMUNOGLOBULIN 6D9;
CHAIN: L, H;


IDIOTYPIC FAB 730.1.4 (IGG1)
OF VIRUS 1IAI 5 CHAIN: L, H;
1IAI 7 ANTI-IDIOTYPIC FAB
409.5.3 (IGG2A); 1IAI 9 CHAIN:
M, I 1IAI 10


COMPLEX
(ANTIBODY/BINDING
PROTEIN) IGG1 FAB
FRAGMENT COMPLEXED
WITH PROTEIN G (DOMAIN III)
1IGC 5 PROTEIN G,
STREPTOCOCCUS 1IGC 15


IMMUNOGLOBULIN IGG1 FAB'
FRAGMENT (B13I2) 1IGF 3


SEQFOLD score




56.39 




57.20 


57.33


60.65 


57.26 


58.26 


PMF 
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-0.12 
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PDB annotation 


IMMUNOGLOBULIN INTACT 
IMMUNOGLOBULIN, V REGION, C 
REGION, HINGE REGION 


CATALYTIC ANTIBODY SULFIDE
OXIDATION, MONOOXYGENASE, 
OXYGENATION, FAB, 2 
IMMUNOGLOBULIN, CATALYTIC 
ANTIBODY 




MUSCLE PROTEIN CONNECTIN,
NEXTM5; CELL ADHESION, 
GLYCOPROTEIN, 
TRANSMEMBRANE, REPEAT, 
BRAIN, 2 IMMUNOGLOBULIN FOLD, 
ALTERNATIVE SPLICING, SIGNAL, 3 
MUSCLE PROTEIN 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, 


COMPLEX (ANTIBODY/PEPTIDE
EPITOPE) ANTIBODY, PEPTIDE 
ANTIGEN, ANTITUMOR ANTIBODY, 
2 COMPLEX (ANTIBODY/PEPTIDE 
EPITOPE) 


COMPLEX (ANTIBODY/PEPTIDE
EPITOPE) ANTIBODY, PEPTIDE 
ANTIGEN, ANTITUMOR ANTIBODY, 
2 COMPLEX (ANTIBODY/PEPTIDE 
EPITOPE) 


COMPLEX (ANTIBODY/PEPTIDE
EPITOPE) ANTIBODY, PEPTIDE 
ANTIGEN, ANTITUMOR ANTIBODY, 


Compound


IGG1 INTACT ANTIBODY
MAB6U.3; CHAIN: A, B, C, D 


28B4 FAB; CHAIN: L, H; 


IMMUNOGLOBULIN FAB D44.1
(IGG1, KAPPA) (BALB/C
MOUSE, MONOCLONAL
ANTIBODY) 1MLB 5


TITIN; CHAIN: NULL; 


N1G9 (IGG1-LAMBDA);
CHAIN: L, H; 


SM3 ANTIBODY; CHAIN: L, H; 
PEPTIDE EPITOPE; CHAIN: P; 


SM3 ANTIBODY; CHAIN: L, H; 
PEPTIDE EPITOPE; CHAIN: P; 


SM3 ANTIBODY; CHAIN: L, H; 
PEPTIDE EPITOPE; CHAIN: P; 


SEQFOLD score




57.31 


56.48 






58.83 




56.66 


PMF 
score 


0.37 
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0.22 




0.41 




Verify
score
2 COMPLEX (ANTIBODY/PEPTIDE
EPITOPE)




COMPLEX (ANTIBODY/ELECTRON 
TRANSPORT) FAB E8; CYT C, 
ANTIGEN; IMMUNOGLOBULIN, 
IGG1 KAPPA, FAB FRAGMENT,
HORSE 2 CYTOCHROME C, 
COMPLEX (ANTIBODY/ELECTRON 
TRANSPORT)


TRANSFERASE TRK RECEPTOR, 
RECEPTOR TYROSINE KINASE, 3D- 
DOMAIN SWAPPING, 2 
TRANSFERASE 


CATALYTIC ANTIBODY 
CATALYTIC ANTIBODY, FAB, RING 
CLOSURE REACTION


CATALYTIC ANTIBODY 
CATALYTIC ANTIBODY, FAB, RING 




Compound




IMMUNOGLOBULIN IGG1
MONOCLONAL FAB
FRAGMENT (TE33) COMPLEX
WITH CHOLERA 1TET 3 TOXIN
PEPTIDE 3 (CTP3) 1TET 4


MUSCLE PROTEIN TITIN 
MODULE M5 (CONNECTIN) 
1TNM 3 (NMR, MINIMIZED
AVERAGE STRUCTURE) 1TNM
4 1TNM 58


E8 ANTIBODY; CHAIN: L, H; 
CYTOCHROME C; CHAIN: F; 


NT-3 GROWTH FACTOR 
RECEPTOR TRKC; CHAIN: A; 


IGG 5C8; CHAIN: L, H; 


IGG 5C8; CHAIN: L, H; 


IMMUNOGLOBULIN IGG2B 
(KAPPA) FAB FRAGMENT 
COMPLEXED WITH ANTIGEN 
2CGR 3 N-(P-CYANOPHENYL)-
N'-(DIPHENYLMETHYL)


SEQFOLD score




57.32 








57.06 




57.12 


PMF 
score 






0.05
0.01
Verify
PDB annotation




TRANSFERASE TRANSFERASE, 
CREATINE KINASE 


TRAN SFERAob tftf-UJ^, tfK/uiN-i iru 
CREATINE KINASE; BRAIN-TYPE 
CREATINE KINASE, CANCER, 
CELLULAR ENERGY 2 
METABOLISM, GUANIDINO 
KINASE, NEURODEGENERATIVE 
DISORDERS 


TRANSFERASE (CREATINE KINASE)

UMTCK, MIA-CK; MITOCHONDRIAL 
CREATINE KINASE, CANCER, 
CELLULAR ENERGY 2 
METABOLISM, GUANIDINO 
KINASE, MITOCHONDRIAL 
PERMEABILITY 3 TRANSITION, 
OCTAMER STABILITY 
TRANSFERASE CREATINE KINASE, 


Compound


GUANIDINE ACETIC ACID 
2CGR 4


IMMUNOGLOBULIN FAB 
FRAGMENT OF A HUMANIZED 
VERSION OF THE ANTI-CD18
2FGW 3 ANTIBODY 'H52' 
(HUH52-OZ FAB) 2FGW 4 


HIV-1 REVERSE 
TRANSCRIPTASE; CHAIN: A, B; 
MONOCLONAL ANTIBODY 28; 
CHAIN: C, D; DNA; CHAIN: E, F; 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN FAB' NEW
(LAMBDA LIGHT CHAIN) 7FAB 
3 


CREATINE KINASE; CHAIN: A, 


CREATINE KINASE, B CHAIN; 
CHAIN: A, B, C, D; 


CREATINE KINASE, 
UBIQUITOUS 

MITOCHONDRIAL; CHAIN: A, 
B, C, D, E, F, G,H; 

CREATINE KINASE; CHAIN: A; 


SEQFOLD score






56.78 


57.01 








PMF 
score 




-0.08 
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PDB annotation 




COMPLEX
(METALLOPROTEINASE/RECEPTOR)
CDMT1-MMP; TIMP-2 MATRIX
METALLOPROTEINASE, TISSUE
INHIBITOR OF 2
METALLOPROTEINASES,
PROTEINASE COMPLEX, PRO-
GELATINASE A 3 ACTIVATOR,
CRYSTAL STRUCTURE, COMPLEX 4
(METALLOPROTEINASE/RECEPTOR)




METALLOPROTEINASE MATRIX
METALLOPROTEINASE-3; MATRIX
METALLOPROTEINASE, MMP-3, 
NON-PEPTIDE INHIBITOR 


HYDROLASE MMP-2, 72KD TYPE IV
COLLAGENASE; HYDROLASE 
(METALLOPROTEASE), FULL- 
LENGTH, METALLOPROTEINASE, 2 
GELATINASE A 


HYDROLASE COL-2; BETA SHEET,
ALPHA HELIX, HYDROLASE

METALLOPROTEASE 


Compound


RHOA; CHAIN: B; 


MEMBRANE-TYPE MATRIX 
METALLOPROTEINASE; 
CHAIN: M; 

METALLOPROTEINASE 
INHIBITOR; CHAIN: T; 


HYDROLASE 1 
(METALLOPROTEASE) 
COLLAGENASE (E.C.3.4.24.7) 
(CATALYTIC DOMAIN) 
CRYSTAL FORM 1 1CGE 3 


STROMELYSIN-1; CHAIN: A;


GELATINASE A; CHAIN: A; 


HUMAN MATRIX 
METALLOPROTEINASE 2; 
CHAIN: A; 

FIBROBLAST (INTERSTITIAL) 


SEQFOLD score




103.34 


120.96 


132.96 


647.13 


90.80 
151.00 


PMF 
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Verify 
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PDB annotation 


2 NUCLEAR PROTEIN, ETS DOMAIN,
ANKYRIN REPEATS, 
TRANSCRIPTION 3 FACTOR 


COMPLEX (TRANSCRIPTION
REGULATION/DNA) GABPALPHA;
GABPBETA1; COMPLEX
(TRANSCRIPTION 

REGULATION/DNA), DNA-BINDING, 
2 NUCLEAR PROTEIN, ETS DOMAIN, 
ANKYRIN REPEATS, 
TRANSCRIPTION 3 FACTOR 


TUMOR SUPPRESSOR TUMOR
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


COMPLEX (KINASE/ANTI-
ONCOGENE) CDK6; P16INK4A,
MTS1; CYCLIN DEPENDENT
KINASE, CYCLIN DEPENDENT
KINASE INHIBITORY 2 PROTEIN,
CDK, INK4, CELL CYCLE, MULTIPLE
TUMOR SUPPRESSOR, 3 MTS1,
COMPLEX (KINASE/ANTI-
ONCOGENE) HEADER


COMPLEX (INHIBITOR
PROTEIN/KINASE) INHIBITOR
PROTEIN, CYCLIN-DEPENDENT
KINASE, CELL CYCLE 2 CONTROL,
ALPHA/BETA, COMPLEX
(INHIBITOR PROTEIN/KINASE)


COMPLEX (INHIBITOR
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 


Compound




GA BINDING PROTEIN ALPHA; 
CHAIN: A; GA BINDING 
PROTEIN BETA 1; CHAIN: B; 
DNA; CHAIN: D, E;


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; MULTIPLE 
TUMOR SUPPRESSOR; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


SEQFOLD score














PMF 
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0.29 


0.19 


0.04 


0.00 
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Verify 
score 
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PDB annotation 

DEHYDROGENASE 


OXIDOREDUCTASE 
OXIDOREDUCTASE, TROPANE 
ALKALOID BIOSYNTHESIS, 
REDUCTION OF 2 TROPINONE TO 
TROPINE, SHORT-CHAIN 
DEHYDROGENASE 


OXIDOREDUCTASE NAD-
DEPENDENT OXIDOREDUCTASE, 
SHORT-CHAIN ALCOHOL 2 
DEHYDROGENASE, PCB 
DEGRADATION


OXIDOREDUCTASE SEPIAPTERIN
REDUCTASE,

TETRAHYDROBIOPTERIN, 
OXIDOREDUCTASE 




Compound


TROPINONE REDUCTASE-I; 
CHAIN: A, B; 


CIS-BIPHENYL-2,3- 
DIHYDRODIOL-2,3- 
DEHYDROGENASE; CHAIN: 
NULL; 


CARBONYL REDUCTASE; 
CHAIN: A, B, C, D; 


17-BETA-HYDROXYSTEROID-
DEHYDROGENASE; CHAIN: 
NULL; 


7 ALPHA-HYDROXYSTEROID 
DEHYDROGENASE; CHAIN: A, 
B; 


OXIDOREDUCTASE 3-ALPHA, 
20-BETA-HYDROXYSTEROID 
DEHYDROGENASE 
(E.C.1.1.1.53) 1HDC 3
COMPLEXED WITH
CARBENOXOLONE 1HDC 4


SEPIAPTERIN REDUCTASE; 
CHAIN: NULL; 


TRIHYDROXYNAPHTHALENE 
REDUCTASE; CHAIN: A, B; 
TROPINONE REDUCTASE-II; 


SEQFOLD score
PDB annotation


OXIDOREDUCTASE, TROPANE
ALKALOID BIOSYNTHESIS, 
REDUCTION OF 2 TROPINONE TO 
PSEUDOTROPINE, SHORT-CHAIN 
DEHYDROGENASE 




ISOMERASE FKBP25 C-TERMINAL 
DOMAIN; FKBP12 HOMOLOGOUS
DOMAIN OF HFKBP25 ISOMERASE




COMPLEX (GTPASE- 
ACTIVATING/GTP-BINDING) 
COMPLEX (GTPASE- 
ACTIVATING/GTP-BINDING), 
GTPASE ACTIVATION 


TRANSPORT PROTEIN TC4; GTPASE, 
NUCLEAR TRANSPORT, 
TRANSPORT PROTEIN 


TRANSPORT PROTEIN TC4; GTPASE, 
NUCLEAR TRANSPORT, 
TRANSPORT PROTEIN 


SIGNALING PROTEIN GTP-BINDING 
PROTEINS, PROTEIN-PROTEIN 
COMPLEX, EFFECTORS 


SIGNALING PROTEIN G PROTEIN, 
GTP HYDROLYSIS, KINETIC 
CRYSTALLOGRAPHY, 2 SIGNALING 
PROTEIN 


SIGNALING PROTEIN PROTEIN- 
PROTEIN COMPLEX, 
ANTIPARALLEL COILED-COIL 
SMALL GTPASE KARYOPHERIN 


Compound


CHAIN: A, B; 




FKBP25; CHAIN: NULL; 




P50-RHOGAP; CHAIN: A, B, C; 
CDC42HS; CHAIN: D, E, F; 


GTP-BINDING PROTEIN RAN;
CHAIN: A, B; 


GTP-BINDING PROTEIN RAN; 
CHAIN: A, B; 


RAS-RELATED PROTEIN RAP- 
1A; CHAIN: A; PROTO- 
ONKOGENE 


SERINE/THREONINE PROTEIN
KINASE CHAIN: B; 


TRANSFORMING PROTEIN 
P21/H-RAS-1; CHAIN: A; 


HIS-TAGGED TRANSFORMING 
PROTEIN RHOA(0-181); CHAIN: 
A; PKN; CHAIN: B; 
RAN; CHAIN: A, C; IMPORTIN 


























SEQFOLD score






194.11 




50.20 


61.74 


64.26 


79.57 




93.92 


52.92 
60.48 


PMF
Verify


COMPLEX (SMALL
GTPASE/NUCLEAR PROTEIN)
COMPLEX (SMALL
GTPASE/NUCLEAR PROTEIN),
SMALL GTPASE, 2 NUCLEAR
TRANSPORT


COMPLEX (GTP-
BINDING/EFFECTOR) RAS-RELATED
PROTEIN RAB3A; COMPLEX (GTP-
BINDING/EFFECTOR), G PROTEIN,
EFFECTOR, RABCDR, 2 SYNAPTIC
EXOCYTOSIS, RAB PROTEIN,
RAB3A, RABPHILIN




HYDROLASE G PROTEIN,
VESICULAR TRAFFICKING, GTP 
HYDROLYSIS, RAB 2 PROTEIN, 
NEUROTRANSMITTER RELEASE, 
HYDROLASE 


TRANSFERASE TRANSFERASE, 
SERINE/THREONINE-PROTEIN 
KINASE, CASEIN KINASE, 2 


Compound


BETA SUBUNIT; CHAIN: B, D; 


RAP2A; CHAIN: NULL; 


RAC1; CHAIN: NULL;


RAN; CHAIN: A, C; NUCLEAR 
PORE COMPLEX PROTEIN 
NUP358; CHAIN: B, D; 


RAB-3A; CHAIN: A; 
RABPHILIN-3A; CHAIN: B; 


GTP BINDING PROTEIN (G25K); 
CHAIN: A; GTPASE 
ACTIVATING PROTEIN (RHG); 
CHAIN: B; 


RAB3A; CHAIN: A; 


PROTEIN KINASE CK2/ALPHA- 
SUBUNIT; CHAIN: NULL; 


SEQFOLD score




91.54 


62.77 
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SER/THR KINASE 

PROTEIN KINASE CDK2; PROTEIN
KINASE, CELL CYCLE,
PHOSPHORYLATION,
STAUROSPORINE, 2 CELL DIVISION,


COMPLEX (KINASE/INHIBITOR) 
CDK6; P19INK4D; CYCLIN
DEPENDENT KINASE, CYCLIN 
DEPENDENT KINASE INHIBITORY 2 
PROTEIN, CDK, INK4, CELL CYCLE, 
COMPLEX (KINASE/INHIBITOR) 
HEADER


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 






PROTEIN KINASE CDK2;
TRANSFERASE,
SERINE/THREONINE PROTEIN
KINASE, ATP-BINDING, 2 CELL
CYCLE, CELL DIVISION, MITOSIS, 
PHOSPHORYLATION 


Compound


CYCLIN-DEPENDENT PROTEIN 
KINASE 2; CHAIN: NULL; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A, C; CYCLIN- 
DEPENDENT KINASE 
INHIBITOR; CHAIN: B, D; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


PHOSPHOTRANSFERASE 
CAMP-DEPENDENT PROTEIN 
KINASE CATALYTIC SUBUNIT 
1CMK 3 (E.C.2.7.1.37) 1CMK 4


TRANSFERASE (PHOSPHOTRANSFERASE)
CAMP-DEPENDENT PROTEIN KINASE
(E.C.2.7.1.37) (CAPK) 1CTP 3
(CATALYTIC SUBUNIT) 1CTP 4


HUMAN CYCLIN-DEPENDENT 
KINASE 2; CHAIN: NULL; 


SEQFOLD score


129.21 


122.05 


133.50 
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102.19 


141.09 


PMF


TRANSFERASE MITOGEN
ACTIVATED PROTEIN KINASE;
TRANSFERASE, MAP KINASE, 
SERINE/THREONINE-PROTEIN 
KINASE, 2 P38 




TRANSFERASE MITOGEN
ACTIVATED PROTEIN KINASE, MAP 
2, ERK2; TRANSFERASE, 
SERINE/THREONINE-PROTEIN 
KINASE, MAP KINASE, 2 ERK2 


TRANSFERASE TRANSFERASE, 


SERINE/THREONINE-PROTEIN
KINASE, CASEIN KINASE, 2 
SER/THR KINASE 


PROTEIN KINASE PROTEIN
KINASE, CELL CYCLE,
PHOSPHORYLATION,
STAUROSPORINE, 2 CELL DIVISION, 
MITOSIS, INHIBITION 


Compound


P38 MAP KINASE; CHAIN: 
NULL; 


C-JUN N-TERMINAL KINASE; 
CHAIN: NULL; 


TWITCHIN; CHAIN: A, B; 


MAP KINASE P38; CHAIN:
NULL; 


ERK2; CHAIN: NULL; 


EXTRACELLULAR 
REGULATED KINASE 2; 
CHAIN: NULL; 


PROTEIN KINASE CK2/ALPHA- 
SUBUNIT; CHAIN: NULL; 


CYCLIN-DEPENDENT PROTEIN
KINASE 2; CHAIN: NULL; 


SEQFOLD score


117.40 


136.35 


105.97 


150.88 


128.64 


140.27 


108.11 


129.37 


PMF 
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TRANSFERASE, JNK3 MAP KINASE, 
SERINE/THREONINE PROTEIN 2 
KINASE 


KINASE KINASE, TWITCHIN, 
INTRASTERIC REGULATION 


TRANSFERASE MITOGEN 
ACTIVATED PROTEIN KINASE; 
TRANSFERASE, MAP KINASE, 


SERINE/THREONINE-PROTEIN 
KINASE, 2 P38 


TRANSFERASE MAP KINASE, 
SERINE/THREONINE PROTEIN 
KINASE, TRANSFERASE 


SERINE KINASE SERINE KINASE, 
TITIN, MUSCLE, AUTOINHIBITION 


TRANSFERASE MITOGEN 
ACTIVATED PROTEIN KINASE, MAP 
2, ERK2; TRANSFERASE, 


SERINE/THREONINE-PROTEIN 
KINASE, MAP KINASE, 2 ERK2 




HYDROLASE PROSEGMENT, 
PROPEPTIDE, INHIBITION, 
HYDROLASE 




HYDROLASE PROSEGMENT, 
PROPEPTIDE, INHIBITION, 
HYDROLASE 




CELL ADHESION PROTEIN 
EPITHELIAL CADHERIN DOMAINS 1 
AND 2, ECAD12; CADHERIN, CELL 
ADHESION PROTEIN, CALCIUM 
BINDING PROTEIN 


Compound


CHAIN: NULL; 


TWITCHIN; CHAIN: A, B; 


MAP KINASE P38; CHAIN: 
NULL; 


< 




TITIN; CHAIN: A, B; 


EXTRACELLULAR 
REGULATED KINASE 2; 
CHAIN: NULL; 






HUMAN PROCATHEPSIN L; 
CHAIN: A; 




HUMAN PROCATHEPSIN L; 
CHAIN: A; 




E-CADHERIN; CHAIN: A, B; 


SEQFOLD score




106.60 


151.09 


128.85 


99.53 


140.42 




368.72 




412.39 




124.30 


PMF 
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Verify 
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CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN










GUANINE NUCLEOTIDE EXCHANGE 
FACTOR RCC1; GUANINE
NUCLEOTIDE EXCHANGE FACTOR,
GEF, RAN, 2 RAS-LIKE NUCLEAR
GTP BINDING PROTEIN HEADER 
TER 


EYE LENS PROTEIN EYE LENS 
PROTEIN 


EYE LENS PROTEIN EYE LENS 
PROTEIN 


CRYSTALLIN GAMMA II- 
CRYSTALLIN EYE LENS PROTEIN,
CRYSTALLIN 


EYE-LENS PROTEIN EYE-LENS 
PROTEIN, BETA-CRYSTALLIN B


Compound


N-CADHERIN; CHAIN: A; 




LIPID-BINDING PROTEIN 
ADIPOCYTE LIPID-BINDING 
PROTEIN COMPLEXED WITH 
ARACHIDONIC 1ADL 3 ACID 
1ADL 4


LIPID-BINDING PROTEIN 
FATTY ACID BINDING 
PROTEIN (HUMAN MUSCLE, 
M-FABP) COMPLEXED 1HMR 3
WITH ONE MOLECULE OF
ELAIDIC ACID 1HMR 4


CELLULAR LIPOPHILIC 
TRANSPORT PROTEIN P2 
MYELIN PROTEIN (P2) 1PMP 3


REGULATOR OF 
CHROMOSOME 
CONDENSATION 1; CHAIN: A, 
B, C; 


GAMMAF CRYSTALLIN; 
CHAIN: NULL 


GAMMAE CRYSTALLIN; 
CHAIN: A, B 


GAMMA B-CRYSTALLIN; 
CHAIN: NULL; 


CIRCULARLY PERMUTED BB2- 
CRYSTALLIN; CHAIN: A, B; 


SEQFOLD score


129.50 




158.94 


146.24 


161.49 


141.30 


122.87 


121.88 


125.68 


80.96 


PMF
Verify
OXIDOREDUCTASE FATTY ACID
HYDROXYLASE; FATTY ACID 
MONOOXYGENASE, 
HEMOPROTEIN, P450 REMARK 


OXIDOREDUCTASE (OXYGENASE)




MOTOR PROTEIN MOTOR PROTEIN, 
ATPASE, MICROTUBULE
ASSOCIATED 


MOTOR PROTEIN MOTOR PROTEIN, 
ATPASE, MICROTUBULE
ASSOCIATED 


CONTRACTILE PROTEIN NCD, 
CLARET SEGREGATIONAL 
PROTEIN; NCD CRYSTAL 
STRUCTURE, MICROTUBULE 
MOTORS, KINESIN 2 SUPERFAMILY, 
CONTRACTILE PROTEIN 


CONTRACTILE PROTEIN NCD, 
CLARET SEGREGATIONAL 
PROTEIN; NCD CRYSTAL 
STRUCTURE, MICROTUBULE 
MOTORS, KINESIN 2 SUPERFAMILY, 
CONTRACTILE PROTEIN 


Compound


CALCIUM/PHOSPHOLIPID 
BINDING PROTEIN 
SYNAPTOTAGMIN I (FIRST C2 
DOMAIN) (CALB) 1RSY 3


CALCIUM/PHOSPHOLIPID 
BINDING PROTEIN 
SYNAPTOTAGMIN I (FIRST C2 
DOMAIN) (CALB) 1RSY 3




CYTOCHROME P450; CHAIN: A, 
B; 


CYTOCHROME P450 ERYF; 
1OXA 5 CHAIN: NULL 1OXA 6




KINESIN; CHAIN: NULL; 


KINESIN; CHAIN: NULL; 


MICROTUBULE MOTOR 
PROTEIN NCD; CHAIN: A, B, C, 
D; 


MICROTUBULE MOTOR 
PROTEIN NCD; CHAIN: A, B, C, 
D; 


SEQFOLD score




122.64 




197.06 


67.59 




210.20 








PMF
score
CONTRACTILE PROTEIN NCD,
CLARET SEGREGATIONAL
PROTEIN; NCD CRYSTAL 
STRUCTURE, MICROTUBULE 
MOTORS, KINESIN 2 SUPERFAMILY, 


MOTOR PROTEIN MOTOR PROTEIN,
CYTOSKELETON


MOTOR PROTEIN MOTOR PROTEIN, 


MOTOR PROTEIN MOTOR PROTEIN, 


MOTOR PROTEIN MOTOR PROTEIN,
CYTOSKELETON


MOTOR PROTEIN MOTOR PROTEIN,


CONTRACTILE PROTEIN KINESIN, 
MICROTUBULE-BASED MOTOR, 




CONTRACTILE PROTEIN
CONTRACTILE PROTEIN, KAR3,
KINESIN-RELATED PROTEIN,
MOTOR 2 PROTEIN, ATPASE, P-
LOOP, MICROTUBULE BINDING
PROTEIN


Compound


MICROTUBULE MOTOR 
PROTEIN NCD; CHAIN: A, B, C,
D; 


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN MOTOR NCD; CHAIN: 
A; 


KINESIN MOTOR NCD; CHAIN: 
A; 


KINESIN-LIKE PROTEIN KAR3; 
CHAIN: NULL; 


KINESIN-LIKE PROTEIN KAR3; 
CHAIN: NULL; 




SEQFOLD score


156.19 


139.58




PMF
score
1.00


0.98 


0.99 
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1.00 


Verify
CONTRACTILE PROTEIN
CONTRACTILE PROTEIN, KAR3, 
KINESIN-RELATED PROTEIN, 
MOTOR 2 PROTEIN, ATPASE, P- 
LOOP, MICROTUBULE BINDING 
PROTEIN 


COMPLEX (INHIBITOR/NUCLEASE)

COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS


COMPLEX (INHIBITOR/NUCLEASE)
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (INHIBITOR/NUCLEASE)
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (INHIBITOR/NUCLEASE)
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 


Compound


KINESIN-LIKE PROTEIN KAR3; 
CHAIN: NULL; 


KINESIN HEAVY CHAIN; 
CHAIN: A, B, C, D; 


KINESIN HEAVY CHAIN; 
CHAIN: A, B, C,D; 




RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


SEQFOLD score
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3 REPEATS 

COMPLEX (NUCLEAR
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 




COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 




INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 


CONNECTIN A71, CONNECTIN;
TITIN, CONNECTIN, FIBRONECTIN
TYPE III


Compound


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


HEMOLIN; CHAIN: A, B; 


HEMOLIN; CHAIN: A, B; 


TITIN; CHAIN: NULL; 


SEQFOLD score




















PMF
score
CELL ADHESION NEURAL CELL
ADHESION 


CELL ADHESION NEURAL CELL
ADHESION 


GROWTH FACTOR/GROWTH
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, CELL 
ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, CELL 
ADHESION 


CELL ADHESION LEUCINE RICH
REPEAT, CALCIUM BINDING, CELL 
ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, CELL
ADHESION 


CELL ADHESION LEUCINE RICH
REPEAT, CALCIUM BINDING, CELL 
ADHESION 


TRANSFERASE CRYSTAL 
STRUCTURE, RAB 

GERANYLGERANYLTRANSFERASE, 


Compound


AXONIN-1; CHAIN: A;


AXONIN-1; CHAIN: A; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


INTERNALIN B; CHAIN: A; 


INTERNALIN B; CHAIN: A; 


INTERNALIN B; CHAIN: A; 


INTERNALIN B; CHAIN: A; 


INTERNALIN B; CHAIN: A; 


RAB 

GERANYLGERANYLTRANSFE 
RASE ALPHA SUBUNIT; 


SEQFOLD score


MUSCLE PROTEIN CONNECTIN,
NEXTM5; CELL ADHESION, 
GLYCOPROTEIN, 
TRANSMEMBRANE, REPEAT, 
BRAIN, 2 IMMUNOGLOBULIN FOLD, 
ALTERNATIVE SPLICING, SIGNAL, 3 
MUSCLE PROTEIN






CELL ADHESION PROTEIN VCAM-
D1,2; 1VCA 6 IMMUNOGLOBULIN
SUPERFAMILY, INTEGRIN-BINDING
1VCA 15


GLYCOPROTEIN CD4; 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, T-CELL, 2 MHC 
LIPOPROTEIN, POLYMORPHISM 


TRANSCRIPTION RNA1P; RANGAP;
GTPASE-ACTIVATING PROTEIN 


Compound


CHAIN: B, D; 


T LYMPHOCYTE ADHESION 
GLYCOPROTEIN CD2 (HUMAN) 
1HNF 3


TITIN; CHAIN: NULL; 


MUSCLE PROTEIN TITIN
MODULE M5 (CONNECTIN)
1TNM 3 (NMR, MINIMIZED
AVERAGE STRUCTURE) 1TNM
4 1TNM 58


GLYCOPROTEIN FIBRONECTIN 
(TENTH TYPE III MODULE) 
(NMR, 36 STRUCTURES) 1TTF 3


HUMAN VASCULAR CELL 
ADHESION MOLECULE-1;
1VCA 4 CHAIN: A, B; 1VCA 5


T-CELL SURFACE 
GLYCOPROTEIN CD4; CHAIN: 
A, B; 


GTPASE-ACTIVATING 
PROTEIN RNAl SCHPO; 


SEQFOLD score
FOR SPI1, GTPASE-ACTIVATING
PROTEIN, GAP, RNA1P, RANGAP,
LRR, LEUCINE- 2 RICH REPEAT
PROTEIN, TWINNING,
HEMIHEDRAL TWINNING, 3
MEROHEDRAL TWINNING,
MEROHEDRY


ACETYLATION RNASE INHIBITOR, 
RIBONUCLEASE/ANGIOGENIN 
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS 


ACETYLATION RNASE INHIBITOR, 
RIBONUCLEASE/ANGIOGENIN 
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS


IMMUNE SYSTEM P58 NATURAL 
KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 
IMMUNOGLOBULIN


CELL ADHESION PROTEIN NCAM 
MODULE 2; CELL ADHESION, 
GLYCOPROTEIN, HEPARIN- 
BINDING, GPI-ANCHOR, 2 NEURAL 
ADHESION MOLECULE, 
IMMUNOGLOBULIN FOLD, 
HOMOPHILIC 3 BINDING, CELL 
ADHESION PROTEIN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


Compound


CHAIN: A, B; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 


MHC CLASS I NK CELL 
RECEPTOR PRECURSOR; 
CHAIN: A; 


NEURAL CELL ADHESION 
MOLECULE, LARGE ISOFORM; 
CHAIN: A; 


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


SEQFOLD score
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PMF 
score 
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Verify 
score 
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CONTRACTILE PROTEIN NOU, 
CLARET SEGREGATIONAL 
PROTEIN; NCD CRYSTAL 
STRUCTURE, MICROTUBULE 
MOTORS, KINESIN 2 SUPERFAMILY, 
CONTRACTILE PROTEIN 


CONTRACTILE PROTEIN TRIPLE-
HELIX COILED COIL, 
CONTRACTILE PROTEIN 


TRANSCRIPTION REGULATION
SIGMA70; RNA POLYMERASE
SIGMA FACTOR, TRANSCRIPTION
REGULATION


MOTOR PROTEIN MOTOR PROTEIN,
CYTOSKELETON 




MOTOR PROTEIN MOTOR PROTEIN,
CYTOSKELETON 


CONTRACTILE PROTEIN
CONTRACTILE PROTEIN, KAR3,
KINESIN-RELATED PROTEIN,
MOTOR 2 PROTEIN, ATPASE, P- 
LOOP, MICROTUBULE BINDING 
PROTEIN 


MOTOR PROTEIN MOTOR PROTEIN,
CYTOSKELETON 




SURFACE PROTEIN MEROZOITE
SURFACE ANTIGEN 1, MAJOR 
BLOOD-STAGE EGF-LIKE DOMAIN, 
EXTRACELLULAR, MODULAR 


Compound


MICROTUBULE MOTOR 
PROTEIN NCD; CHAIN: A, B, C, 
D; 


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


RNA POLYMERASE PRIMARY 
SIGMA FACTOR; CHAIN: NULL; 


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN-LIKE PROTEIN KAR3; 
CHAIN: NULL; 


KINESIN HEAVY CHAIN; 
CHAIN: A, B, C,D; 


KINESIN HEAVY CHAIN; 
CHAIN: A, B, C,D; 




MEROZOITE SURFACE 
PROTEIN 1; CHAIN: A; 


SEQFOLD score
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PROTEIN, SURFACE 2 ANTIGEN, 
MALARIA VACCINE COMPONENT, 
SURFACE PROTEIN 


SURFACE PROTEIN MEROZOITE 
SURFACE ANTIGEN 1, MAJOR
BLOOD-STAGE EGF-LIKE DOMAIN, 
EXTRACELLULAR, MODULAR 
PROTEIN, SURFACE 2 ANTIGEN, 
MALARIA VACCINE COMPONENT, 
SURFACE PROTEIN 


SURFACE PROTEIN MEROZOITE 
SURFACE ANTIGEN 1, MAJOR 
BLOOD-STAGE EGF-LIKE DOMAIN,
EXTRACELLULAR, MODULAR 
PROTEIN, SURFACE 2 ANTIGEN, 
MALARIA VACCINE COMPONENT, 
SURFACE PROTEIN


SURFACE PROTEIN MEROZOITE
SURFACE ANTIGEN 1, MAJOR
BLOOD-STAGE EGF-LIKE DOMAIN, 
EXTRACELLULAR, MODULAR 
PROTEIN, SURFACE 2 ANTIGEN, 
MALARIA VACCINE COMPONENT, 
SURFACE PROTEIN


APOPTOSIS TRAIL; DR5; LIGAND- 
RECEPTOR COMPLEX, TRIMERIC 
JELLY-ROLL, TNF-R 2 
SUPERFAMILY, APOPTOSIS


MEMBRANE PROTEIN NMR, 
THROMBIN, EGF MODULE, 
ANTICOAGULANT, 

GLYCOSYLATION


MEMBRANE PROTEIN NMR, 
THROMBIN, EGF MODULE,


Compound




MEROZOITE SURFACE 
PROTEIN 1; CHAIN: A; 


MEROZOITE SURFACE 
PROTEIN 1; CHAIN: A; 


MEROZOITE SURFACE 
PROTEIN 1; CHAIN: A; 


TNF-RELATED APOPTOSIS 
INDUCING LIGAND; CHAIN: B; 
DEATH RECEPTOR 5; CHAIN: 
A; 


THROMBOMODULIN; CHAIN: 
A; 


THROMBOMODULIN; CHAIN: 
A; 


SEQFOLD score
















PMF 
score 




0.41 


-0.14 


0.53
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0.15 


Verify
score


MEMBRANE PROTEIN NMR,
THROMBIN, EGF MODULE,
ANTICOAGULANT,
GLYCOSYLATION


COAGULATION FACTOR
STRUCTURE, EPIDERMAL GROWTH 
FACTOR, EGF, 2 CALCIUM- 
BINDING, EGF-LIKE DOMAIN, 
STRUCTURE AND FUNCTION, 3 
HUMAN FACTOR IX, 
COAGULATION FACTOR 


BLOOD CLOTTING FACTOR VII,
BLOOD COAGULATION, EGF-LIKE 
DOMAIN, BLOOD 2 CLOTTING 


BLOOD CLOTTING PROTEIN
INHIBITOR COMPLEX, 
COAGULATION COFACTOR, 
PROTEASE 


BLOOD CLOTTING PROTEIN
INHIBITOR COMPLEX, 
COAGULATION COFACTOR, 
PROTEASE 


BLOOD CLOTTING PROTEIN
INHIBITOR COMPLEX, 
COAGULATION COFACTOR, 
PROTEASE 


BLOOD CLOTTING PROTEIN
INHIBITOR COMPLEX, 


Compound




THROMBOMODULIN; CHAIN: 

A; 


THROMBOMODULIN; CHAIN: 
A; 


FACTOR IX; CHAIN: B, C; 


BLOOD COAGULATION 
FACTOR VII; CHAIN: A; 


COAGULATION FACTOR XA; 
CHAIN: A; COAGULATION 
FACTOR XA; CHAIN: L; 


COAGULATION FACTOR XA; 
CHAIN: A; COAGULATION 
FACTOR XA; CHAIN: L; 


COAGULATION FACTOR XA; 
CHAIN: A; COAGULATION 
FACTOR XA; CHAIN: L; 


COAGULATION FACTOR XA; 
CHAIN: A; COAGULATION 


SEQFOLD score




















PMF 
score 




0.00 


0.21 


0.23 


0.04 


0.23 


0.42 


0.25 
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Verify 
score 
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COAGULATION COF ACTOR, 
PROTEASE 


BLOOD CLOTTING PROTEIN
INHIBITOR COMPLEX,
COAGULATION COFACTOR,
PROTEASE 










COMPLEX (PROTEASE/INHIBITOR) 
RTAP; GLYCOPROTEIN, SERINE 
PROTEASE, PLASMA, BLOOD 
COAGULATION, 2 COMPLEX 
(PROTEASE/INHIBITOR)


COMPLEX (PROTEASE/INHIBITOR) 
RTAP; GLYCOPROTEIN, SERINE 
PROTEASE, PLASMA, BLOOD 
COAGULATION, 2 COMPLEX
(PROTEASE/INHIBITOR)


COMPLEX (PROTEASE/INHIBITOR) 
RTAP; GLYCOPROTEIN, SERINE 
PROTEASE, PLASMA, BLOOD 
COAGULATION, 2 COMPLEX 
(PROTEASE/INHIBITOR) 
GLYCOPROTEIN GLYCOPROTEIN 


Compound


FACTOR XA; CHAIN: L; 


COAGULATION FACTOR XA; 
CHAIN: A; COAGULATION 
FACTOR XA; CHAIN: L; 


COAGULATION FACTOR
BLOOD COAGULATION
FACTOR XA 1HCG 3


COAGULATION FACTOR
BLOOD COAGULATION
FACTOR XA 1HCG 3


COAGULATION FACTOR
BLOOD COAGULATION
FACTOR XA 1HCG 3


COAGULATION FACTOR
BLOOD COAGULATION
FACTOR XA 1HCG 3


FACTOR XA; CHAIN: H, L; 
ANTICOAGULANT PEPTIDE; 
CHAIN: I; 


FACTOR XA; CHAIN: H, L; 
ANTICOAGULANT PEPTIDE; 
CHAIN: I; 


FACTOR XA; CHAIN: H, L; 
ANTICOAGULANT PEPTIDE; 
CHAIN: I; 

LAMININ; CHAIN: NULL; 


SEQFOLD score




















PMF 
score 




0.59 


0.46 


0.33 


0.89 


0.95 


0.31 


0.82 


0.34 


-0.11 


Verify 
score 
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0.50 
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PDB annotation 


GLYCOPROTEIN GLYCOPROTEIN 


GLYCOPROTEIN GLYCOPROTEIN 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN - 


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 
PROTEASE 


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 


Compound


LAMININ; CHAIN: NULL;


LAMININ; CHAIN: NULL;


FACTOR IXA; CHAIN: C, L,; D-
PHE-PRO-ARG; CHAIN: I; 


FACTOR IXA; CHAIN: C, L,; D- 
PHE-PRO-ARG; CHAIN: I; 


FACTOR IXA; CHAIN: C, L,; D-
PHE-PRO-ARG; CHAIN: I; 


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 


SEQFOLD score
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PDB annotation 


PROTEASE 


SERINE PROTEASE FVIIA; FVIIA;
BLOOD COAGULATION, SERINE 
PROTEASE 


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 
PROTEASE 


SERINE PROTEASE FVIIA; FVIIA;
BLOOD COAGULATION, SERINE 
PROTEASE 


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 
PROTEASE 


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 
PROTEASE 


Compound


COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H;
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H;
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


SEQFOLD score
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PDB annotation


MEMBRANE ADHESION
CONSENSUS REPEAT, SUSHI,
COMPLEMENT CONTROL PROTEIN,
2 N-GLYCOSYLATION, MULTI-
DOMAIN, MEMBRANE ADHESION


COAGULATION FACTOR SERINE
PROTEINASE, BLOOD
COAGULATION, COAGULATION
FACTOR


COAGULATION FACTOR SERINE
PROTEINASE, BLOOD
COAGULATION, COAGULATION
FACTOR


COAGULATION FACTOR SERINE
PROTEINASE, BLOOD
COAGULATION, COAGULATION
FACTOR


COAGULATION FACTOR SERINE
PROTEINASE, BLOOD
COAGULATION, COAGULATION
FACTOR


SPERM ADHESIN ACIDIC SEMINAL
PROTEIN; SPERMADHESIN, BOVINE
SEMINAL PLASMA PROTEIN,
ACIDIC 2 SEMINAL FLUID PROTEIN,
ASFP, CUB DOMAIN, X-RAY
CRYSTAL 3 STRUCTURE, GROWTH
FACTOR


COMPLEX (SEMINAL PLASMA
PROTEIN/SPP) SEMINAL PLASMA
PROTEINS, SPERMADHESINS, CUB
DOMAIN 2 ARCHITECTURE,
COMPLEX (SEMINAL PLASMA
PROTEIN/SPP)


Compound


HUMAN BETA2- 
GLYCOPROTEIN I; CHAIN: A; 


COAGULATION FACTOR IX; 
CHAIN: A; COAGULATION 
FACTOR IX; CHAIN: B; 


COAGULATION FACTOR IX; 
CHAIN: A; COAGULATION 
FACTOR IX; CHAIN: B; 


COAGULATION FACTOR IX; 
CHAIN: A; COAGULATION 
FACTOR IX; CHAIN: B;


COAGULATION FACTOR IX;
CHAIN: A; COAGULATION 
FACTOR IX; CHAIN: B; 


ASFP; CHAIN: NULL; 


MAJOR SEMINAL PLASMA 
GLYCOPROTEIN PSP-I; CHAIN: 
A; MAJOR SEMINAL PLASMA 
GLYCOPROTEIN PSP-II; CHAIN: 
B 


SEQFOLD score
















PMF 
score 
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PDB annotation


PLASMINOGEN ACTIVATION


BLOOD COAGULATION FACTOR 
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN


BLOOD COAGULATION FACTOR
STUART FACTOR; BLOOD
COAGULATION FACTOR, SERINE
PROTEINASE, EPIDERMAL 2
GROWTH FACTOR LIKE DOMAIN


BLOOD COAGULATION FACTOR
STUART FACTOR; BLOOD
COAGULATION FACTOR, SERINE
PROTEINASE, EPIDERMAL 2
GROWTH FACTOR LIKE DOMAIN


BLOOD COAGULATION FACTOR 
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN


BLOOD COAGULATION FACTOR 
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN 


KINASE KINASE, SIGNAL 
TRANSDUCTION, 

CALCIUM/CALMODULIN


KINASE KINASE, SIGNAL 
TRANSDUCTION,


Compound


T-PLASMINOGEN ACTIVATOR 
F1-G; 1TPG 7 CHAIN: NULL;
1TPG 8


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


CALCIUM/CALMODULIN- 
DEPENDENT PROTEIN KINASE; 
CHAIN: NULL; 


CALCIUM/CALMODULIN- 
DEPENDENT PROTEIN KINASE;


SEQFOLD score
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PDB annotation 
CALCIUM/CALMODULIN 


TRANSFERASE TRANSFERASE, 
SERINE/THREONINE-PROTEIN 
KINASE, CASEIN KINASE, 2 
SER/THR KINASE 


COMPLEX (KINASE/INHIBITOR)
CDK6; P19INK4D; CYCLIN 
DEPENDENT KINASE, CYCLIN 
DEPENDENT KINASE INHIBITORY 2 
PROTEIN, CDK, INK4, CELL CYCLE, 
COMPLEX (KINASE/INHIBITOR)
HEADER HELIX 


COMPLEX (INHIBITOR
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 


COMPLEX (INHIBITOR
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 
PHOSPHOTRANSFERASE






Compound



CYCLIN-DEPENDENT KINASE 
6; CHAIN: A, C; CYCLIN- 
DEPENDENT KINASE 
INHIBITOR; CHAIN: B, D; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


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. CD 



TRANSFERASE(PHOSPHOTRA 
NSFERASE) CAMP- 
DEPENDENT PROTEIN KINASE 
(E.C.2.7.1.37) (CAPK) 1CTP 3


SEQFOLD score
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PDB annotation 

INTRASTERIC REGULATION


TRANSFERASE MITOGEN 
ACTIVATED PROTEIN KINASE, MAP 
2, ERK2; TRANSFERASE, 
SERINE/THREONINE-PROTEIN 
KINASE, MAP KINASE, 2 ERK2 


TRANSFERASE MITOGEN 
ACTIVATED PROTEIN KINASE, MAP 
2, ERK2; TRANSFERASE, 
SERINE/THREONINE-PROTEIN 
KINASE, MAP KINASE, 2 ERK2 


COMPLEX (MHC/VIRAL 
PEPTIDE/RECEPTOR) HLA-A2 
HEAVY CHAIN; CLASS I MHC, T- 
CELL RECEPTOR, VIRAL PEPTIDE, 2 
COMPLEX (MHC/VIRAL 
PEPTIDE/RECEPTOR 


COMPLEX (MHC/VIRAL
PEPTIDE/RECEPTOR) HLA-A2 
HEAVY CHAIN; CLASS I MHC, T- 
CELL RECEPTOR, VIRAL PEPTIDE, 2 
COMPLEX (MHC/VIRAL 
PEPTIDE/RECEPTOR 


COMPLEX (MHC/VIRAL
PEPTIDE/RECEPTOR) HLA A2 
HEAVY CHAIN; COMPLEX 
(MHC/VIRAL PEPTIDE/RECEPTOR) 


COMPLEX (MHC/VIRAL 
PEPTIDE/RECEPTOR) HLA A2 
HEAVY CHAIN; COMPLEX 
(MHC/VIRAL PEPTIDE/RECEPTOR) 


Compound


EXTRACELLULAR 
REGULATED KINASE 2; 
CHAIN: NULL; 


EXTRACELLULAR 
REGULATED KINASE 2; 
CHAIN: NULL; 


HLA-A 0201; CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
TAX PEPTIDE; CHAIN: C; T 
CELL RECEPTOR ALPHA; 
CHAIN: D; T CELL RECEPTOR 
BETA; CHAIN: E; 


HLA-A 0201; CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
TAX PEPTIDE; CHAIN: C; T 
CELL RECEPTOR ALPHA; 
CHAIN: D; T CELL RECEPTOR 
BETA; CHAIN: E; 


HLA-A 0201; CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
TAX PEPTIDE; CHAIN: C; T 
CELL RECEPTOR ALPHA; 
CHAIN: D; T CELL RECEPTOR 
BETA; CHAIN: E; 


HLA-A 0201; CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
TAX PEPTIDE; CHAIN: C; T 
CELL RECEPTOR ALPHA; 


SEQFOLD score




98.67 




128.38 
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PDB annotation 




RECEPTOR T CELL RECEPTOR 1BEC
14


RECEPTOR T CELL RECEPTOR 1BEC
14


IMMUNE SYSTEM HLA-DR1, DRA;
HLA-DR1, DRB1 0101; TCR HA1.7
ALPHA CHAIN; TCR HA1.7 BETA
CHAIN; PROTEIN-PROTEIN 
COMPLEX, IMMUNOGLOBULIN 
FOLD 


RECEPTOR TCR; T-CELL, 
RECEPTOR, TRANSMEMBRANE, 
GLYCOPROTEIN, SIGNAL


RECEPTOR TCR; T-CELL, 
RECEPTOR, TRANSMEMBRANE, 
GLYCOPROTEIN, SIGNAL 


COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 
COMPLEX (NUCLEAR 


Compound


CHAIN: D; T CELL RECEPTOR 
BETA; CHAIN: E; 


14.3.D T CELL ANTIGEN
RECEPTOR; 1BEC 5 CHAIN:
NULL; 1BEC 6


14.3.D T CELL ANTIGEN
RECEPTOR; 1BEC 5 CHAIN:
NULL; 1BEC 6


HLA CLASS II 
HISTOCOMPATIBILITY 
ANTIGEN, DR CHAIN: A; HLA 
CLASS II 

HISTOCOMPATIBILITY 
ANTIGEN, DR-1 CHAIN: B; 
HEMAGGLUTININ HA1
PEPTIDE CHAIN; CHAIN: C; T- 
CELL RECEPTOR ALPHA 
CHAIN; CHAIN: D; T-CELL 
RECEPTOR BETA CHAIN; 
CHAIN: E; 


ALPHA, BETA T-CELL 
RECEPTOR CHAIN: A, B; 


ALPHA, BETA T-CELL 
RECEPTOR CHAIN: A, B; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 

U2 RNA HAIRPIN IV; CHAIN: Q,


SEQFOLD score




130.56 








131.76 




PMF 
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PDB annotation 


PROTEIN/RNA) COMPLEX
(NUCLEAR PROTEIN/RNA), RNA,
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR
PROTEIN/RNA) COMPLEX
(NUCLEAR PROTEIN/RNA), RNA,
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR
PROTEIN/RNA) COMPLEX
(NUCLEAR PROTEIN/RNA), RNA,
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR
PROTEIN/RNA) COMPLEX
(NUCLEAR PROTEIN/RNA), RNA,
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR
PROTEIN/RNA) COMPLEX
(NUCLEAR PROTEIN/RNA), RNA,
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR
PROTEIN/RNA) COMPLEX
(NUCLEAR PROTEIN/RNA), RNA,
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR
PROTEIN/RNA) COMPLEX
(NUCLEAR PROTEIN/RNA), RNA,
SNRNP, RIBONUCLEOPROTEIN


CELL ADHESION NEURAL CELL,
ADHESION 


CELL ADHESION LEUCINE RICH
REPEAT, CALCIUM BINDING, CELL 
ADHESION 


CELL ADHESION LEUCINE RICH
REPEAT, CALCIUM BINDING, CELL 


Compound


R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q,
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D;


U2 RNA HAIRPIN IV; CHAIN: Q,
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D;


U2 RNA HAIRPIN IV; CHAIN: Q,
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D;


U2 RNA HAIRPIN IV; CHAIN: Q,
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D;


U2 RNA HAIRPIN IV; CHAIN: Q,
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D;


U2 RNA HAIRPIN IV; CHAIN: Q,
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D;


AXONIN-1; CHAIN: A; 


INTERNALIN B; CHAIN: A; 


INTERNALIN B; CHAIN: A; 


SEQFOLD score






















PMF 
score 






0.65 


0.51 


-0.08 


0.06 


0.89 
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PDB annotation 




LIGASE CYCLIN A/CDK2-
ASSOCIATED PROTEIN P45; CYCLIN
A/CDK2-ASSOCIATED PROTEIN P19;
SKP1, SKP2, F-BOX, LRR, LEUCINE-
RICH REPEAT, SCF, UBIQUITIN, 2
E3, UBIQUITIN PROTEIN LIGASE


LIGASE CYCLIN A/CDK2-
ASSOCIATED PROTEIN P45; CYCLIN
A/CDK2-ASSOCIATED PROTEIN P19;
SKP1, SKP2, F-BOX, LRR, LEUCINE-
RICH REPEAT, SCF, UBIQUITIN, 2
E3, UBIQUITIN PROTEIN LIGASE


LIGASE CYCLIN A/CDK2-
ASSOCIATED PROTEIN P45; CYCLIN
A/CDK2-ASSOCIATED PROTEIN P19;
SKP1, SKP2, F-BOX, LRR, LEUCINE-
RICH REPEAT, SCF, UBIQUITIN, 2
E3, UBIQUITIN PROTEIN LIGASE


LIGASE CYCLIN A/CDK2-
ASSOCIATED P45; CYCLIN A/CDK2-
ASSOCIATED P19; SKP1, SKP2, F-
BOX, LRRS, LEUCINE-RICH
REPEATS, SCF, 2 UBIQUITIN, E3,
UBIQUITIN PROTEIN LIGASE


LIGASE CYCLIN A/CDK2-
ASSOCIATED P45; CYCLIN A/CDK2-
ASSOCIATED P19; SKP1, SKP2, F-
BOX, LRRS, LEUCINE-RICH
REPEATS, SCF, 2 UBIQUITIN, E3,
UBIQUITIN PROTEIN LIGASE
ACETYLATION RNASE INHIBITOR, 


Compound


FACTOR 1; CHAIN: A, B; 


SKP2; CHAIN: A, C, E, G, I, K, M,
O; SKP1; CHAIN: B, D, F, H, J, L,
N, P;


SKP2; CHAIN: A, C, E, G, I, K, M,
O; SKP1; CHAIN: B, D, F, H, J, L,
N, P;


SKP2; CHAIN: A, C, E, G, I, K, M,
O; SKP1; CHAIN: B, D, F, H, J, L,
N, P;


SKP2; CHAIN: A, C; SKP1;
CHAIN: B, D;


SKP2; CHAIN: A, C; SKP1;
CHAIN: B, D;

RIBONUCLEASE INHIBITOR; 


SEQFOLD score
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RIBONUCLEAS E/ANGIOGENIN 
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS 


ACETYLATION RNASE INHIBITOR, 
RIBONUCLEASE/ANGIOGENIN 
INHIBITOR ACETYLATION,
LEUCINE-RICH REPEATS 




GLYCOPROTEIN GLYCOPROTEIN 




MUSCLE PROTEIN CTNC; CARDIAC, 
MUSCLE PROTEIN, REGULATORY,
CALCIUM BINDING


CALCIUM-BINDING PROTEIN 
CALMODULIN CERIUM TR1C- 
DOMAIN, RESIDUES 1 - 75; CERIUM- 
LOADED, CALCIUM-BINDING
PROTEIN 








STRUCTURAL PROTEIN HELIX- 
TURN-HELIX 


METAL TRANSPORT CALMODULIN, 
HIGH RESOLUTION, DISORDER 


Compound


CHAIN: NULL; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 




LAMININ; CHAIN: NULL;




TROPONIN C; CHAIN: NULL; 


CALMODULIN; CHAIN: NULL; 


CALCIUM-BINDING PROTEIN 
CALMODULIN COMPLEXED 
WITH CALMODULIN-BINDING
DOMAIN OF 1CDM 3
CALMODULIN-DEPENDENT
PROTEIN KINASE II 1CDM 4


CALCIUM-BINDING PROTEIN 
CALMODULIN (VERTEBRATE) 
1CLL 3


CALCIUM-BINDING PROTEIN 
CALMODULIN (VERTEBRATE) 
1CLL 3


CARDIAC TROPONIN C; 
CHAIN: A; 


CALMODULIN; CHAIN: A; 


SEQFOLD score
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CALCIUM-REGULATED MUSCLE
CONTRACTION MUSCLE 
CONTRACTION, CALCIUM- 
BINDING, TROPONIN, E-F HAND, 2 
OPEN CONFORMATION 
REGULATORY DOMAIN, CALCIUM- 
REGULATED 3 MUSCLE 
CONTRACTION


CALCIUM-REGULATED MUSCLE
CONTRACTION MUSCLE 
CONTRACTION, CALCIUM- 
BINDING, TROPONIN, E-F HAND, 2 
OPEN CONFORMATION 
REGULATORY DOMAIN, CALCIUM- 
REGULATED 3 MUSCLE 
CONTRACTION 




CALMODULIN, CALCIUM BINDING,
HELIX-LOOP-HELIX, SIGNALLING, 2
COMPLEX(CALCIUM-BINDING 
PROTEIN/PEPTIDE) 


CALMODULIN, CALCIUM BINDING,
HELIX-LOOP-HELIX, SIGNALLING, 2 
COMPLEX(CALCIUM-BINDING 
PROTEIN/PEPTIDE) 




TRANSFERASE RECEPTOR 
TYROSINE KINASE, PROTEIN 
INTERACTION MODULE, 2 


Compound


TROPONIN C; CHAIN: NULL; 


TROPONIN C; CHAIN: NULL; 


TROPONIN C; 1TNX 4 CHAIN:
NULL; 1TNX 5


CONTRACTILE SYSTEM 
PROTEIN TROPONIN C 1TOP 3


CONTRACTILE SYSTEM 
PROTEIN TROPONIN C 1TOP 3


CALMODULIN; CHAIN: A; 
RS20; CHAIN: B; 


CALMODULIN; CHAIN: A; 
RS20; CHAIN: B; 




EPHA4 RECEPTOR TYROSINE 
KINASE; CHAIN: A; 


SEQFOLD score


56.30 
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score 
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PDB annotation 


(TRANSFERASE/SUBSTRATE) 
TYROSINE KINASE, SIGNAL 
TRANSDUCTION, 
PHOSPHOTRANSFERASE, 2 
COMPLEX (KINASE/PEPTIDE 
SUBSTRATE/ATP ANALOG), 
ENZYME, 3 COMPLEX 
(TRANSFERASE/SUBSTRATE) 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN, RGD, 
EXTRACELLULAR MATRIX, 2 
HEPARIN-BINDING, 
GLYCOPROTEIN 


TRANSFERASE NUK; 
TRANSFERASE, EPH RECEPTOR 
TYROSINE KINASE, 


TRANSFERASE NUK; 
TRANSFERASE, EPH RECEPTOR 
TYROSINE KINASE, 


TYROSINE KINASE TYROSINE 
KINASE-INHIBITOR COMPLEX, 
DOWN-REGULATED KINASE, 2 
ORDERED ACTIVATION LOOP 


STRUCTURAL PROTEIN INTEGRIN, 
HEMIDESMOSOME, FIBRONECTIN, 
CARCINOMA, STRUCTURAL 2 
PROTEIN 


STRUCTURAL PROTEIN INTEGRIN, 
HEMIDESMOSOME, FIBRONECTIN, 
CARCINOMA, STRUCTURAL 2 
PROTEIN 


TRANSFERASE ALPHA BETA FOLD 


STRUCTURAL PROTEIN TENASCIN, 
FIBRONECTIN TYPE-III, HEPARIN,


Compound


PEPTIDE SUBSTRATE; CHAIN: 

B; 


FIBRONECTIN; CHAIN: NULL; 


TYROSINE-PROTEIN KINASE 
RECEPTOR EPH; CHAIN: A; 


TYROSINE-PROTEIN KINASE 
RECEPTOR EPH; CHAIN: A; 


HAEMATOPOETIC CELL 
KINASE (HCK); CHAIN: A; 


INTEGRIN BETA-4 SUBUNIT; 
CHAIN: A, B; 


INTEGRIN BETA-4 SUBUNIT; 
CHAIN: A, B; 


LCK KINASE; CHAIN: A;


TENASCIN; CHAIN: A, B; 


SEQFOLD score
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EXTRACELLULAR 2 MATRIX, 
ADHESION, FUSION PROTEIN, 
STRUCTURAL PROTEIN


TYROSINE-PROTEIN KINASE NMR, 
RECEPTOR OLIGOMERIZATION,
EPH RECEPTORS, TYROSINE 2
PHOSPHORYLATION, SIGNAL
TRANSDUCTION, TYROSINE-


4 


TRANSFERASE KDR; TYROSINE 
KINASE 


PROTEIN BINDING ED-B, 
FIBRONECTIN, TYPE III DOMAIN,
ANGIOGENESIS, PROTEIN 2 
BINDING 


PROTEIN BINDING ED-B,
FIBRONECTIN, TYPE III DOMAIN,
ANGIOGENESIS, PROTEIN 2 


COAGULATION FACTOR 


COMPLEX (TWO DNA-BINDING 
PROTEINS/DNA) MAT ALPHA-2; 
COMPLEX (TWO DNA-BINDING 
PROTEINS/DNA), COMPLEX, 2 DNA- 
BINDING PROTEIN, DNA, 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) GHF-1; COMPLEX


Compound




EPHRIN TYPE-B RECEPTOR 2; 
CHAIN: NULL; 


GLYCOPROTEIN FIBRONECTIN 
(TENTH TYPE III MODULE)
(NMR, 36 STRUCTURES) 1TTF 3


VASCULAR ENDOTHELIAL 
GROWTH FACTOR RECEPTOR 
CHAIN: A; 


FIBRONECTIN; CHAIN: A; 


FIBRONECTIN; CHAIN: A; 


HUMAN TISSUE FACTOR; 2HFT 
4 CHAIN: NULL; 2HFT 5 


MATING-TYPE PROTEIN A-1;
CHAIN: A; MATING-TYPE 
PROTEIN ALPHA-2; CHAIN: B; 
DNA; CHAIN: C; 


PIT-1; CHAIN: A, B; DNA; 
CHAIN: C, D; 


SEQFOLD score




















PMF 
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5" 

2 * o 
^ £ o ■ 

GQ P w 


PROTEIN/DNA HOMEODOMAIN,
DNA, COMPLEX, DNA-BINDING 
PROTEIN, PROTEIN/DNA 




COMPLEX (DNA-BINDING
PROTEIN/DNA) DNA-BINDING 
PROTEIN, DNA, PAIRED BOX, 
TRANSCRIPTION 2 REGULATION 




COMPLEX
(TRANSCRIPTION/HOMEOBOX/DNA)
PHEROMONE RECEPTOR
TRANSCRIPTION FACTOR;
MATING-TYPE PROTEIN ALPHA-2;
TRANSCRIPTION REGULATION, 
TRANSCRIPTIONAL REPRESSION, 2 
DNA-BINDING PROTEIN, COMPLEX 
(TRANSCRIPTION/HOMEOBOX/DNA 






Compound





HOMEOBOX PROTEIN HOX-B1;
CHAIN: A; PBX1; CHAIN: B;
DNA CHAIN: D; DNA CHAIN: E; 


INSULIN GENE ENHANCER 
PROTEIN ISL-1; CHAIN: NULL; 


HOMEOBOX PROTEIN PBX1;
CHAIN: A; 


PAIRED PROTEIN; CHAIN: A, B, 
C; DNA; CHAIN: D, E, F 


DNA-BINDING PROTEIN OCT-2
POU HOMEODOMAIN (NMR,
AVERAGE STRUCTURE) 1HDP


MCM1 TRANSCRIPTIONAL
REGULATOR; CHAIN: A, B; 
MAT ALPHA-2 
TRANSCRIPTIONAL 
REPRESSOR; CHAIN: C, D; STE6 
OPERATOR DNA; CHAIN: E, F; 


OCT-3; 1OCP 5 CHAIN: NULL;
1OCP 6


DNA BINDING PROTEIN OCT-1
POU HOMEODOMAIN DNA- 
BINDING PROTEIN MUTANT 


SEQFOLD score




















PMF 
score 
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PDB annotation 


HALOPEROXIDASE 
BROMOPEROXIDASE L, 
HALOPEROXIDASE L; 
HALOPEROXIDASE, 
OXIDOREDUCTASE 


HALOPEROXIDASE
HALOPEROXIDASE F; 
HALOPEROXIDASE, 
OXIDOREDUCTASE, PROPIONATE 
COMPLEX 


AMINOPEPTIDASE
AMINOPEPTIDASE, PROLINE 
IMINOPEPTIDASE, SERINE 
PROTEASE, 2 XANTHOMONAS 
CAMPESTRIS 


AMINOPEPTIDASE
AMINOPEPTIDASE, PROLINE 
IMINOPEPTIDASE, SERINE 
PROTEASE, 2 XANTHOMONAS 
CAMPESTRIS 


HYDROLASE 

HALOALKANE DEHALOGENASE, 
ALPHA/BETA-HYDROLASE 


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o 52 




HYDROLASE LINB, 1,3,4,6-
TETRACHLORO-1,4-
CYCLOHEXADIENE


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< 


CHLOROPEROXIDASE L; 
CHAIN: A, B, C;


CHLOROPEROXIDASE F; 
CHAIN: NULL; 


PROLINE IMINOPEPTIDASE; 
CHAIN: A, B; 


PROLINE IMINOPEPTIDASE; 
CHAIN: A, B; 


HALOALKANE 
DEHALOGENASE; CHAIN: 
NULL; 


HALOALKANE 
DEHALOGENASE; CHAIN: 
NULL; 


HALOALKANE 
DEHALOGENASE; 1- 
CHLOROHEXANE CHAIN: A; 


HALOALKANE 
DEHALOGENASE; CHAIN: A; 


SEQFOLD score






60.20 
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DEHALOGENASE, LINDANE, 

BIODEGRADATION, ALPHA/BETA- 
HYDROLASE


HYDROLASE HOMODIMER, 
ALPHA/BETA HYDROLASE FOLD, 
DISUBSTITUTED UREA 2 INHIBITOR


HYDROLASE HOMODIMER, 
ALPHA/BETA HYDROLASE FOLD,
DISUBSTITUTED UREA 2 INHIBITOR 


HYDROLASE LIPASE 
HYDROLASE 

PSEUDOMONADACEAE, CIS- 
PEPTIDE, CLOSED
CONFORMATION, 2 HYDROLASE, 
LID 




HYDROLASE ALPHA BETA 
HYDROLASE FOLD, PROLINE, 
PROLYL AMINOPEPTIDASE, 2 
SERRATIA, IMINOPEPTIDASE


HYDROLASE ALPHA BETA 
HYDROLASE FOLD, PROLINE, 
PROLYL AMINOPEPTIDASE, 2 


LIPASE LIPASE; LIPASE, 
HYDROLASE, 

PSEUDOMONADACEAE, COVALENT 
INTERMEDIATE, 2 TRIGLYCERIDE 
ANALOGUE, ENANTIOSELECTIVITY




Compound




EPOXIDE HYDROLASE; CHAIN: 
A, B; 


EPOXIDE HYDROLASE; CHAIN: 
A, B; 




HYDROXYNITRILE LYASE; 
CHAIN: A; 


PROLYL AMINOPEPTIDASE; 
CHAIN: A; 


PROLYL AMINOPEPTIDASE; 
CHAIN: A; 


TRIACYL-GLYCEROL-
HYDROLASE; CHAIN: D, E; 


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A;


SEQFOLD score
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COMPLEX, MULT1-SUBUNIT 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA)
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


ZINC FINGER TRANSCRIPTION
FACTOR SP1; ZINC FINGER,
TRANSCRIPTION ACTIVATION, SP1
COMPLEX (TRANSCRIPTION 


Compound

CHAIN: B; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C;


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


SP1F3; CHAIN: NULL; 
YY1; CHAIN: C; ADENO-


SEQFOLD score






76.20 
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REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ADR1, ZINC FINGER, NMR


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ADR1, ZINC FINGER, NMR




COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA)


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI;
GLI, ZINC FINGER, COMPLEX (DNA- 


Compound


ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


ADR1; CHAIN: NULL;


ADR1; CHAIN: NULL;


COMPLEX(TRANSCRIPTION 
REGULATION/DNA) 
TRAMTRACK PROTEIN (TWO 
ZINC-FINGER PEPTIDE) 
COMPLEXED WITH 2DRP 3 
DNA 2DRP 4 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


SEQFOLD score
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BINDING PROTEIN/DNA) 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 


BLOOD COAGULATION, SERINE
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


SERINE PROTEINASE
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE DOMAINS, 
ANTICOAGULANT COMPLEX, 2 
ANTIFIBRINOLYTIC COMPLEX 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE DOMAINS, 
ANTICOAGULANT COMPLEX, 2 
ANTIFIBRINOLYTIC COMPLEX 


SIGNALLING PROTEIN BINDING
PROTEIN, CYTOKINE, SIGNALLING 


Compound


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO-MAI; 
CHAIN: P; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H;
SOLUBLE TISSUE FACTOR;
CHAIN: T, U; D-PHE-PHE-ARG-
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C; 


THROMBIN LIGHT CHAIN; 
CHAIN: A, B, C, D; THROMBIN 
HEAVY CHAIN; CHAIN: M, N, 
O, P; THROMBOMODULIN; 
CHAIN: I, J, K, L; THROMBIN 
INHIBITOR L-GLU-L-GLY-L- 
ARM; CHAIN: E, F, G, H;


THROMBIN LIGHT CHAIN; 
CHAIN: A, B, C, D; THROMBIN 
HEAVY CHAIN; CHAIN: M, N, 
O, P; THROMBOMODULIN; 
CHAIN: I, J, K, L; THROMBIN 
INHIBITOR L-GLU-L-GLY-L- 
ARM; CHAIN: E, F, G, H;


TUMOR NECROSIS FACTOR 
RECEPTOR; CHAIN: A, B; 


SEQFOLD score




57.15 








PMF 
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PROTEIN 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, SIGNALLING 
PROTEIN 


GLYCOPROTEIN GLYCOPROTEIN 


GLYCOPROTEIN GLYCOPROTEIN


GLYCOPROTEIN GLYCOPROTEIN 


GLYCOPROTEIN GLYCOPROTEIN 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA,
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 


Compound




TUMOR NECROSIS FACTOR 
RECEPTOR; CHAIN: A, B; 


LAMININ; CHAIN: NULL;


LAMININ; CHAIN: NULL;


LAMININ; CHAIN: NULL;


LAMININ; CHAIN: NULL;


FACTOR IXA; CHAIN: C, L,; D-
PHE-PRO-ARG; CHAIN: I;


FACTOR IXA; CHAIN: C, L,; D-
PHE-PRO-ARG; CHAIN: I;


FACTOR IXA; CHAIN: C, L,; D-
PHE-PRO-ARG; CHAIN: I;


FACTOR IXA; CHAIN: C, L,; D-
PHE-PRO-ARG; CHAIN: I;


SEQFOLD score




59.06 








178.19
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PMF 
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CHRISTMAS FACTOR; COMPLEX,
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN 




SERINE PROTEASE FVIIA; FVIIA;
BLOOD COAGULATION, SERINE 
PROTEASE 




SERINE PROTEASE INHIBITOR
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS 


SERINE PROTEASE INHIBITOR
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS 
PLASMINOGEN ACTIVATION 


Compound




HYDROLASE CALCIUM-FREE 
PHOSPHOLIPASE A=2= 
(E.C.3.1.1.4) 1PP2 4


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


COAGULATION FACTOR VIIA
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


ANTISTASIN; CHAIN: NULL; 


ANTISTASIN; CHAIN: NULL; 
T-PLASMINOGEN ACTIVATOR 


SEQFOLD score












55.31 
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PLASMINOGEN ACTIVATION 


PLASMINOGEN ACTIVATION 


LIPID DEGRADATION 
PHOSPHOLIPASE A2, LIPID 
DEGRADATION, HYDROLASE


LIPID DEGRADATION 
PHOSPHOLIPASE A2, LIPID 
DEGRADATION, HYDROLASE




BLOOD COAGULATION FACTOR 
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2
GROWTH FACTOR LIKE DOMAIN


> 

1 
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! 




OXIDOREDUCTASE TWO DOMAIN 
MOTIF, ROSSMANN FOLD 


Compound


F1-G; 1TPG 7 CHAIN: NULL;
1TPG 8


T-PLASMINOGEN ACTIVATOR
F1-G; 1TPG 7 CHAIN: NULL;
1TPG 8


T-PLASMINOGEN ACTIVATOR
F1-G; 1TPG 7 CHAIN: NULL;
1TPG 8


PHOSPHOLIPASE A2; CHAIN: 
A, B;


PHOSPHOLIPASE A2; CHAIN: 
A, B; 


PHOSPHOLIPASE A2 
INHIBITOR; CHAIN: NULL 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


LECTIN (AGGLUTININ) WHEAT 
GERM AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


NADPH-CYTOCHROME P450 
REDUCTASE; CHAIN: A, B; 


CHLOROPLAST FERREDOXIN- 
NADP+ OXIDOREDUCTASE; 
CHAIN: A, B; 


SEQFOLD score
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1 



OXIDOREDUCTASE FNR;
FLAVOENZYME, 
PHOTOSYNTHESIS, ELECTRON 
TRANSFER, HYDRIDE 2 TRANSFER, 
OXIDOREDUCTASE 


OXIDOREDUCTASE FNR;
OXIDOREDUCTASE,
FLAVOPROTEIN, NADP, FAD,
THYLAKOID MEMBRANE, 2
PHYCOBILISOME, FNR, NADP+
REDUCTASE 




COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 


Compound


OXIDOREDUCTASE 
(NADP+(A),FERREDOXIN(A)) 
FERREDOXIN:NADP+ 
OXIDOREDUCTASE 
(FERREDOXIN REDUCTASE, 
1FNB 3 FLAVOENZYME)
(E.C.1.18.1.2) 1FNB 4 1FNB 72


FLAVIN REDUCTASE; CHAIN: 
A, B, C, D; 


FERREDOXIN:NADP+ 
REDUCTASE; CHAIN: A, B; 


FERREDOXIN--NADP+ 
REDUCTASE; CHAIN: NULL; 


OXIDOREDUCTASE 
(NITROGENOUS ACCEPTOR) 
NADH-DEPENDENT NITRATE 
REDUCTASE (CYTOCHROME B 
REDUCTASE 2CND 3 
FRAGMENT) (E.C.1.6.6.1)
COMPLEXED WITH FAD 2CND 
4 (SYNCHROTRON X-RAY 
DIFFRACTION) 2CND 5 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 


SEQFOLD score
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PDB annotation


ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (TRANSCRIPTION
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEfN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA)


COMPLEX (TRANSCRIPTION 
FACTOR/DNA) TRANSCRIPTION 
FACTOR, PROTEIN-DNA COMPLEX, 
CYTOKINE 2 ACTIVATION, 
COMPLEX (TRANSCRIPTION 
FACTOR/DNA) 

LIPID BINDING PROTEIN APO-E3: 


Compound


OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C;


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


ZINC FINGER PROTEIN GLI1; 
CHAIN: A; DNA; CHAIN: C, D;




STAT3B; CHAIN: A; 18-MER 
DESOXYOLIGONUCLEOTIDE; 
CHAIN: B; 

APOLIPOPROTEIN E; CHAIN: A; 


SEQFOLD
score




103.77 


99.56 


00 

00 
00 


93.51 






PMF 
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Verify 
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PDB annotation 


LIPID TRANSPORT, LIPID

TRANSPORT, HEPARIN-BINDING, 
PLASMA 2 PROTEIN, HDL, VLDL 
REMARK 


TRANSMEMBRANE PROTEIN
COLICIN, BACTERIOCIN, ION 
CHANNEL FORMATION, 
TRANSMEMBRANE 2 PROTEIN 


STRUCTURAL PROTEIN TWO
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


NSEC1; PROTEIN-PROTEIN
COMPLEX, MULTI-SUBUNIT 


ENDOCYTOSIS/EXOCYTOSIS
SYNAPTOTAGMIN ASSOCIATED 35 
KDA PROTEIN, P35A, THREE HELIX 


Compound




COLICIN IA; CHAIN: NULL; 


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


ALPHA SPECTRIN; CHAIN: A, 
B,C; 


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A; 
CHAIN: B; 




SEQFOLD
score




115.08 














PMF 
score 
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0.10 


0.40 


0.18 


oo 

PDB annotation


BUNDLE


ENDOCYTOSIS/EXOCYTOSIS 
SYNAPTOTAGMIN ASSOCIATED 35 
KDA PROTEIN, P35A, THREE HELIX 
BUNDLE 


TRANSFERASE ALPHA- 
SUPERHELIX, TRANSFERASE 


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN 


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN 


TRANSCRIPTION REGULATION 
SIGMA70; RNA POLYMERASE 
SIGMA FACTOR, TRANSCRIPTION 
REGULATION 


TRANSCRIPTION REGULATION 
SIGMA70; RNA POLYMERASE 
SIGMA FACTOR, TRANSCRIPTION 
REGULATION 




G-PROTEIN CDC42 GTPASE-
ACTIVATING PROTEIN; G-PROTEIN,
GAP, SIGNAL-TRANSDUCTION 


COMPLEX(GTPASE 
ACTIVATN/PROTO-ONCOGENE) 
GTPASE- ACTIVATING PROTEIN 
RHOGAP; COMPLEX (GTPASE 
ACTIVATION/PROTO-ONCOGENE), 
GTPASE, 2 TRANSITION STATE, GAP 

CALCIUM/PHOSPHOLIPID-BINDING


Compound




SYNTAXIN-1A; CHAIN: A, B, C;


SOLUBLE LYTIC 
TRANSGLYCOSYLASE SLT70; 
CHAIN: A; 


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


RNA POLYMERASE PRIMARY 
SIGMA FACTOR; CHAIN: NULL; 


RNA POLYMERASE PRIMARY
SIGMA FACTOR; CHAIN: NULL;






RHOGAP; CHAIN: NULL; 


P50-RHOGAP; CHAIN: A; 
TRANSFORMING PROTEIN 
RHOA; CHAIN: B; 

ANNEXIN IV; 1ANN 5 CHAIN:


SEQFOLD
score






104.88 






89.88 






105.03 


106.55 
431.20 


PMF 
score 
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d 
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Verify 
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PDB annotation 






RNA-BINDING PROTEIN/RNA TRA 
PRE-MRNA; SPLICING
REGULATION, RNP DOMAIN, RNA 
COMPLEX 




Compound


NULL; 1ANN 6


OXIDOREDUCTASE (ACTING
ON NADH OR NADPH)
DIHYDROPTERIDINE
REDUCTASE (DHPR)
(E.C.1.6.99.10) COMPLEX 1DHR
3 WITH NADH 1DHR 4




SXL-LETHAL PROTEIN; CHAIN:
A, B; RNA (5'-
R(P*GP*UP*UP*GP*UP*UP*UP*
UP*UP*UP*UP*U)- CHAIN: P, Q;


SXL-LETHAL PROTEIN; CHAIN: 
A, B; RNA (5'-

R(P*GP*UP*UP*GP*UP*UP*UP* 
UP*UP*UP*UP*U)- CHAIN: P, Q; 




POLYDENYLATE BINDING 
PROTEIN 1; CHAIN: A, B, C, D, 


SEQFOLD
score
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ation 


LEX, GENE 




GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A)
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


LNAPOLY(A) 
PABP 1; RRM, 
.EX, GENE 




PDB annotation


PROTEIN-RNA COMPLEX, GENE
REGULATION/RNA 


GENE REGULATION/RNA POLY(A)
BINDING PROTEIN 1, PABP 1; RRM,
PROTEIN-RNA COMPLEX, GENE
REGULATION/RNA




GENE REGULATION/RNA POLY(A)
BINDING PROTEIN 1, PABP 1; RRM,
PROTEIN-RNA COMPLEX, GENE
REGULATION/RNA


Compound




POLYDENYLATE BINDING 
PROTEIN 1; CHAIN: A, B, C, D, 
E, F, G, H; RNA (5'- 
R(*AP*AP*AP*AP*AP*AP*AP* 
AP*AP*AP*A)-3'); CHAIN: M, N,
O, P, Q, R, S, T; 




POLYDENYLATE BINDING 
PROTEIN 1; CHAIN: A, B, C, D, 
E, F, G, H; RNA (5'- 
R(*AP*AP*AP*AP*AP*AP*AP* 
AP*AP*AP*A)-3'); CHAIN: M, N,
O, P, Q, R, S, T;


POLYDENYLATE BINDING 
PROTEIN 1; CHAIN: A, B, C, D,
E, F, G, H; RNA (5'-
R(*AP*AP*AP*AP*AP*AP*AP* 
AP*AP*AP*A)-3'); CHAIN: M, N, 
O, P, Q, R, S, T; 


SEQFOLD
score
PDB annotation


GENE REGULATION/RNA POLY(A)
BINDING PROTEIN 1, PABP 1; RRM,
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 




NUCLEAR PROTEIN 
HETEROGENEOUS NUCLEAR 
RIBONUCLEOPROTEIN A1,
NUCLEAR PROTEIN, HNRNP, RBD, 
RRM, RNP, RNA BINDING, 2 
RIBONUCLEOPROTEIN 


NUCLEAR PROTEIN 
HETEROGENEOUS NUCLEAR 
RIBONUCLEOPROTEIN A1,
NUCLEAR PROTEIN, HNRNP, RBD, 
RRM, RNP, RNA BINDING, 2 
RIBONUCLEOPROTEIN 




COMPLEX 

(RIBONUCLEOPROTEIN/DNA) 
HNRNP A1, UP1; COMPLEX
(RIBONUCLEOPROTEIN/DNA),
HETEROGENEOUS NUCLEAR 2
RIBONUCLEOPROTEIN A1


RNA BINDING DOMAIN RNA 
BINDING DOMAIN, RBD, RNA 
RECOGNITION MOTIF, RRM, 2 
SPLICING INHIBITOR, 
TRANSLATIONAL INHIBITOR, SEX 3 


Compound




HU ANTIGEN C; CHAIN: A; 


HNRNP A1; CHAIN: NULL;


HNRNP A1; CHAIN: NULL;


SPLICING FACTOR U2AF 65 KD 
SUBUNIT; CHAIN: A; 


HETEROGENEOUS NUCLEAR 
RIBONUCLEOPROTEIN A1;
CHAIN: A; 12-NUCLEOTIDE 
SINGLE-STRANDED 
TELOMETRIC DNA; CHAIN: B; 


SEX-LETHAL; CHAIN: A, B, C; 


SEQFOLD
score










50.59 






PMF 
score 


1.00 


1.00 


0.07 


1.00 




0.64 


0.04 


Verify 
score 
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0.06 


0.87 
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PDB annotation 




SYSTEM/DNA 


TRANSFERASE/IMMUNE
SYSTEM/DNA HIV-1 RT; HIV-1 RT;
HIV, REVERSE TRANSCRIPTASE,
MET184ILE, 3TC, PROTEIN-DNA 2
COMPLEX, DRUG RESISTANCE,
M184I, TRANSFERASE/IMMUNE 3
SYSTEM/DNA 




REVERSE TRANSCRIPTASE


REVERSE TRANSCRIPTASE




Compound


CHAIN: A; HIV-1 REVERSE
TRANSCRIPTASE (CHAIN B); 
CHAIN: B; ANTIBODY (LIGHT 
CHAIN); CHAIN: L; ANTIBODY 
(HEAVY CHAIN); CHAIN: H; 
DNA (5'- CHAIN: T; DNA (5'-
CHAIN: P; 


HIV-1 REVERSE 
TRANSCRIPTASE (CHAIN A); 
CHAIN: A; HIV-1 REVERSE 
TRANSCRIPTASE (CHAIN B); 
CHAIN: B; ANTIBODY (LIGHT 
CHAIN); CHAIN: L; ANTIBODY 
(HEAVY CHAIN); CHAIN: H; 
DNA (5'- CHAIN: T; DNA (5'-
CHAIN: P; 


REVERSE TRANSCRIPTASE 
HIV-1 REVERSE 
TRANSCRIPTASE (AMINO- 
TERMINAL HALF) (FINGERS 
1HAR 3 AND PALM
SUBDOMAINS) (RT216)
(E.C.2.7.7.49) 1HAR 4


MMLV REVERSE 
TRANSCRIPTASE; 1MML 4
CHAIN: NULL; 1MML 5


MMLV REVERSE
TRANSCRIPTASE; 1MML 4
CHAIN: NULL; 1MML 5


HIV-1 REVERSE
TRANSCRIPTASE; 1RTH 4
CHAIN: A, B; 1RTH 5


HIV-1 REVERSE
TRANSCRIPTASE; 1RTH 4


SEQFOLD
score










199.34 






PMF 
score 




0.99 


1.00 


1.00 




1.00 


1.00 


Verify 
score 
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0.36 




0.00 


-0.17 


Psi Blast 




CO 

a\ 
t 

o 
in 

CO 


>o 

i> 

CO 


1e-59


1e-59


o 


o 






CO 

8 


vn 
<n 


o 
r- 
m 


*— • 

»n 


00 

VO 
VO 


00 

\o 
VO 


START
AA




CO 


CO 


VO 
CN 
CO 


vo 
CN 

CO 


5* 

CO 


co 






PO 








<: 


PQ 


Ph 




u 

$ 

V— • 


1har


1mml


1mml


1rth


1rth


CO * 


r 




1454 


1454 


1454 


1454 


1454 






WO 02/070539 



PCT/US02/05095 



PDB annotation






STRUCTURAL GENOMICS 
HYPOTHETICAL PROTEIN, 
METHANOCOCCUS JANNASCHII 






GROWTH FACTOR INSULIN-LIKE 
GROWTH FACTOR-1, GROWTH 
FACTOR, NMR, PROTEIN 2 
STRUCTURE, DISTANCE 
GEOMETRY 


OXIDOREDUCTASE 
FERROCYTOCHROME C:OXYGEN


Compound


HIV-1 REVERSE 
TRANSCRIPTASE; 1VRT 4
CHAIN: A, B; 1VRT 5


HIV-1 REVERSE
TRANSCRIPTASE; 1VRT 4
CHAIN: A, B; 1VRT 5


NUCLEOTIDYLTRANSFERASE 
REVERSE TRANSCRIPTASE 
(E.C.2.7.7.49) 3HVT 3


NUCLEOTIDYLTRANSFERASE 
REVERSE TRANSCRIPTASE
(E.C.2.7.7.49) 3HVT 3 


MJ0882; CHAIN: A; 


GROWTH FACTOR INSULIN-
LIKE GROWTH FACTOR II (IGF-
II, IGF-2) 1IGL 3 (NMR, 20
STRUCTURES) 1IGL 4 1IGL 78


GROWTH FACTOR INSULIN-
LIKE GROWTH FACTOR II (IGF-
II, IGF-2) 1IGL 3 (NMR, 20
STRUCTURES) 1IGL 4 1IGL 78


INSULIN-LIKE GROWTH 
FACTOR I; CHAIN: A; 


CYTOCHROME C OXIDASE; 
CHAIN: A, B, C, D, E, F, G, H, I, J,


SEQFOLD
score








84.10 
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PMF 
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1.00 
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PDB annotation 


OXIDOREDUCTASE; 
OXIDOREDUCTASE, 
CYTOCHROME(C)-OXYGEN, 
CYTOCHROME C 2 OXIDASE 


OXIDOREDUCTASE 
FERROCYTOCHROME C:OXYGEN
OXIDOREDUCTASE; 
OXIDOREDUCTASE, 
CYTOCHROME(C)-OXYGEN, 
CYTOCHROME C 2 OXIDASE 


MUSCLE CONTRACTION MUSCLE 
CONTRACTION, CALCIUM- 
ACTIVATED, TROPONIN, E-F HAND 
2 CALCIUM-BINDING PROTEIN 


TRANSFERASE IL-2-INDUCIBLE T-
CELL KINASE; TRANSFERASE,
REGULATORY INTRAMOLECULAR 
COMPLEX, KINASE 


OXIDOREDUCTASE PDZ DOMAIN, 
NNOS, NITRIC OXIDE SYNTHASE 


PEPTIDE RECOGNITION PEPTIDE
RECOGNITION, PROTEIN 
LOCALIZATION 


METAL BINDING PROTEIN CAPPING
PROTEIN, CALCIUM-BINDING, 
DUPLICATION, REPEAT, SH3 2 
DOMAIN, CYTOSKELETON 




Compound




CYTOCHROME C OXIDASE; 
CHAIN: A, B, C, D, E, F, G, H, I, J, 
K, L, M, N, O, P, Q,


TROPONIN C; CHAIN: A, B; 


ITK; CHAIN: NULL; 


NEURONAL NITRIC OXIDE 
SYNTHASE; CHAIN: A; 
HEPTAPEPTIDE; CHAIN: B; 


PSD-95; CHAIN: A; CRIPT; 
CHAIN: B; 


SPECTRIN ALPHA CHAIN; 
CHAIN: A; 


SIGNAL TRANSDUCTION 
PROTEIN GROWTH FACTOR 
RECEPTOR-BOUND PROTEIN 2 
(GRB2, N-TERMINAL 1GBR 3
SH3 DOMAIN) COMPLEXED 


SEQFOLD
score




115.24 














PMF 
score 
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0.87 


0.92 


0.54
PDB annotation










CYTOKINE LCF; CYTOKINE,
LYMPHOCYTE
CHEMOATTRACTANT FACTOR, PDZ
DOMAIN


KINASE HCASK, GLGF REPEAT,
DHR; PDZ DOMAIN, NEUREXIN,
SYNDECAN, RECEPTOR
CLUSTERING, KINASE




SIGNAL TRANSDUCTION HDLG, 
DHR3 DOMAIN; SIGNAL 
TRANSDUCTION, SH3 DOMAIN, 
REPEAT 


Compound


WITH SOS-A PEPTIDE 1GBR 4
(NMR, 29 STRUCTURES) 1GBR
5


ADAPTOR PROTEIN 
CONTAINING SH2 AND SH3 
GROWTH FACTOR RECEPTOR- 
BOUND PROTEIN 2 (GRB2) 
1GFC 3 (C-TERMINAL SH3
DOMAIN) (NMR, MINIMIZED
MEAN STRUCTURE) 1GFC 4


TRANSFERASE GUANYLATE 
KINASE (E.C.2.7.4.8) COMPLEX 
WITH 1GKY 3 GUANOSINE
MONOPHOSPHATE 1GKY 4


TRANSFERASE GUANYLATE 
KINASE (E.C.2.7.4.8) COMPLEX 
WITH 1GKY 3 GUANOSINE
MONOPHOSPHATE 1GKY 4


INTERLEUKIN 16; CHAIN: 
NULL; 


HCASK/LIN-2 PROTEIN; 
CHAIN: A, B; 


HCASK/LIN-2 PROTEIN;
CHAIN: A, B; 


HUMAN DISCS LARGE 
PROTEIN; CHAIN: NULL; 


SEQFOLD
score






146.00 












PMF 
score 




0.66 
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1.00 
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Verify 
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PDB annotation 




SERINE PRO lEAob rTb; SJiKINfc 
PROTEASE, HYDROLASE 


SERINE PROTEASE HYDROLASE,
SERINE PROTEASE, DIGESTION, 
PANCREAS, ZYMOGEN, 2 SIGNAL, 
MULTIGENE FAMILY 


HYDROLASE (SERINE PROTEASE)
MYELOBLASTIN; HYDROLASE, 
SERINE PROTEASE, 
GLYCOPROTEIN, ZYMOGEN, 
SIGNAL 








Compound




ELASTASE; CHAIN: P; 


ELASTASE; CHAIN: P; 


TRYPSIN; CHAIN: NULL; 


PR3; CHAIN: A, B, C, D; 


COMPLEX(PROTEINASE/INHIB 
ITOR) TRYPSIN (E.C.3.4.21.4) 
COMPLEXED WITH INHIBITOR 
FROM BITTER 1MCT 3 GOURD
1MCT 4


COMPLEX(PROTEINASE/INHIB 
ITOR) TRYPSIN (E.C.3.4.21.4) 
COMPLEXED WITH INHIBITOR 
FROM BITTER 1MCT 3 GOURD
1MCT 4


HYDROLASE(SERINE 
PROTEINASE) HUMAN 
LEUKOCYTE ELASTASE (HLE) 
(NEUTROPHIL ELASTASE 
(HNE)) 1PPF 3 (E.C.3.4.21.37)
COMPLEX WITH THE THIRD
DOMAIN OF TURKEY 1PPF 4
OVOMUCOID INHIBITOR
(OMTKY3) 1PPF 5


SEQFOLD
score






115.93 


98.94 


200.42 




103.86 


315.52 


PMF 
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LOO 








1.00 






Verify
PDB annotation


HYDROLASE
MICROPLASMINOGEN, SERINE
PROTEASE, ZYMOGEN,
CHYMOTRYPSIN 2 FAMILY,
HYDROLASE 


GROWTH FACTOR 7S NGF; 
GROWTH FACTOR (BETA-NGF), 
HYDROLASE - SERINE PROTEINASE 
2 (GAMMA-NGF), INACTIVE SERINE 
PROTEINASE (ALPHA-NGF)


GROWTH FACTOR 7S NGF; 
GROWTH FACTOR (BETA-NGF), 
HYDROLASE - SERINE PROTEINASE 
2 (GAMMA-NGF), INACTIVE SERINE 
PROTEINASE (ALPHA-NGF)


COMPLEX (SERINE 
PROTEASE/INHIBITOR) TRYPSIN 
INHIBITOR; SERINE PROTEASE, 
INHIBITOR, COMPLEX, METAL 
BINDING SITES, 2 PROTEIN 
ENGINEERING, PROTEASE- 
SUBSTRATE INTERACTIONS, 3 
METALLOPROTEINS






Compound


PLASMINOGEN; CHAIN: A, B, 
C, D; 


NERVE GROWTH FACTOR; 
CHAIN: A, B, G, X, Y, Z; 


NERVE GROWTH FACTOR; 
CHAIN: A, B, G, X, Y, Z; 


ECOTIN; CHAIN: A; ANIONIC 
TRYPSIN; CHAIN: B; 


HYDROLASE (SERINE 
PROTEINASE) TRYPSIN 
(E.C.3.4.21.4) COMPLEXED
WITH THE INHIBITOR 1TRN 3
DIISOPROPYL-
FLUOROPHOSPHOFLUORIDATE
(DFP) 1TRN 4 HUMAN
TRYPSIN, DFP INHIBITED 1TRN
6


HYDROLASE (SERINE 
PROTEINASE) TRYPSIN 


SEQFOLD
score


117.30 




00 

si 






100.94 


PMF 
score 




1.00




1.00 


1.00 




Verify 
score 




0.65 
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PDB annotation 








SERINE PROTEASE HYDROLASE,
SERINE PROTEASE, DIGESTION, 
PANCREAS, 2 ZYMOGEN, SIGNAL 


SERINE PROTEASE HYDROLASE, 
SERINE PROTEASE, DIGESTION, 
PANCREAS, 2 ZYMOGEN, SIGNAL 




SIGNAL TRANSDUCTION SIGNAL 
TRANSDUCTION 


SIGNAL TRANSDUCTION SIGNAL 
TRANSDUCTION 


COMPLEX (SIGNAL 
TRANSDUCTION/PEPTIDE) 
COMPLEX, 14-3-3, 
PHOSPHOPEPTIDE, SIGNAL 
TRANSDUCTION 


COMPLEX (SIGNAL


Compound


(E.C.3.4.21.4) COMPLEXED 
WITH THE INHIBITOR 1TRN 3
DIISOPROPYL-
FLUOROPHOSPHOFLUORIDATE
(DFP) 1TRN 4 HUMAN
TRYPSIN, DFP INHIBITED 1TRN
6


HYDROLASE(SERINE 
PROTEINASE) TRYPSIN 
(E.C.3.4.21.4) COMPLEXED 
WITH BENZAMIDINE
INHIBITOR 2TBS 3


HYDROLASE(SERINE 
PROTEINASE) TRYPSIN 
(E.C.3.4.21.4) COMPLEXED 
WITH BENZAMIDINE
INHIBITOR 2TBS 3 


BETA TRYPSIN; CHAIN: NULL; 


BETA TRYPSIN; CHAIN: NULL; 




14-3-3 PROTEIN ZETA; CHAIN: 
A, B, C, D; 


14-3-3 PROTEIN ZETA; CHAIN: 
A, B, C, D; 


14-3-3 PROTEIN ZETA; CHAIN: 
A, B; PHOSPHOPEPTIDE; 
CHAIN: Q, R 


14-3-3 PROTEIN ZETA; CHAIN:


SEQFOLD
score






97.32 




99.14 




234.97 






284.35


PMF 
score 




1.00 




1.00 








1.00 


1.00 




Verify 
score 




0.65 




0.73 








0.55 


0.59
PDB annotation


TRANSDUCTION/PEPTIDE) 
COMPLEX, 14-3-3,
PHOSPHOPEPTIDE, SIGNAL 
TRANSDUCTION 


TRANSMEMBRANE PROTEIN 
COLICIN, BACTERIOCIN, ION
CHANNEL FORMATION,
TRANSMEMBRANE 2 PROTEIN


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN


ENDOCYTOSIS/EXOCYTOSIS 
SYNAPTOTAGMIN ASSOCIATED 35 
KDA PROTEIN, P35A, THREE HELIX 


ENDOCYTOSIS/EXOCYTOSIS 
SYNAPTOTAGMIN ASSOCIATED 35 
KDA PROTEIN, P35A, THREE HELIX 
BUNDLE 


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL,


Compound


A, B; PHOSPHOPEPTIDE; 
CHAIN: Q, R 


COLICIN IA; CHAIN: NULL; 


ALPHA SPECTRIN; CHAIN: A, 


ALPHA SPECTRIN; CHAIN: A, 
B,C; 


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


SYNTAXIN-1A; CHAIN: A, B, C;


SYNTAXIN-1A; CHAIN: A, B, C;


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A;


SEQFOLD
score




133.38 














PMF
PDB annotation


STRUCTURAL PROTEIN 
STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN


STRUCTURAL PROTEIN TWO
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN


ENDOCYTOSIS/EXOCYTOSIS 
NSEC1; PROTEIN-PROTEIN
COMPLEX, MULTI-SUBUNIT


ENDOCYTOSIS/EXOCYTOSIS
NSEC1; PROTEIN-PROTEIN
COMPLEX, MULTI-SUBUNIT


ENDOCYTOSIS/EXOCYTOSIS 
SYNAPTOTAGMIN ASSOCIATED 35 
KDA PROTEIN, P35A, THREE HELIX 
BUNDLE 


ENDOCYTOSIS/EXOCYTOSIS 
SYNAPTOTAGMIN ASSOCIATED 35 


Compound


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


ALPHA SPECTRIN; CHAIN: A, 
B, C;


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A; 
CHAIN: B;


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A; 
CHAIN: B; 


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A; 
CHAIN: B; 


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A; 
CHAIN: B;


SYNTAXIN-1A; CHAIN: A, B, C;


SYNTAXIN-1A; CHAIN: A, B, C;


SEQFOLD
score
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PDB annotation 




ENDOCYTOSIS/EXOCYTOSIS
SYNAPTOTAGMIN ASSOCIATED 35 
KDA PROTEIN, P35A, THREE HELIX 
BUNDLE 




CHAPERONE ARCHAEAL PROTEIN


CONTRACTILE PROTEIN TRIPLE-
HELIX COILED COIL, 
CONTRACTILE PROTEIN 


CONTRACTILE PROTEIN TRIPLE-
HELIX COILED COIL, 
CONTRACTILE PROTEIN 


CONTRACTILE PROTEIN TRIPLE-
HELIX COILED COIL, 
CONTRACTILE PROTEIN 


CONTRACTILE PROTEIN TRIPLE-
HELIX COILED COIL, 
CONTRACTILE PROTEIN 




IMMUNE SYSTEM KH DOMAIN, 
ALPHA-BETA FOLD, RNA-BINDING 
MOTIF 


IMMUNE SYSTEM KH DOMAIN,


Compound




SYNTAXIN-1A; CHAIN: A, B, C;


SSO1 PROTEIN; CHAIN: A;




HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 




HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


LIGASE(SYNTHETASE) SERYL- 
TRNA SYNTHETASE 
(E.C.6.1.1.11) (SERINE-TRNA
LIGASE) 1SES 3 COMPLEXED
WITH SERYL-HYDROXAMATE-
AMP 1SES 4


NEURO-ONCOLOGICAL 
VENTRAL ANTIGEN 1; CHAIN: 
A;


RNA-BINDING




PREFOLDIN; CHAIN: 
PREFOLDIN; CHAIN: 
PREFOLDIN; CHAIN: 


SEQFOLD
score
























PMF 
score 




0.19 
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0.59 
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Verify 
score 
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PDB annotation 


C-MYC, 2 DIPOLAR COUPLING, 
DNA-BINDING, RNA-BINDING, RNA 
BINDING 3 PROTEIN 


S1 RNA-BINDING DOMAIN
POLYRIBONUCLEOTIDE
NUCLEOTIDYLTRANSFERASE, S1
RNA-BINDING DOMAIN, 
POLYNUCLEOTIDE 
PHOSPHORYLASE 2 (PNPASE) 


RIBONUCLEOPROTEIN RNA-
BINDING PROTEIN 1VIG 19




GENE REGULATION APO PROTEIN 


GENE REGULATION APO PROTEIN 


CELL CYCLE CDC6P; CDC6, CDC18,
ORC1, AAA PROTEIN, DNA
REPLICATION INITATION 2 
FACTOR, CELL CYCLE CONTROL 
FACTOR 


HELICASE DNA REPAIR, DNA 
REPLICATION, SOS RESPONSE, 
HELICASE, ATP- 2 BINDING, DNA- 
BINDING 




ACTIN-BINDING PROTEIN ACTIN- 
BINDING PROTEIN, CALCIUM- 
BINDING, PHOSPHORYLATION 


STRUCTURAL PROTEIN 
DYSTROPHIN, MUSCULAR 
DYSTROPHY, CALPONIN 
HOMOLOGY DOMAIN, 2 ACTIN- 
BINDING, UTROPHIN 
















N: NULL; 






§ 






HAIN: N 




U <g 


EXCINUCLEASE UVRAB C 
COMPONENT UVRB; CHA 


It 




CHAIN: 
CHAIN: 
CHAIN: 
CHAIN 






IAIN: A, 


Compound




PNPASE; CHAINS 


VIGILIN; 1VIG 5 C
1VIG 6




EXCINUCLEASE I 
COMPONENT UV1 


CELL DIVISION C 
PROTEIN 6; CHAE 




PCRA (SUBUNIT); 
PCRA (SUBUNIT): 
PCRA (SUBUNIT): 
PCRA (SUBUNIT); 




T-FIMBRIN; CHAJ 




DYSTROPHIN; C! 
D; 


SEQFOLD
score
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a 


STRUCTURAL PROTEIN CALPONIN 
HOMOLOGY DOMAIN, DOMAIN 
SWAPPING, ACTIN BINDING, 2 
UTROPHIN, DYSTROPHIN, 
STRUCTURAL PROTEIN


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN 


COMPLEX (DNA-BINDING
PROTEIN/DNA) FIVE-FINGER GLI;
GLI, ZINC FINGER, COMPLEX (DNA-
BINDING PROTEIN/DNA)


TRANSCRIPTION REGULATION 
PROTO-ONCOGENE, NUCLEAR 
BODIES (PODS), LEUKEMIA, 2 




LIGASE CBL, UBCH7, ZAP-70, E2,
UBIQUITIN, E3, 

PHOSPHORYLATION, 2 TYROSINE 
KINASE, UBIQUITINATION, 
PROTEIN DEGRADATION, 


METAL BINDING PROTEIN RING 
FINGER PROTEIN MAT1; RING


Compound


UTROPHIN ACTIN BINDING 
REGION; CHAIN: A, B; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE;
CHAIN: B, C, E, F; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


TRANSCRIPTION FACTOR 
PML; CHAIN: NULL; 


VIRUS EQUINE HERPES VIRUS- 
1 (C3HC4, OR RING DOMAIN) 
1CHC 3 (NMR, 1 STRUCTURE)
1CHC 4


SIGNAL TRANSDUCTION 
PROTEIN CBL; CHAIN: A; ZAP- 
70 PEPTIDE; CHAIN: B; 
UBIQUITIN-CONJUGATING 
ENZYME E2-18 KDA UBCH7;
CHAIN: C; 


CDK-ACTIVATING KINASE 
ASSEMBLY FACTOR MAT1;


SEQFOLD
score
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PDB annotation 

FINGER (C3HC4)


DNA-BINDING PROTEIN V(D)J 
RECOMBINATION ACTIVATING 
PROTEIN 1; RAG1, V(D)J
RECOMBINATION, ANTIBODY, 
MAD, RING FINGER, 2 ZINC 
BINUCLEAR CLUSTER, ZINC 
FINGER, DNA-BINDING PROTEIN 




HEXAMERIZATION DOMAIN
HEXAMERIZATION DOMAIN, 
ATPASE, TRANSPORT 


CHAPERONE HSLV; HSLU 
CHAPERONE, HSLVU, CLPQY, AAA- 
ATPASE, ATP-DEPENDENT 2 
PROTEOLYSIS, PROTEASOME 


CHAPERONE HSLV; HSLU 
CHAPERONE, HSLVU, CLPQY, AAA- 
ATPASE, ATP-DEPENDENT 2 
PROTEOLYSIS, PROTEASOME 


HYDROLASE ARSA ATPASE; P-
LOOP, ANTIMONITE BINDING SITE,
ATP BINDING SITE 


CELL CYCLE CDC6P; CDC6, CDC18,
ORC1, AAA PROTEIN, DNA
REPLICATION INITATION 2 
FACTOR, CELL CYCLE CONTROL 
FACTOR 


CHAPERONE AAA-ATPASE, CLPY,
ATP-DEPENDENT PROTEOLYSIS 


CHAPERONE AAA-ATPASE, CLPY,
ATP-DEPENDENT PROTEOLYSIS 


TRANSFERASE SHIKIMATE KINASE, 
PHOSPHORYL TRANSFER, ADP, 






N-ETHYLMALEIMIDE- 
SENSITIVE FUSION PROTEIN; 
CHAIN: A; 


HEAT SHOCK PROTEIN HSLV; 
CHAIN: A, B, C, D; HEAT 
SHOCK PROTEIN HSLU; 
CHAIN: E, F;


HEAT SHOCK PROTEIN HSLV; 
CHAIN: A, B, C, D; HEAT 
SHOCK PROTEIN HSLU; 
CHAIN: E, F; 


ARSENITE-TRANSLOCATING
ATPASE; CHAIN: A; 


CELL DIVISION CONTROL 
PROTEIN 6; CHAIN: A, B; 


HEAT SHOCK PROTEIN HSLU; 
CHAIN: A; 


HEAT SHOCK PROTEIN HSLU; 
CHAIN: A; 


SHIKIMATE KINASE; CHAIN: 
A, B; 


SEQFOLD
score
PDB annotation


SHIKIMATE 2 PATHWAY, P-LOOP 
PROTEIN, TRANSFERASE


PROTEIN TRANSPORT FIFTY-FOUR 
HOMOLOG, P48; FFH, SRP54, 
SIGNAL RECOGNITION PARTICLE, 
GTPASE, M DOMAIN, 2 RNA-
BINDING, SIGNAL SEQUENCE-
BINDING, HELIX-TURN-HELIX, 3
PROTEIN TARGETING, PROTEIN 
TRANSPORT 


COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


TRANSCRIPTION RNA1P; RANGAP;
GTPASE-ACTIVATING PROTEIN
FOR SPI1, GTPASE-ACTIVATING
PROTEIN, GAP, RNA1P, RANGAP,
LRR, LEUCINE- 2 RICH REPEAT 
PROTEIN, TWINNING, 
HEMIHEDRAL TWINNING, 3 
MEROHEDRAL TWINNING, 
MEROHEDRY 


ACETYLATION RNASE INHIBITOR, 
RIBONUCLEASE/ANGIOGENIN 
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS 


PANCREATIC STONE INHIBITOR 
PANCREATIC STONE INHIBITOR, 
LECTIN 


puni 




iS 




NHIBITOR; 
lOGENIN; 




:hain: 


Compound




FFH; CHAIN: A, B, 




RIBONUCLEASE D 
CHAIN: A, D; ANG 
CHAIN: B, E; 


RIBONUCLEASE ft 
CHAIN: NULL; 


LITHOSTATHINE; ( 
NULL 


SEQFOLD
score
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PDB annotation 


PANCREATIC STONE INHIBITOR 
PANCREATIC STONE INHIBITOR, 
LECTIN 


PANCREATIC STONE INHIBITOR
PANCREATIC STONE INHIBITOR, 
LECTIN 


METAL BINDING PROTEIN
PANCREATIC STONE PROTEIN, PSP; 
PANCREATIC STONE INHIBITOR, 
LITHOSTATHINE 


METAL BINDING PROTEIN 
PANCREATIC STONE PROTEIN, PSP; 
PANCREATIC STONE INHIBITOR, 
LITHOSTATHINE 


METAL BINDING PROTEIN 
PANCREATIC STONE PROTEIN, PSP; 
PANCREATIC STONE INHIBITOR, 
LITHOSTATHINE 




PROTEIN TRANSPORT HELIX- 
TURN-HELIX TPR-LIKE REPEAT, 
PROTEIN TRANSPORT 




HYDROLASE SERCA1; ION PUMP,
CALCIUM, MEMBRANE PROTEIN, P- 
TYPE ATPASE, ACTIVE 2 
TRANSPORT 




REPLICATION DNA NUCLEOTIDE 
EXCISION REPAIR, UVRABC, 
HELICASE, 2 

HYPERTHERMOSTABLE PROTEIN 


REPLICATION DNA NUCLEOTIDE 
EXCISION REPAIR, UVRABC, 


Compound


LITHOSTATHINE; CHAIN: 
NULL 


LITHOSTATHINE; CHAIN: 
NULL 


LITHOSTATHINE; CHAIN: A; 


LITHOSTATHINE; CHAIN: A; 


LITHOSTATHINE; CHAIN: A; 




VESICULAR TRANSPORT 
PROTEIN SEC17; CHAIN: A; 




CALCIUM-TRANSPORTING 
ATPASE SARCOPLASMIC 
CHAIN: A; 




DNA NUCLEOTIDE EXCISION 
REPAIR ENZYME UVRB; 
CHAIN: A; 


DNA NUCLEOTIDE EXCISION 
REPAIR ENZYME UVRB; 
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HELICASE, 2 

HYPERTHERMOSTABLE PROTEIN


HYDROLASE UVRB; MULTIDOMAIN 
PROTEIN 


HYDROLASE UVRB; MULTIDOMAIN 
PROTEIN 


GENE REGULATION APO PROTEIN 


GENE REGULATION APO PROTEIN 


TRANSLATION YEAST INITIATION 
FACTOR 4A, EIF4A; HELICASE, 
INITIATION FACTOR 4A, DEAD-BOX 
PROTEIN 


TRANSLATION EUKARYOTIC 
INITIATION FACTOR 4A; IF4A, 
HELICASE, DEAD-BOX PROTEIN


TRANSLATION EUKARYOTIC 
INITIATION FACTOR 4A; IF4A, 
HELICASE, DEAD-BOX PROTEIN




GENE REGULATION EIF4A;
TRANSLATION INITIATION,
SACCHAROMYCES CEREVISIAE,
DEAD BOX 2 PROTEIN FAMILY


HELICASE RNA HELICASE, 
HEPATITIS C VIRUS, HCV, 
UNWINDING MECHANISM 


CALCIUM-BINDING PROTEIN CALB; 
CALCIUM++/PHOSPHOLIPID
BINDING PROTEIN, 2 CALCIUM- 
BINDING PROTEIN 






















Compound


CHAIN: A; 


EXCINUCLEASE ABC SUBUNIT
B; CHAIN: A;


EXCINUCLEASE ABC SUBUNIT
B; CHAIN: A;


EXCINUCLEASE UVRABC 
COMPONENT UVRB; CHAIN:


EXCINUCLEASE UVRABC 
COMPONENT UVRB; CHAIN:


EUKARYOTIC INITIATION 
FACTOR 4A; CHAIN: A; 


YEAST INITIATION FACTOR 
4A; CHAIN: A, B; 


YEAST INITIATION FACTOR 
4A; CHAIN: A, B; 


HCV HELICASE; CHAIN: A, B; 


TRANSLATION INITIATION 
FACTOR 4A; CHAIN: A; 


RNA HELICASE; CHAIN: NULL;


PROTEIN KINASE C (BETA); 
CHAIN: A, B; 


SEQFOLD
score
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PDB annotation 


CALCIUM-BINDING PROTEIN CALB; 
CALCIUM++/PHOSPHOLIPID 
BINDING PROTEIN, 2 CALCIUM- 
BINDING PROTEIN 


CALCIUM-BINDING PROTEIN CALB; 
CALCIUM++/PHOSPHOLIPID 
BINDING PROTEIN, 2 CALCIUM- 
BINDING PROTEIN 


CALCIUM-BINDING PROTEIN CALB; 
CALCIUM++/PHOSPHOLIPID 
BINDING PROTEIN, 2 CALCIUM- 
BINDING PROTEIN 


ENDOCYTOSIS/EXOCYTOSIS 
SYNAPTOTAGMIN, C2-DOMAIN, 
EXOCYTOSIS, 

NEUROTRANSMITTER 2 RELEASE, 
ENDOCYTOSIS/EXOCYTOSIS 


ENDOCYTOSIS/EXOCYTOSIS 
SYNAPTOTAGMIN, C2-DOMAIN, 
EXOCYTOSIS, 

NEUROTRANSMITTER 2 RELEASE, 
ENDOCYTOSIS/EXOCYTOSIS 


HYDROLASE CPLA2; 
PHOSPHOLIPASE, LIPID-BINDING, 
HYDROLASE 


HYDROLASE CPLA2; 
PHOSPHOLIPASE, LIPID-BINDING, 
HYDROLASE 


LIPID DEGRADATION PLC-D1;
PHOSPHORIC DIESTER 
HYDROLASE, HYDROLASE, LIPID 
DEGRADATION, 2 TRANSDUCER, 
CALCIUM-BINDING, 
PHOSPHOLIPASE C, 3 


Compound


PROTEIN KINASE C (BETA); 
CHAIN: A, B; 


PROTEIN KINASE C (BETA); 
CHAIN: A, B; 


PROTEIN KINASE C (BETA); 
CHAIN: A, B; 


SYNAPTOTAGMIN I; CHAIN: A; 


SYNAPTOTAGMIN I; CHAIN: A; 


CYTOSOLIC PHOSPHOLIPASE 
A2; CHAIN: A, B; 


CYTOSOLIC PHOSPHOLIPASE 
A2; CHAIN: A, B; 


PHOSPHOINOSITIDE-SPECIFIC 
PHOSPHOLIPASE C, CHAIN: A, 
B; 


SEQFOLD
score
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PHOSPHOINOSITIDE-SPECIFIC 


LIPID DEGRADATION PLC-D1;
PHOSPHORIC DIESTER
HYDROLASE, HYDROLASE, LIPID
DEGRADATION, 2 TRANSDUCER,
CALCIUM-BINDING,
PHOSPHOLIPASE C, 3
PHOSPHOINOSITIDE-SPECIFIC


LIPID DEGRADATION PLC-D1;
PHOSPHORIC DIESTER
HYDROLASE, HYDROLASE, LIPID
DEGRADATION, 2 TRANSDUCER,
CALCIUM-BINDING,
PHOSPHOLIPASE C, 3
PHOSPHOINOSITIDE-SPECIFIC


TRANSFERASE CALCIUM++, 
PHOSPHOLIPID BINDING PROTEIN,
CALCIUM-BINDING 2 PROTEIN, 
PHOSPHATIDYLSERINE, PROTEIN 
KINASE C 


TRANSFERASE CALCIUM++, 
PHOSPHOLIPID BINDING PROTEIN, 
CALCIUM-BINDING 2 PROTEIN, 
PHOSPHATIDYLSERINE, PROTEIN 
KINASE C


TRANSFERASE CALCIUM++, 
PHOSPHOLIPID BINDING PROTEIN, 
CALCIUM-BINDING 2 PROTEIN, 
PHOSPHATIDYLSERINE, PROTEIN 
KINASE C 


HYDROLASE CALB DOMAIN; 
HYDROLASE, C2 DOMAIN, CALB 
DOMAIN 

HYDROLASE CALB DOMAIN;


Compound




PHOSPHOINOSITIDE-SPECIFIC 
PHOSPHOLIPASE C, CHAIN: A, 
B; 


PHOSPHOINOSITIDE-SPECIFIC 
PHOSPHOLIPASE C, CHAIN: A, 

B;


PROTEIN KINASE C, ALPHA 
TYPE; CHAIN: A; 


PROTEIN KINASE C, ALPHA 
TYPE; CHAIN: A; 


PROTEIN KINASE C, ALPHA 
TYPE; CHAIN: A; 


PHOSPHOLIPASE A2; CHAIN: 
NULL; 

PHOSPHOLIPASE A2; CHAIN: 


SEQFOLD
score
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PDB annotation 


KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE)


HORMONE/GROWTH FACTOR P18- 
INK4C; CELL CYCLE INHIBITOR, 
P18INK4C, TUMOR, SUPPRESSOR, 
CYCLIN- 2 DEPENDENT KINASE,
HORMONE/GROWTH FACTOR 


HORMONE/GROWTH FACTOR P18-
INK4C; CELL CYCLE INHIBITOR,
P18INK4C, TUMOR, SUPPRESSOR,
CYCLIN- 2 DEPENDENT KINASE,
HORMONE/GROWTH FACTOR


SIGNALING PROTEIN HELIX-TURN- 
HELIX, ANKYRIN REPEAT 


SIGNALING PROTEIN HELIX-TURN- 
HELIX, ANKYRIN REPEAT 


METAL BINDING PROTEIN ZINC- 
BINDING MODULE, ANKYRIN 
REPEATS, METAL BINDING 
PROTEIN 


CELL CYCLE INHIBITOR P18-
INK4C(INK6); CELL CYCLE
INHIBITOR, P18-INK4C(INK6),
ANKYRIN REPEAT, 2 CDK 4/6 
INHIBITOR 


CELL CYCLE INHIBITOR P18-
INK4C(INK6); CELL CYCLE
INHIBITOR, P18-INK4C(INK6),
ANKYRIN REPEAT, 2 CDK 4/6 
INHIBITOR 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


Compound




CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A;




CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A; 


CYCLIN-DEPENDENT KINASE 
4 INHIBITOR B; CHAIN: A;


CYCLIN-DEPENDENT KINASE 
4 INHIBITOR B; CHAIN: A; 


PYK2-ASSOCIATED PROTEIN 
BETA; CHAIN: A; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A, B; 




CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A, B; 




NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 


SEQFOLD score 
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score 
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Verify 
score 




0.42 


0.39 


0.23 


0.81 


-0.07 


0.22 


0.52 


0.21 


Psi Blast 


1.2e-32 


1.7e-30 


START AA 


CHAIN ID 




< 


<' 


< 


< 


< 


< 


< 




1dcq 


1ihb 


1ihb 




1503 


1503 


1503 



1503 


1503 








PDB annotation 




TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 


Compound 


B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 


SEQFOLD score 
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PDB annotation 


INTERACTION MODULE, 2 
DIMERIZATION DOMAIN, 
TRANSFERASE 


TRANSFERASE RECEPTOR 
TYROSINE KINASE, PROTEIN 
INTERACTION MODULE, 2 
DIMERIZATION DOMAIN, 
TRANSFERASE 


SIGNAL TRANSDUCTION SAM 
DOMAIN, EPH RECEPTOR, SIGNAL 
TRANSDUCTION, OLIGOMER 


SIGNAL TRANSDUCTION SAM 
DOMAIN, EPH RECEPTOR, SIGNAL 
TRANSDUCTION, OLIGOMER 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 


Compound 




EPHA4 RECEPTOR TYROSINE 
KINASE; CHAIN: A; 


EPHB2; CHAIN: A, B, C, D, E, F, 
G, H; 




P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


SEQFOLD score 
























PMF 
score 




1.00 
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0.99 


1.00 
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0.98 
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PDB annotation 


ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 


Compound 




P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


SEQFOLD score 
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PDB annotation 


KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 
COMPLEX (INHIBITOR 


Compound 




CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 

CYCLIN-DEPENDENT KINASE 


SEQFOLD score 














PMF 
score 




1.00 
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1.00 
1.00 


Verify 
score 




0.49 
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0.17 
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PDB annotation 


P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


ANK-REPEAT MYOTROPHIN, 
ACETYLATION, NMR, ANK-REPEAT 


ANK-REPEAT MYOTROPHIN, 


Compound 


CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


MYOTROPHIN; CHAIN: NULL 


MYOTROPHIN; CHAIN: NULL 

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PDB annotation 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 


Compound 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


SEQFOLD score 
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PDB annotation 


COMPLEX (ANTI- 
ONCOGENE/ANKYRIN REPEATS) 
P53BP2; ANKYRIN REPEATS, SH3, 
P53, TUMOR SUPPRESSOR, 
MULTIGENE 2 FAMILY, NUCLEAR 
PROTEIN, PHOSPHORYLATION, 
DISEASE MUTATION, 3 
POLYMORPHISM, COMPLEX (ANTI- 
ONCOGENE/ANKYRIN REPEATS) 


HYDROLASE TETRATRICOPEPTIDE, 
TRP; HYDROLASE, PHOSPHATASE, 
PROTEIN-PROTEIN INTERACTIONS, 
TPR, 2 SUPER-HELIX, X-RAY 
STRUCTURE 


TRANSCRIPTION REGULATION 
PROTO-ONCOGENE, NUCLEAR 
BODIES (PODS), LEUKEMIA, 2 
TRANSCRIPTION REGULATION 


SIGNALLING COMPLEX RAC1; 
P67PHOX; SIGNALLING COMPLEX, 
GTPASE, NADPH OXIDASE, 
PROTEIN-PROTEIN 2 COMPLEX, TPR 
MOTIF 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 


Compound 


P53; CHAIN: A; 53BP2; CHAIN: 
B; 


SERINE/THREONINE PROTEIN 
PHOSPHATASE 5; CHAIN: 
NULL; 


TRANSCRIPTION FACTOR 
PML; CHAIN: NULL; 


VIRUS EQUINE HERPES VIRUS- 
1 (C3HC4, OR RING DOMAIN) 
1CHC 3 (NMR, 1 STRUCTURE) 
1CHC 4 


RAS-RELATED C3 BOTULINUM 
TOXIN SUBSTRATE 1; CHAIN: 
A; NEUTROPHIL CYTOSOL 
FACTOR 2 (NCF-2) CHAIN: B; 


TPR2A-DOMAIN OF HOP; 
CHAIN: A; HSP90-PEPTIDE 
MEEVD; CHAIN: B; 


TPR1-DOMAIN OF HOP; CHAIN: 
A, B; HSC70-PEPTIDE; CHAIN: 


































PMF 
score 


0.76 


0.05 


0.82 


0.31 

Verify 
score 


0.01 


oro- 


-0.76 


-0.25 


0.02 


0.05 


0.00 


Psi Blast 


6e-14 




















CQ 










< 


< 


1ycs 


1bor 


1chc 


1elr 


1elw 




1504 


1507 


1507 


1507 


1507 


1507 


1507 






PDB annotation 


REPEAT, HSC70, 2 HSP70, PROTEIN 
BINDING 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSC70, 2 HSP70, PROTEIN 
BINDING 


METAL BINDING PROTEIN RING 
FINGER PROTEIN MAT1; RING 
FINGER (C3HC4) 


DNA-BINDING PROTEIN V(D)J 
RECOMBINATION ACTIVATING 
PROTEIN 1; RAG1, V(D)J 
RECOMBINATION, ANTIBODY, 
MAD, RING FINGER, 2 ZINC 
BINUCLEAR CLUSTER, ZINC 
FINGER, DNA-BINDING PROTEIN 


COMPLEX 

(IMMUNOGLOBULIN/AUTOANTIGE 
N) COMPLEX 

(IMMUNOGLOBULIN/AUTOANTIGE 
N), RHEUMATOID FACTOR 2 AUTO- 
ANTIBODY COMPLEX 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPHILIC ADHESION 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 


Compound 


TPR1-DOMAIN OF HOP; CHAIN: 
A, B; HSC70-PEPTIDE; CHAIN: 
C, D; 


CDK-ACTIVATING KINASE 
ASSEMBLY FACTOR MAT1; 
CHAIN: A; 


RAG1; CHAIN: NULL; 


IGG4 REA; CHAIN: A; RF-AN 
IGM/LAMBDA; CHAIN: H, L; 


HEMOLIN; CHAIN: A, B; 


HEMOLIN; CHAIN: A, B; 


HEMOLIN; CHAIN: A, B; 


HEMOLIN; CHAIN: A, B; 


SEQFOL 
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HOMOPHILIC ADHESION 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPHILIC ADHESION 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPHILIC ADHESION 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPHILIC ADHESION 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPHILIC ADHESION 


T-CELL SURFACE GLYCOPROTEIN 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, T-CELL, 2 MHC, 
LIPOPROTEIN, T-CELL SURFACE 
GLYCOPROTEIN 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 


Compound 


HEMOLIN; CHAIN: A, B; 


HEMOLIN; CHAIN: A, B; 


HEMOLIN; CHAIN: A, B; 


HEMOLIN; CHAIN: A, B; 


T-CELL SURFACE 
GLYCOPROTEIN CD4; CHAIN: 
NULL; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


SEQFOL 
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PDB annotation 


ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 
GROWTH FACTOR/GROWTH 














Compound 




AXONIN-1; CHAIN: A; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 

FIBROBLAST GROWTH 


SEQFOLD score 
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PDB annotation 


FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 


Compound 


FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


SEQFOLD score 
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PDB annotation 


FACTOR RECEPTOR 1 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, SIGNAL 


Compound 




FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 


SEQFOLD score 
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PDB annotation 


ELECTRON MICROSCOPY, 2 
POLIOVIRUS-RECEPTOR COMPLEX, 
VIRUS/VIRAL PROTEIN RECEPTOR 


VIRUS/VIRAL PROTEIN, RECEPTOR 
CD155, PVR, HUMAN POLIOVIRUS, 
ELECTRON MICROSCOPY, 2 
POLIOVIRUS-RECEPTOR COMPLEX, 
VIRUS/VIRAL PROTEIN RECEPTOR 


CELL ADHESION NCAM; NCAM, 
IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN 


CELL ADHESION NCAM; NCAM, 
IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN 


CELL ADHESION NCAM; NCAM, 
IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN 


CELL ADHESION NCAM; NCAM, 
IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN 


CELL ADHESION NCAM; NCAM, 
IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN 


CELL ADHESION NCAM; NCAM, 
IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN 


CELL ADHESION NCAM; NCAM, 
IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN 


CELL ADHESION NCAM; NCAM, 
IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 


Compound 


CHAIN: 2; VP3; CHAIN: 3; VP4; 
CHAIN: 4; 


POLIOVIRUS RECEPTOR; 
CHAIN: R; VP1; CHAIN: 1; VP2; 
CHAIN: 2; VP3; CHAIN: 3; VP4; 
CHAIN: 4; 


NEURAL CELL ADHESION 
MOLECULE; CHAIN: A, B, C, D; 


NEURAL CELL ADHESION 
MOLECULE; CHAIN: A, B, C, D; 


NEURAL CELL ADHESION 
MOLECULE; CHAIN: A, B, C, D; 


NEURAL CELL ADHESION 
MOLECULE; CHAIN: A, B, C, D; 


NEURAL CELL ADHESION 
MOLECULE; CHAIN: A, B, C, D; 


NEURAL CELL ADHESION 
MOLECULE; CHAIN: A, B, C, D; 


NEURAL CELL ADHESION 
MOLECULE; CHAIN: A, B, C, D; 


NEURAL CELL ADHESION 
MOLECULE; CHAIN: A, B, C, D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 


SEQFOL 
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PDB annotation 


IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


Compound 


FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


SEQFOLD score 
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PDB annotation 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 


Compound 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 


SEQFOLD score 
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score 


0.01 


0.13 


0.41 


-0.02 


0.23 


ZVO 


Psi Blast 


START AA 


CHAIN ID 


O 


O 


0 


0 


O 


O 




1508 




1508 






PDB annotation 


SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF1; FGFR1; 


Compound 




FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


FIBROBLAST GROWTH 
FACTOR 1; CHAIN: A, B; 


SEQFOLD score 
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PDB annotation 


IMMUNOGLOBULIN (IG) LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF1; FGFR1; 
IMMUNOGLOBULIN (IG) LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF1; FGFR1; 
IMMUNOGLOBULIN (IG) LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF1; FGFR1; 
IMMUNOGLOBULIN (IG) LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF1; FGFR1; 
IMMUNOGLOBULIN (IG) LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF1; FGFR1; 
IMMUNOGLOBULIN (IG) LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 


Compound 


FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


FIBROBLAST GROWTH 
FACTOR 1; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


FIBROBLAST GROWTH 
FACTOR 1; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


FIBROBLAST GROWTH 
FACTOR 1; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


FIBROBLAST GROWTH 
FACTOR 1; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C,D; 


FIBROBLAST GROWTH 
FACTOR 1; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
C, D; 


SEQFOLD score 
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PDB annotation 




IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 
ANTIBODY, IGE-FC 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 
ANTIBODY, IGE-FC 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 
ANTIBODY, IGE-FC 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 


Compound 


RECEPTOR CHAIN: A; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; IG 
EPSILON CHAIN C REGION; 
CHAIN: B, D; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; IG 
EPSILON CHAIN C REGION; 
CHAIN: B, D; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; IG 
EPSILON CHAIN C REGION; 
CHAIN: B, D; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; IG 
EPSILON CHAIN C REGION; 
CHAIN: B, D; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; IG 
EPSILON CHAIN C REGION; 
CHAIN: B, D; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 


SEQFOLD score 
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PDB annotation 


IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 
ANTIBODY, IGE-FC 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 
ANTIBODY, IGE-FC 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 
ANTIBODY, IGE-FC 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 
ANTIBODY, IGE-FC 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 
ANTIBODY, IGE-FC 


IMMUNE SYSTEM, MEMBRANE 
PROTEIN CD32; FC RECEPTOR, 
IMMUNOGLOBULIN, LEUKOCYTE, 
CD32 


IMMUNE SYSTEM, MEMBRANE 
PROTEIN CD32; FC RECEPTOR, 


Compound 


RECEPTOR CHAIN: A; IG 
EPSILON CHAIN C REGION; 
CHAIN: B, D; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; IG 
EPSILON CHAIN C REGION; 
CHAIN: B, D; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; IG 
EPSILON CHAIN C REGION; 
CHAIN: B, D; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; IG 
EPSILON CHAIN C REGION; 
CHAIN: B, D; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; IG 
EPSILON CHAIN C REGION; 
CHAIN: B, D; 


FC RECEPTOR 
FC(GAMMA)RIIA; CHAIN: A; 


FC RECEPTOR 
FC(GAMMA)RIIA; CHAIN: A; 


SEQFOLD score 
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PDB annotation 




(IMMUNOGLOBULIN/RECEPTOR) 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, RECEPTOR, 2 
SIGNAL, COMPLEX 
(IMMUNOGLOBULIN/RECEPTOR) 


(IMMUNOGLOBULIN/RECEPTOR) 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, RECEPTOR, 2 
SIGNAL, COMPLEX 
(IMMUNOGLOBULIN/RECEPTOR) 


COMPLEX 

(IMMUNOGLOBULIN/RECEPTOR) 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, RECEPTOR, 2 
SIGNAL, COMPLEX 
(IMMUNOGLOBULIN/RECEPTOR) 


KINASE KINASE, TWITCHIN, 
INTRASTERIC REGULATION 
IMMUNOGLOBULIN 
IMMUNOGLOBULIN, BENCE JONES 
PROTEIN 




Compound 


T LYMPHOCYTE ADHESION 
GLYCOPROTEIN CD2 (RAT) 
1HNG3 


INTERLEUKIN-1 BETA; CHAIN: 
A; TYPE 1 INTERLEUKIN-1 
RECEPTOR; CHAIN: B; 


INTERLEUKIN-1 BETA; CHAIN: 
A; TYPE 1 INTERLEUKIN-1 
RECEPTOR; CHAIN: B; 


INTERLEUKIN-1 BETA; CHAIN: 
A; TYPE 1 INTERLEUKIN-1 
RECEPTOR; CHAIN: B; 

TWITCHIN; CHAIN: NULL; 


LAMBDA III BENCE JONES 
PROTEIN CLE; CHAIN: A, B 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN G1 (IGG1) 
(MCG) WITH A HINGE 
DELETION 1MCO 3 
IMMUNOGLOBULIN 


SEQFOLD score 
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PDB annotation 


D1,2; 1VCA 6 IMMUNOGLOBULIN 
SUPERFAMILY, INTEGRIN-BINDING 
1VCA 15 


GLYCOPROTEIN CD4; 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, T-CELL, 2 MHC 
LIPOPROTEIN, POLYMORPHISM 


GLYCOPROTEIN CD4; 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, T-CELL, 2 MHC 
LIPOPROTEIN, POLYMORPHISM


GLYCOPROTEIN CD4; 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, T-CELL, 2 MHC 
LIPOPROTEIN, POLYMORPHISM


CELL ADHESION ICAM-2; 
IMMUNOGLOBULIN FOLD, CELL 
ADHESION, GLYCOPROTEIN, 2 
TRANSMEMBRANE, REPEAT, 
SIGNAL 


IMMUNE SYSTEM P58 NATURAL 
KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 
IMMUNOGLOBULIN 


IMMUNE SYSTEM P58 NATURAL 
KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 
IMMUNOGLOBULIN 


IMMUNE SYSTEM P58 NATURAL 


Compound


ADHESION MOLECULE-1; 
1VCA 4 CHAIN: A, B; 1VCA 5


T-CELL SURFACE 
GLYCOPROTEIN CD4; CHAIN: 
A,B; 


T-CELL SURFACE 
GLYCOPROTEIN CD4; CHAIN: 
A,B; 


T-CELL SURFACE 
GLYCOPROTEIN CD4; CHAIN: 
A, B; 


INTERCELLULAR ADHESION 
MOLECULE-2; CHAIN: NULL; 


MHC CLASS I NK CELL 
RECEPTOR PRECURSOR; 
CHAIN: A; 


MHC CLASS I NK CELL
RECEPTOR PRECURSOR; 
CHAIN: A; 


MHC CLASS I NK CELL 


SEQFOLD
score
PDB annotation


KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 
IMMUNOGLOBULIN 


IMMUNE SYSTEM P58 NATURAL 
KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 
IMMUNOGLOBULIN 


IMMUNE SYSTEM P58 NATURAL 
KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 
IMMUNOGLOBULIN 


IMMUNE SYSTEM P58 NATURAL
KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 
IMMUNOGLOBULIN 


IMMUNE SYSTEM CD32; RECEPTOR, 
FC, CD32, IMMUNE SYSTEM 


IMMUNE SYSTEM CD32; RECEPTOR, 
FC, CD32, IMMUNE SYSTEM 


IMMUNE SYSTEM CD32; RECEPTOR, 
FC, CD32, IMMUNE SYSTEM 


IMMUNE SYSTEM CD32; RECEPTOR, 
FC, CD32, IMMUNE SYSTEM 


IMMUNE SYSTEM CD32; RECEPTOR, 
FC, CD32, IMMUNE SYSTEM 


IMMUNE SYSTEM CD32; RECEPTOR, 
FC, CD32, IMMUNE SYSTEM 


IMMUNE SYSTEM CD32; RECEPTOR, 
FC, CD32, IMMUNE SYSTEM


IMMUNE SYSTEM CD32; RECEPTOR,


Compound


RECEPTOR PRECURSOR; 
CHAIN: A; 


MHC CLASS I NK CELL 
RECEPTOR PRECURSOR; 
CHAIN: A; 


MHC CLASS I NK CELL 
RECEPTOR PRECURSOR; 
CHAIN: A; 




MHC CLASS I NK CELL
RECEPTOR PRECURSOR; 
CHAIN: A; 


FC GAMMA RIIB; CHAIN: A; 


FC GAMMA RIIB; CHAIN: A; 


FC GAMMA RIIB; CHAIN: A; 


FC GAMMA RIIB; CHAIN: A; 


FC GAMMA RIIB; CHAIN: A; 


FC GAMMA RIIB; CHAIN: A; 


FC GAMMA RIIB; CHAIN: A; 


FC GAMMA RIIB; CHAIN: A;


SEQFOLD
score
PDB annotation


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ADR1, ZINC FINGER, NMR


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ADR1, ZINC FINGER, NMR


PRO REGION PRO REGION, 
FOLDASE, PROTEIN FOLDING, 
SERINE PROTEASE 




ENDOCYTOSIS/EXOCYTOSIS
SYNAPTOTAGMIN ASSOCIATED 35 
KDA PROTEIN, P35A, THREE HELIX 
BUNDLE 




COMPLEX (INHIBITOR/NUCLEASE)
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (INHIBITOR/NUCLEASE)
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


Compound


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


ADR1; CHAIN: NULL;


ADR1; CHAIN: NULL;


ALPHA-LYTIC PROTEASE; 
CHAIN: A, B, C; 




SYNTAXIN-1A; CHAIN: A, B, C;




RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B,E; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


SEQFOLD
score
PDB annotation


COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE),
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (INHIBITOR/NUCLEASE)
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (INHIBITOR/NUCLEASE)
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, CELL 
ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, CELL 
ADHESION 


TRANSFERASE CRYSTAL 
STRUCTURE, RAB 

GERANYLGERANYLTRANSFERASE,
2.0 A 2 RESOLUTION, N- 


Compound


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


INTERNALIN B; CHAIN: A; 


INTERNALIN B; CHAIN: A; 


RAB 

GERANYLGERANYLTRANSFERASE ALPHA SUBUNIT;
CHAIN: A, C; RAB 


SEQFOLD
score
PDB annotation




LIGASE CYCLIN A/CDK2-
ASSOCIATED PROTEIN P45; CYCLIN
A/CDK2-ASSOCIATED PROTEIN P19;
SKP1, SKP2, F-BOX, LRR, LEUCINE-
RICH REPEAT, SCF, UBIQUITIN, 2 
E3, UBIQUITIN PROTEIN LIGASE 


LIGASE CYCLIN A/CDK2- 
ASSOCIATED P45; CYCLIN A/CDK2- 
ASSOCIATED P19; SKP1, SKP2, F-
BOX, LRRS, LEUCINE-RICH 
REPEATS, SCF, 2 UBIQUITIN, E3, 
UBIQUITIN PROTEIN LIGASE 


TRANSCRIPTION RNA1P; RANGAP;
GTPASE-ACTIVATING PROTEIN
FOR SPI1, GTPASE-ACTIVATING
PROTEIN, GAP, RNA1P, RANGAP,
LRR, LEUCINE- 2 RICH REPEAT 
PROTEIN, TWINNING, 
HEMIHEDRAL TWINNING, 3 
MEROHEDRAL TWINNING, 
MEROHEDRY 


TRANSCRIPTION RNA1P; RANGAP;
GTPASE-ACTIVATING PROTEIN 


Compound


GERANYLGERANYLTRANSFERASE BETA SUBUNIT; CHAIN:
B, D;


NUCLEAR RNA EXPORT 
FACTOR 1; CHAIN: A, B; 


NUCLEAR RNA EXPORT 
FACTOR 1; CHAIN: A, B; 


SKP2; CHAIN: A, C, E, G, I, K, M, 
O; SKP1; CHAIN: B, D, F, H, J, L,
N, P;


SKP2; CHAIN: A, C; SKP1;
CHAIN: B,D; 


GTPASE-ACTIVATING 
PROTEIN RNA1_SCHPO;
CHAIN: A, B; 


GTPASE-ACTIVATING 
PROTEIN RNA1_SCHPO;


SEQFOLD
score
PDB annotation


FOR SPI1, GTPASE-ACTIVATING
PROTEIN, GAP, RNA1P, RANGAP,
LRR, LEUCINE- 2 RICH REPEAT 
PROTEIN, TWINNING, 
HEMIHEDRAL TWINNING, 3 
MEROHEDRAL TWINNING, 
MEROHEDRY 


TRANSCRIPTION RNA1P; RANGAP;
GTPASE-ACTIVATING PROTEIN
FOR SPI1, GTPASE-ACTIVATING
PROTEIN, GAP, RNA1P, RANGAP,
LRR, LEUCINE- 2 RICH REPEAT 
PROTEIN, TWINNING, 
HEMIHEDRAL TWINNING, 3 
MEROHEDRAL TWINNING, 
MEROHEDRY 


TRANSCRIPTION RNA1P; RANGAP;
GTPASE-ACTIVATING PROTEIN
FOR SPI1, GTPASE-ACTIVATING
PROTEIN, GAP, RNA1P, RANGAP,
LRR, LEUCINE- 2 RICH REPEAT 
PROTEIN, TWINNING, 
HEMIHEDRAL TWINNING, 3 
MEROHEDRAL TWINNING, 
MEROHEDRY 




ACETYLATION RNASE INHIBITOR, 
RIBONUCLEASE/ANGIOGENIN
INHIBITOR ACETYLATION,
LEUCINE-RICH REPEATS


Compound


CHAIN: A, B; 


GTPASE-ACTIVATING 
PROTEIN RNA1_SCHPO;
CHAIN: A, B; 


GTPASE-ACTIVATING 
PROTEIN RNA1_SCHPO;
CHAIN: A, B; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 


SEQFOLD
score








109.97 
PDB annotation


CONTRACTILE PROTEIN 
TROPOMYOSIN COILED-COIL 
ALPHA-HELICAL, CONTRACTILE 
PROTEIN 


TRANSMEMBRANE PROTEIN 
COLICIN, BACTERIOCIN, ION 
CHANNEL FORMATION, 
TRANSMEMBRANE 2 PROTEIN 


ENDOCYTOSIS/EXOCYTOSIS
SYNAPTOTAGMIN ASSOCIATED 35 
KDA PROTEIN, P35A, THREE HELIX 
BUNDLE 


ISOMERASE ISOMERASE, MUTASE,
INTRAMOLECULAR TRANSFERASE 




LIGASE CYCLIN A/CDK2- 
ASSOCIATED PROTEIN P45; CYCLIN 
A/CDK2-ASSOCIATED PROTEIN P19; 
SKP1, SKP2, F-BOX, LRR, LEUCINE-
RICH REPEAT, SCF, UBIQUITIN, 2 
E3, UBIQUITIN PROTEIN LIGASE 


LIGASE SKP2 F-BOX; SKP1; SKP1,
SKP2, F-BOX, LRR, LEUCINE-RICH 
REPEAT, SCF, UBIQUITIN, 2 E3, 
UBIQUITIN PROTEIN LIGASE 


LIGASE CYCLIN A/CDK2- 
ASSOCIATED P45; CYCLIN A/CDK2- 
ASSOCIATED P19; SKP1, SKP2, F-
BOX, LRRS, LEUCINE-RICH 
REPEATS, SCF, 2 UBIQUITIN, E3, 
UBIQUITIN PROTEIN LIGASE 


TRANSCRIPTION TUMOR 
SUPPRESSOR, CANCER, UBIQUITIN, 
BETA SANDWICH, 2 


Compound


TROPOMYOSIN; CHAIN: A, B, 
C,D 


COLICIN IA; CHAIN: NULL; 


SYNTAXIN-1A; CHAIN: A, B, C;






SKP2; CHAIN: A, C, E, G, I, K, M, 
O; SKP1; CHAIN: B, D, F, H, J, L,
N, P; 


CYCLIN A/CDK2-ASSOCIATED 
P19; CHAIN: A, C; CYCLIN 
A/CDK2-ASSOCIATED P45;
CHAIN: B, D; 


SKP2; CHAIN: A, C; SKP1;
CHAIN: B, D; 


ELONGIN B; CHAIN: A, D, G, J; 
ELONGIN C; CHAIN: B, E, H, K; 
VHL; CHAIN: C, F, I, L; 


SEQFOLD
score




















PMF 
score 
1.00


0.62 


0.99 


1.00 


Verify
score
0.46


0.36 
0.11




0.78 
PDB annotation


TRANSCRIPTION, 
TRANSCRIPTIONAL ELONGATION 


TRANSCRIPTION TUMOR 
SUPPRESSOR, CANCER, UBIQUITIN, 
BETA SANDWICH, 2 
TRANSCRIPTION, 
TRANSCRIPTIONAL ELONGATION 


PHOSPHOTRANSFERASE RHOGAP 
DOMAIN; PHOSPHOTRANSFERASE, 
TPASE ACTIVATING PROTEIN, GAP, 
CDC42, 2 PHOSPHOINOSITIDE 3- 
KINASE, SH3 DOMAIN, SH2 
DOMAIN, 3 SIGNAL 
TRANSDUCTION 


PHOSPHOTRANSFERASE RHOGAP 
DOMAIN; PHOSPHOTRANSFERASE, 
TPASE ACTIVATING PROTEIN, GAP, 
CDC42, 2 PHOSPHOINOSITIDE 3- 
KINASE, SH3 DOMAIN, SH2 
DOMAIN, 3 SIGNAL 
TRANSDUCTION 


PHOSPHOTRANSFERASE RHOGAP 
DOMAIN; PHOSPHOTRANSFERASE, 
TPASE ACTIVATING PROTEIN, GAP, 
CDC42, 2 PHOSPHOINOSITIDE 3-
KINASE, SH3 DOMAIN, SH2 
DOMAIN, 3 SIGNAL 
TRANSDUCTION 


PHOSPHOTRANSFERASE RHOGAP 
DOMAIN; PHOSPHOTRANSFERASE, 
TPASE ACTIVATING PROTEIN, GAP, 
CDC42, 2 PHOSPHOINOSITIDE 3- 
KINASE, SH3 DOMAIN, SH2 


Compound




ELONGIN B; CHAIN: A, D, G, J; 
ELONGIN C; CHAIN: B, E, H, K; 
VHL; CHAIN: C, F, I, L; 


PHOSPHATIDYLINOSITOL 3- 
KINASE; CHAIN: A, B; 


PHOSPHATIDYLINOSITOL 3- 
KINASE; CHAIN: A, B; 


PHOSPHATIDYLINOSITOL 3- 
KINASE; CHAIN: A, B; 


PHOSPHATIDYLINOSITOL 3- 
KINASE; CHAIN: A, B; 


SEQFOLD
score




52.02 






67.87 




PMF 
score 






0.63 


1.00 




0.76 


Verify 
score 






0.10 


0.57 




0.09 
PDB annotation


DOMAIN, 3 SIGNAL 
TRANSDUCTION 


PHOSPHOTRANSFERASE RHOGAP 
DOMAIN; PHOSPHOTRANSFERASE, 
TPASE ACTIVATING PROTEIN, GAP, 
CDC42, 2 PHOSPHOINOSITIDE 3- 
KINASE, SH3 DOMAIN, SH2 
DOMAIN, 3 SIGNAL 
TRANSDUCTION 


G-PROTEIN CDC42 GTPASE- 
ACTIVATING PROTEIN; G-PROTEIN, 
GAP, SIGNAL-TRANSDUCTION


G-PROTEIN CDC42 GTPASE- 
ACTIVATING PROTEIN; G-PROTEIN, 
GAP, SIGNAL-TRANSDUCTION 


G-PROTEIN CDC42 GTPASE- 
ACTIVATING PROTEIN; G-PROTEIN, 
GAP, SIGNAL-TRANSDUCTION 


COMPLEX(GTPASE 
ACTIVATN/PROTO-ONCOGENE) 
GTPASE-ACTIVATING PROTEIN 
RHOGAP; COMPLEX (GTPASE 
ACTIVATION/PROTO-ONCOGENE), 
GTPASE, 2 TRANSITION STATE, GAP 


COMPLEX(GTPASE 
ACTIVATN/PROTO-ONCOGENE) 
GTPASE-ACTIVATING PROTEIN 
RHOGAP; COMPLEX (GTPASE 
ACTIVATION/PROTO-ONCOGENE), 
GTPASE, 2 TRANSITION STATE, GAP 


COMPLEX(GTPASE 
ACTIVATN/PROTO-ONCOGENE) 
GTPASE-ACTIVATING PROTEIN 
RHOGAP; COMPLEX (GTPASE 


Compound




PHOSPHATIDYLINOSITOL 3- 




RHOGAP; CHAIN: NULL; 


RHOGAP; CHAIN: NULL; 


RHOGAP; CHAIN: NULL; 


P50-RHOGAP; CHAIN: A; 
TRANSFORMING PROTEIN 
RHOA; CHAIN: B; 


P50-RHOGAP; CHAIN: A; 
TRANSFORMING PROTEIN 
RHOA; CHAIN: B; 


P50-RHOGAP; CHAIN: A; 
TRANSFORMING PROTEIN 
RHOA; CHAIN: B; 


SEQFOLD
score










97.96 






107.06 


PMF 
score 




0.95 


1.00 


0.54 




1.00 
Verify
score
0.44
0.08
PDB annotation


ENZYME, UBIQUITIN-DIRECTED 2 
PROTEOLYSIS, CELL CYCLE 
CONTROL, LIGASE 


UBIQUITIN CONJUGATION UBC1;
UBIQUITIN CONJUGATION, LIGASE 


UBIQUITIN CONJUGATION UBC1;
UBIQUITIN CONJUGATION, LIGASE 


UBIQUITIN CONJUGATION 
UBIQUITIN CONJUGATION, 
UBIQUITIN CARRIER PROTEIN, 
THIOESTER 2 BOND, LIGASE 


UBIQUITIN CONJUGATION
UBIQUITIN CONJUGATION, 
UBIQUITIN CARRIER PROTEIN, 
THIOESTER 2 BOND, LIGASE 


UBIQUITIN CONJUGATION UBC7; 
UBIQUITIN CONJUGATION, LIGASE, 
YEAST 


UBIQUITIN CONJUGATION UBC7; 
UBIQUITIN CONJUGATION, LIGASE, 
YEAST 




COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


Compound




UBIQUITIN CONJUGATING 
ENZYME; CHAIN: NULL; 


UBIQUITIN CONJUGATING 
ENZYME; CHAIN: NULL; 


UBIQUITIN CONJUGATING 
ENZYME; CHAIN: NULL; 


UBIQUITIN CONJUGATING 
ENZYME; CHAIN: NULL; 


UBIQUITIN CONJUGATING 
ENZYME; CHAIN: NULL; 


UBIQUITIN CONJUGATING 
ENZYME; CHAIN: NULL; 




RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B,E; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


SEQFOLD
score




96.00 




82.57 




77.96 








67.17 


PMF 
score 






1.00 




1.00 




1.00 




0.03 




Verify 
score 






0.49 




0.66 




0.80 




-0.14 
PDB annotation


COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


Compound


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


SEQFOLD
score
















PMF 
score 


0.68 


0.76 


0.99 


0.60 


0.01 


0.22 


0.13 


Verify 
score 


0.07 
PDB annotation


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 


Compound


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q,
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D;


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B";


SEQFOLD
score




















PMF 
score 


0.68 


0.98 
Verify
score
PDB annotation


(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA,
SNRNP,RIBONUCLEOPROTEIN


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, CELL 
ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, CELL
ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, CELL
ADHESION 


TRANSFERASE CRYSTAL 
STRUCTURE, RAB 

GERANYLGERANYLTRANSFERASE, 
2.0 A 2 RESOLUTION, N- 
FORMYLMETHIONINE, ALPHA 
SUBUNIT, BETA SUBUNIT 


CONTRACTILE PROTEIN LEUCINE- 
RICH REPEAT, BETA-BETA-ALPHA 
CYLINDER, DYNEIN, 2 
CHLAMYDOMONAS, FLAGELLA


CONTRACTILE PROTEIN LEUCINE-
RICH REPEAT, BETA-BETA-ALPHA
CYLINDER, DYNEIN, 2
CHLAMYDOMONAS, FLAGELLA


CONTRACTILE PROTEIN LEUCINE- 
RICH REPEAT, BETA-BETA-ALPHA 
CYLINDER, DYNEIN, 2 
CHLAMYDOMONAS, FLAGELLA


Compound


CHAIN: B, D;


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


INTERNALIN B; CHAIN: A; 


INTERNALIN B; CHAIN: A; 


INTERNALIN B; CHAIN: A;


RAB 

GERANYLGERANYLTRANSFERASE ALPHA SUBUNIT;
CHAIN: A, C; RAB
GERANYLGERANYLTRANSFERASE BETA SUBUNIT; CHAIN:
B, D; 


OUTER ARM DYNEIN; CHAIN: 
A; 


OUTER ARM DYNEIN; CHAIN: 
A; 


OUTER ARM DYNEIN; CHAIN: 
A; 


SEQFOLD
score




















PMF 
score 
PDB annotation


LRR, LEUCINE- 2 RICH REPEAT 
PROTEIN, TWINNING, 
HEMIHEDRAL TWINNING, 3 
MEROHEDRAL TWINNING, 
MEROHEDRY 


TRANSCRIPTION RNA1P; RANGAP;
GTPASE-ACTIVATING PROTEIN
FOR SPI1, GTPASE-ACTIVATING
PROTEIN, GAP, RNA1P, RANGAP,
LRR, LEUCINE- 2 RICH REPEAT 
PROTEIN, TWINNING, 
HEMIHEDRAL TWINNING, 3 
MEROHEDRAL TWINNING, 
MEROHEDRY 


ACETYLATION RNASE INHIBITOR,
RIBONUCLEASE/ANGIOGENIN
INHIBITOR ACETYLATION,
LEUCINE-RICH REPEATS


ACETYLATION RNASE INHIBITOR,
RIBONUCLEASE/ANGIOGENIN
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS 




POTASSIUM CHANNELS 
POTASSIUM CHANNELS, 
TETRAMERIZATION DOMAIN, X- 
RAY 2 STRUCTURE, APLYSIA KV1.1


SIGNALING PROTEIN VOLTAGE- 
GATED POTASSIUM CHANNEL, 
ASSEMBLY DOMAIN, TETRAMER


METAL TRANSPORT ION CHANNEL, 
OXIDOREDUCTASE, BETA SUBUNIT 


SIGNALING PROTEIN VOLTAGE-


Compound




GTPASE-ACTIVATING 
PROTEIN RNA1_SCHPO;
CHAIN: A, B; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 




POTASSIUM CHANNEL KV1.1;
CHAIN: NULL; 


KV1.2 VOLTAGE-GATED 
POTASSIUM CHANNEL; 
CHAIN: A, B, C, D, E, F, G, H;


KV BETA2 PROTEIN; CHAIN: A; 
POTASSIUM CHANNEL KV1.1;
CHAIN: E; 


KV1.2 VOLTAGE-GATED 


SEQFOLD
score




















PMF 
score 




0.15
Verify
score
PDB annotation


NUCLEOTIDE EXCHANGE FACTOR,
GENE REGULATION 


SIGNALING PROTEIN DAPP1, PHISH,
BAM32; PLECKSTRIN, 3-
PHOSPHOINOSITIDES, INOSITOL
TETRAKISPHOSPHATE 2 SIGNAL 
TRANSDUCTION PROTEIN, 
ADAPTOR PROTEIN 


SIGNALING PROTEIN DAPP1, PHISH,
BAM32; PLECKSTRIN, 3-
PHOSPHOINOSITIDES, INOSITOL
TETRAKISPHOSPHATE 2 SIGNAL
TRANSDUCTION PROTEIN,
ADAPTOR PROTEIN




SIGNAL TRANSDUCTION SON OF 
SEVENLESS; PLECKSTRIN, SON OF 
SEVENLESS, SIGNAL 
TRANSDUCTION 


CONTRACTILE PROTEIN ACIDIC 
PROFILIN ISOFORM, ACTIN- 
BINDING PROTEIN, POLY-L- 2 
PROLINE BINDING PROTEIN, 
CONTRACTILE PROTEIN


CONTRACTILE PROTEIN ACIDIC 
PROFILIN ISOFORM, ACTIN-


Compound




DUAL ADAPTOR OF 
PHOSPHOTYROSINE AND 3- 
CHAIN: A;


DUAL ADAPTOR OF 
PHOSPHOTYROSINE AND 3- 
CHAIN: A; 


PHOSPHORYLATION 
PLECKSTRIN (N-TERMINAL 
PLECKSTRIN HOMOLOGY 
DOMAIN) MUTANT 1PLS 3
WITH LEU GLU (HIS)6 ADDED
TO THE C TERMINUS 1PLS 4
(INS(G105-LEHHHHHH)) (NMR,
25 STRUCTURES) 1PLS 5


SOS1; CHAIN: NULL;


PROFILIN II; CHAIN: A, B, C, D;


PROFILIN II; CHAIN: A, B, C, D; 


SEQFOLD
score
















PMF 
score 




0.33 


0.35 


0.93 
Verify
score
PDB annotation


BINDING PROTEIN, POLY-L- 2 
PROLINE BINDING PROTEIN, 
CONTRACTILE PROTEIN 


PROTEIN BINDING ACETYLATION, 
ACTIN-BINDING PROTEIN, 
MULTIGENE FAMILY 


PROTEIN BINDING ACETYLATION, 
ACTIN-BINDING PROTEIN, 
MULTIGENE FAMILY 








CONTRACTILE PROTEIN ACIDIC 
PROFILIN ISOFORM, ACTIN- 
BINDING PROTEIN, POLY-L- 2 
PROLINE BINDING PROTEIN, 
CONTRACTILE PROTEIN 


CONTRACTILE PROTEIN ACIDIC 
PROFILIN ISOFORM, ACTIN- 
BINDING PROTEIN, POLY-L- 2 
PROLINE BINDING PROTEIN, 
CONTRACTILE PROTEIN 








MOTOR PROTEIN MOTOR PROTEIN, 
ATPASE, MICROTUBULE 
ASSOCIATED 


MOTOR PROTEIN MOTOR PROTEIN, 
ATPASE, MICROTUBULE 


Compound




PROFILIN; CHAIN: NULL; 


PROFILIN; CHAIN: NULL; 


ACTIN BINDING PROTEIN 
PROFILIN 1PNE 3


ACTIN BINDING PROTEIN 
PROFILIN 1PNE 3




PROFILIN II; CHAIN: A, B, C, D;


PROFILIN II; CHAIN: A, B, C, D; 


ACTIN BINDING PROTEIN 
PROFILIN 1PNE 3


ACTIN BINDING PROTEIN 
PROFILIN 1PNE 3




KINESIN; CHAIN: NULL; 


KINESIN; CHAIN: NULL; 


SEQFOLD
score






167.69 




167.40 










133.14 








PMF 
score 




1.00 




1.00 






1.00 


1.00 


1.00
Verify
score
PDB annotation


ASSOCIATED 

CONTRACTILE PROTEIN NCD, 
CLARET SEGREGATIONAL 
PROTEIN; NCD CRYSTAL 
STRUCTURE, MICROTUBULE 
MOTORS, KINESIN 2 SUPERFAMILY, 
CONTRACTILE PROTEIN


MOTOR PROTEIN MOTOR PROTEIN,
CYTOSKELETON 


MOTOR PROTEIN MOTOR PROTEIN,
CYTOSKELETON 


CONTRACTILE PROTEIN KINESIN, 
MICROTUBULE-BASED MOTOR, 
NCD, CONTRACTILE PROTEIN 


CONTRACTILE PROTEIN 
CONTRACTILE PROTEIN, KAR3, 
KINESIN-RELATED PROTEIN, 
MOTOR 2 PROTEIN, ATPASE, P- 
LOOP, MICROTUBULE BINDING 
PROTEIN 


MOTOR PROTEIN MOTOR PROTEIN,
CYTOSKELETON


MOTOR PROTEIN MOTOR PROTEIN, 
CYTOSKELETON 




AMINOTRANSFERASE 
AMINOTRANSFERASE, PYRIDOXAL 
ENZYME 


Compound


MICROTUBULE MOTOR 
PROTEIN NCD; CHAIN: A, B, C, 
D; 


KINESIN; CHAIN: A, B; 


KINESIN; CHAIN: A, B; 


KINESIN MOTOR NCD; CHAIN: 
A; 


KINESIN-LIKE PROTEIN KAR3; 
CHAIN: NULL; 


KINESIN HEAVY CHAIN; 
CHAIN: A, B, C, D;


KINESIN HEAVY CHAIN; 
CHAIN: A, B, C, D; 


TRANSFERASE(AMINOTRANSFERASE) ASPARTATE
AMINOTRANSFERASE
(E.C.2.6.1.1) COMPLEXED WITH
1ARS 3 PYRIDOXAL-5'-
PHOSPHATE 1ARS 4


ASPARTATE 

AMINOTRANSFERASE; CHAIN: 
A, B; 


SEQFOLD
score




















PMF 
score 


0.10 


0.83 


1.00


0.54 
0.98
0.07


0.75 


Verify
score
PDB annotation


AMINOTRANSFERASE, 5- 
FLUOROMETHYLORNITHINE, PLP- 
DEPENDENT 2 ENZYME, 
PYRIDOXAL PHOSPHATE




HYDROLASE SERCA1; ION PUMP,
CALCIUM, MEMBRANE PROTEIN, P-
TYPE ATPASE, ACTIVE 2
TRANSPORT




HYDROLASE TETRATRICOPEPTIDE, 
TRP; HYDROLASE, PHOSPHATASE, 
PROTEIN-PROTEIN INTERACTIONS, 
TPR, 2 SUPER-HELIX, X-RAY 
STRUCTURE 


HYDROLASE TETRATRICOPEPTIDE, 
TRP; HYDROLASE, PHOSPHATASE, 
PROTEIN-PROTEIN INTERACTIONS, 
TPR, 2 SUPER-HELIX, X-RAY 
STRUCTURE 


SIGNALLING COMPLEX RAC1;
P67PHOX; SIGNALLING COMPLEX, 
GTPASE, NADPH OXIDASE, 
PROTEIN-PROTEIN 2 COMPLEX, TPR 
MOTIF 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSP90, 2 PROTEIN 
BINDING 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSP90, 2 PROTEIN 
BINDING 


CHAPERONE HOP, TPR-DOMAIN,


Compound


AMINOTRANSFERASE; CHAIN: 
A, B, C; 




CALCIUM-TRANSPORTING 
ATPASE SARCOPLASMIC 
CHAIN: A; 




SERINE/THREONINE PROTEIN 
PHOSPHATASE 5; CHAIN: 
NULL; 


SERINE/THREONINE PROTEIN 
PHOSPHATASE 5; CHAIN: 
NULL; 


RAS-RELATED C3 BOTULINUM
TOXIN SUBSTRATE 1; CHAIN: 
A; NEUTROPHIL CYTOSOL 
FACTOR 2 (NCF-2) CHAIN: B; 


TPR2A-DOMAIN OF HOP;
CHAIN: A; HSP90-PEPTIDE 
MEEVD; CHAIN: B; 


TPR2A-DOMAIN OF HOP; 
CHAIN: A; HSP90-PEPTIDE 
MEEVD; CHAIN: B; 


TPR2A-DOMAIN OF HOP;


SEQFOLD
score
PDB annotation


DEVELOPMENT, 2 SPECIFICITY 










COMPLEX (DNA-BINDING 
PROTEIN/DNA) HD; 
HOMEODOMAIN, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) HD; 
HOMEODOMAIN, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 




COMPLEX (ZINC FINGER/DNA)


Compound


DNA (5'- CHAIN: C; DNA (5'- 
CHAIN: D; 


DNA-BINDING FUSHI TARAZU 
PROTEIN (HOMEODOMAIN) 
(NMR, 20 STRUCTURES) 1FTZ 3


DNA-BINDING FUSHI TARAZU 
PROTEIN (HOMEODOMAIN) 
(NMR, 20 STRUCTURES) 1FTZ 3


DNA-BINDING PROTEIN 
ANTENNAPEDIA PROTEIN 
(HOMEODOMAIN) MUTANT 
WITH CYS 39 1SAN 3
REPLACED BY SER AND
RESIDUES 1-6 DELETED
(C39S,DEL 1-6) 1SAN 4 (NMR, 20
STRUCTURES) 1SAN 5


DNA-BINDING PROTEIN 
ANTENNAPEDIA PROTEIN 
(HOMEODOMAIN) MUTANT 
WITH CYS 39 1SAN 3
REPLACED BY SER AND
RESIDUES 1-6 DELETED
(C39S,DEL 1-6) 1SAN 4 (NMR, 20
STRUCTURES) 1SAN 5


ANTENNAPEDIA PROTEIN; 
CHAIN: A, B; DNA; CHAIN: C, 
D,E,F; 


ANTENNAPEDIA PROTEIN; 
CHAIN: A, B; DNA; CHAIN: C, 
D,E,F; 




QGSR ZINC FINGER PEPTIDE;


SEQFOLD
score




59.53 




57.26 






61.44 






PMF 
score 






0.92 
PDB annotation


INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC


Compound


PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C,F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C,F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 


SEQFOLD
score
















105.23 


PMF 
score 




0.49 


0.48 


1.00 


1.00 


0.95 


1.00 




Verify 
score 
PDB annotation


FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA
POLYMERASE III, 2

TRANSCRIPTION INITIATION, ZINC
FINGER PROTEIN


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 


Compound


CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G;


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


SEQFOLD
score














117.50 


PMF 
score 




1.00 


1.00 


1.00 


1.00 


0.46 




Verify 
score 




0.59 
0.03
PDB annotation


REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION,


Compound


ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A,B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 


SEQFOLD
score












PMF 
score 






0.81 


0.16 


1.00 


Verify 
score 




0.47 
PDB annotation


INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 


Compound


CHAIN: A,B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


SEQFOLD
score
PMF
score 




1.00 
1.00


Verify 
score 




0.25 
0.42
PDB annotation


PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 


Compound


CHAIN: A; DNA; CHAIN: C, D; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


SEQFOLD
score








106.53 












PMF 
score 




0.34 


0.94 




1.00


0.95 


1.00 


1.00 


0.99 


Verify 
score 




-0.08 
PDB annotation


GENE; NMR, TFIIIA, PROTEIN, DNA,
TRANSCRIPTION FACTOR, 5S RNA 2 
GENE, DNA BINDING PROTEIN, 
ZINC FINGER, COMPLEX 3 
(TRANSCRIPTION 


COMPLEX (TRANSCRIPTION
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 


ZINC FINGER DNA BINDING 
DOMAIN DNA BINDING MOTIF, 
ZINC FINGER DNA BINDING 

DOMAIN




COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA)

LIGASE SKP2 F-BOX; SKP1; SKP1,


Compound


CHAIN: E, F; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


SWI5; CHAIN: NULL; 


COMPLEX(TRANSCRIPTION 
REGULATION/DNA) 
TRAMTRACK PROTEIN (TWO 
ZINC-FINGER PEPTIDE) 
COMPLEXED WITH 2DRP 3 
DNA 2DRP 4 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;

CYCLIN A/CDK2-ASSOCIATED 


SEQFOLD
score












54.61 


PMF 
score 
0.00


0.36 
PDB annotation


TRANSPORT AND PROTECTION




COMPLEX (TWO ELONGATION 
FACTORS) ELONGATION FACTOR 
FOR TRANSFER, HEAT UNSTABLE, 
ELONGATION FACTOR FOR 
TRANSFER, HEAT STABLE,
ELONGATION FACTOR, COMPLEX 




TRANSLATION EF-TU; GTPASE, 
MOLECULAR SWITCH, TRNA, 
RIBOSOME, Q-BETA REPLICASE, 2 
CHAPERONE, DISULFIDE 

ISOMERASE


TRANSLATION PROTEIN-PROTEIN 
COMPLEX 


Compound


CHAIN: A, B; 


ELONGATION FACTOR; 
CHAIN: A, B; 


ELONGATION FACTOR TU; 
CHAIN: A, C; ELONGATION 
FACTOR TS; CHAIN: B, D; 


ELONGATION FACTOR TU; 
CHAIN: A, C; ELONGATION
FACTOR TS; CHAIN: B, D; 


GTP-BINDING PROTEIN ERA; 
CHAIN: A, B;


TRANSPORT AND 
PROTECTION PROTEIN 
ELONGATION FACTOR TU 
(DOMAIN I) - GUANOSINE
DIPHOSPHATE 1ETU 4
COMPLEX 1ETU 5


ELONGATION FACTOR TU (EF- 
TU); CHAIN: A; 


ELONGATION FACTOR EEF1A;
CHAIN: A; ELONGATION 


SEQFOLD
score




129.64 




115.68 










PMF 
score 






0.86 




0.13 


0.45 


1.00 


1.00 


Verify 
score 
PDB annotation


PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSP90, 2 PROTEIN 
BINDING


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSC70, 2 HSP70, PROTEIN 
BINDING 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL
REPEAT, HSC70, 2 HSP70, PROTEIN 
BINDING 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL
REPEAT, HSC70, 2 HSP70, PROTEIN 
BINDING 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSC70, 2 HSP70, PROTEIN 
BINDING 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSC70, 2 HSP70, PROTEIN 
BINDING 




SIGNALING PROTEIN 
PEROXISMORE RECEPTOR 1, PTS1-
BP, PEROXIN-5, PTS1 PROTEIN-
PEPTIDE COMPLEX,
TETRATRICOPEPTIDE REPEAT, TPR,


Compound


CHAIN: A; HSP90-PEPTIDE 
MEEVD; CHAIN: B; 


TPR1-DOMAIN OF HOP; CHAIN:
A, B; HSC70-PEPTIDE; CHAIN:
C, D;


TPR1-DOMAIN OF HOP; CHAIN:
A, B; HSC70-PEPTIDE; CHAIN:
C, D;


TPR1-DOMAIN OF HOP; CHAIN:
A, B; HSC70-PEPTIDE; CHAIN:
C, D;


TPR1-DOMAIN OF HOP; CHAIN:
A, B; HSC70-PEPTIDE; CHAIN:
C, D;


TPR1-DOMAIN OF HOP; CHAIN:
A, B; HSC70-PEPTIDE; CHAIN:
C, D;


PEROXISOMAL TARGETING 
SIGNAL 1 RECEPTOR; CHAIN: 
A, B; PTS1-CONTAINING
PEPTIDE; CHAIN: C, D;


PEROXISOMAL TARGETING
SIGNAL 1 RECEPTOR; CHAIN:
A, B; PTS1-CONTAINING
PEPTIDE; CHAIN: C, D; 


SEQFOLD
score


















PMF 
score 




0.98 
0.95
Verify
PDB annotation


2 HELICAL REPEAT


SIGNALING PROTEIN 
PEROXISMORE RECEPTOR 1, PTS1-
BP, PEROXIN-5, PTS1 PROTEIN-
PEPTIDE COMPLEX,
TETRATRICOPEPTIDE REPEAT, TPR, 
2 HELICAL REPEAT 


SIGNALING PROTEIN 
PEROXISMORE RECEPTOR 1, PTS1-
BP, PEROXIN-5, PTS1 PROTEIN-
PEPTIDE COMPLEX,
TETRATRICOPEPTIDE REPEAT, TPR, 
2 HELICAL REPEAT 




LIPID TRANSPORT APO A-I; 
LIPOPROTEIN, LIPID TRANSPORT, 
CHOLESTEROL METABOLISM, 2 
ATHEROSCLEROSIS, HDL, LCAT- 
ACTIVATION 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


ENDOCYTOSIS/EXOCYTOSIS 
NSEC1; PROTEIN-PROTEIN
COMPLEX, MULTI-SUBUNIT 


ENDOCYTOSIS/EXOCYTOSIS 
SYNAPTOTAGMIN ASSOCIATED 35 


Compound




PEROXISOMAL TARGETING 
SIGNAL 1 RECEPTOR; CHAIN: 
A, B; PTS1-CONTAINING
PEPTIDE; CHAIN: C, D;


PEROXISOMAL TARGETING
SIGNAL 1 RECEPTOR; CHAIN:
A, B; PTS1-CONTAINING
PEPTIDE; CHAIN: C, D; 




APOLIPOPROTEIN A-I; CHAIN: 
A, B, C, D; 


ALPHA SPECTRIN; CHAIN: A, 
B,C; 


ALPHA SPECTRIN; CHAIN: A, 


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A; 
CHAIN: B; 


SYNTAXIN-1A; CHAIN: A, B, C;


SEQFOLD
score










68.91 


68.37 








PMF 
score 
Verify
score
PDB annotation


KDA PROTEIN, P35A, THREE HELIX 
BUNDLE 


PROTEIN TRANSPORT HELIX-
TURN-HELIX TPR-LIKE REPEAT,
PROTEIN TRANSPORT 


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN


TRANSCRIPTION REGULATION 
SIGMA70; RNA POLYMERASE 
SIGMA FACTOR, TRANSCRIPTION 
REGULATION 


COMPLEX (INHIBITOR/NUCLEASE)
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 




LIGASE CYCLIN A/CDK2- 
ASSOCIATED PROTEIN P45; CYCLIN 


Compound




VESICULAR TRANSPORT
PROTEIN SEC17; CHAIN: A;


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


RNA POLYMERASE PRIMARY 
SIGMA FACTOR; CHAIN: NULL; 


RIBONUCLEASE INHIBITOR;
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


NUCLEAR RNA EXPORT 
FACTOR 1; CHAIN: A, B; 




SEQFOLD
score




66.08 


71.18 
PDB annotation


A/CDK2-ASSOCIATED PROTEIN P19;
SKP1, SKP2, F-BOX, LRR, LEUCINE-
RICH REPEAT, SCF, UBIQUITIN, 2
E3, UBIQUITIN PROTEIN LIGASE


LIGASE CYCLIN A/CDK2- 
ASSOCIATED PROTEIN P45; CYCLIN 
A/CDK2-ASSOCIATED PROTEIN P19;
SKP1, SKP2, F-BOX, LRR, LEUCINE-
RICH REPEAT, SCF, UBIQUITIN, 2
E3, UBIQUITIN PROTEIN LIGASE


LIGASE CYCLIN A/CDK2- 
ASSOCIATED P45; CYCLIN A/CDK2- 
ASSOCIATED P19; SKP1, SKP2, F-
BOX, LRRS, LEUCINE-RICH
REPEATS, SCF, 2 UBIQUITIN, E3, 
UBIQUITIN PROTEIN LIGASE 


ACETYLATION RNASE INHIBITOR, 
RIBONUCLEASE/ANGIOGENIN 
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS 


ACETYLATION RNASE INHIBITOR, 
RIBONUCLEASE/ANGIOGENIN 
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS 








Compound




SKP2; CHAIN: A, C, E, G, I, K, M,
O; SKP1; CHAIN: B, D, F, H, J, L,
N, P;


SKP2; CHAIN: A, C; SKP1;
CHAIN: B, D; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 




LYASE(OXO-ACID) CARBONIC 
ANHYDRASE II (E.C.4.2.1.1) (50 
MM TRIS, 2CBA 3 3 M 
AMMONIUM SULFATE, PH 7.8) 
2CBA 4


LYASE(OXO-ACID) CARBONIC 
ANHYDRASE II (E.C.4.2.1.1) (50 
MM TRIS, 2CBA 3 3 M 
AMMONIUM SULFATE, PH 7.8) 


SEQFOLD
score
















403.66 


PMF 
PDB annotation






OXIDOREDUCTASE FUMARATE 
REDUCTASE, SUCCINATE 
DEHYDROGENASE, RESPIRATORY 2 
CHAIN, CITRIC ACID CYCLE, 
FLAVOPROTEIN, IRON-SULPHUR 3 
PROTEIN, DIHAEM CYTOCHROME B 


OXIDOREDUCTASE 
OXIDOREDUCTASE 






ACTIN-BINDING PROTEIN ACTIN- 
BINDING PROTEIN, CALCIUM- 
BINDING, PHOSPHORYLATION 


ACTIN-BINDING PROTEIN ACTIN- 
BINDING PROTEIN, CALCIUM- 
BINDING, PHOSPHORYLATION 


STRUCTURAL PROTEIN CALPONIN 
HOMOLOGY, ACTIN BINDING, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN CALPONIN


Compound


HYDROPHOBIC PROTEIN; 
CHAIN: D,P; 


OXIDOREDUCTASE 
DIHYDROLIPOAMIDE 
DEHYDROGENASE (E.C.1.8.1.4)
COMPLEX WITH 1LPF 3
FLAVIN-ADENINE-
DINUCLEOTIDE (FAD) 1LPF 4


FUMARATE REDUCTASE 
FLAVOPROTEIN SUBUNIT; 
CHAIN: A, D; FUMARATE 
REDUCTASE IRON-SULFUR 
PROTEIN; CHAIN: B, E; 
FUMARATE REDUCTASE 
CYTOCHROME B SUBUNIT; 
CHAIN: C, F; 


FLAVOCYTOCHROME C3 
FUMARATE REDUCTASE; 
CHAIN: A, D; 


OXIDOREDUCTASE
DIHYDROLIPOAMIDE
DEHYDROGENASE (E.C.1.8.1.4)
3LAD 3




T-FIMBRIN; CHAIN: NULL; 


T-FIMBRIN; CHAIN: NULL; 


UTROPHIN; CHAIN: A, B; 


UTROPHIN; CHAIN: A, B;


SEQFOLD
score
PDB annotation


HOMOLOGY, ACTIN BINDING, 
STRUCTURAL PROTEIN


STRUCTURAL PROTEIN CALPONIN 
HOMOLOGY, ACTIN BINDING, 
STRUCTURAL PROTEIN


ACTIN-BINDING CALPONIN 
HOMOLOGY (CH) DOMAIN;
FILAMENTOUS ACTIN-BINDING
DOMAIN, CYTOSKELETON




STRUCTURAL PROTEIN 
DYSTROPHIN, MUSCULAR 
DYSTROPHY, CALPONIN 
HOMOLOGY DOMAIN, 2 ACTIN-
BINDING, UTROPHIN


STRUCTURAL PROTEIN CALPONIN 
HOMOLOGY DOMAIN, DOMAIN
SWAPPING, ACTIN BINDING, 2 
UTROPHIN, DYSTROPHIN, 
STRUCTURAL PROTEIN


LIGASE E6AP; UBCH7; BILOBAL
STRUCTURE, ELONGATED SHAPE, 
E3 UBIQUITIN LIGASE, E2 2 
UBIQUITIN CONJUGATING ENZYME


LIGASE E6AP; UBCH7; BILOBAL
STRUCTURE, ELONGATED SHAPE, 
E3 UBIQUITIN LIGASE, E2 2 
UBIQUITIN CONJUGATING ENZYME 

CELL ADHESION NCAM; NCAM,


Compound




UTROPHIN; CHAIN: A, B; 


SPECTRIN BETA CHAIN; 
CHAIN: A; 


SPECTRIN BETA CHAIN; 
CHAIN: A; 


DYSTROPHIN; CHAIN: A, B, C, 
D; 


UTROPHIN ACTIN BINDING 
REGION; CHAIN: A, B; 


UBIQUITIN-PROTEIN LIGASE 
E3A; CHAIN: A, B, C; 
UBIQUITIN CONJUGATING 
ENZYME E2; CHAIN: D;


UBIQUITIN-PROTEIN LIGASE 
E3A; CHAIN: A, B, C; 
UBIQUITIN CONJUGATING 
ENZYME E2; CHAIN: D; 

NEURAL CELL ADHESION 


SEQFOLD
score






86.88 
PMF
Verify
score 




0.85 




0.86 


0.69 


0.68 


0.24 


0.66 


Psi Blast 




1e-35


START AA


1574 


1575 






WO 02/070539 



PCT/US02/05095 



PDB annotation 


IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2; FGFR2; 
IMMUNOGLOBULIN (IG)LIKE 
DOMAINS BELONGING TO THE I- 
SET 2 SUBGROUP WITHIN IG-LIKE 
DOMAINS, B-TREFOIL FOLD 




IMMUNOGLOBULIN-LIKE DOMAIN 
CONNECTIN I27, TITIN IG REPEAT
27; MUSCLE PROTEIN,
IMMUNOGLOBULIN-LIKE DOMAIN


IMMUNE SYSTEM CD32; RECEPTOR, 
FC, CD32, IMMUNE SYSTEM 




TRANSFERASE DINUCLEOTIDE- 
BINDING MOTIF, PHOSPHORIBOSYL 
TRANSFERASE 


TRANSFERASE DINUCLEOTIDE- 
BINDING MOTIF, PHOSPHORIBOSYL 
TRANSFERASE 


HYDROLASE NEURAMINIDASE; 
HYDROLASE, GLYCOSIDASE 


HYDROLASE NEURAMINIDASE; 
HYDROLASE, GLYCOSIDASE 




HORMONE/GROWTH FACTOR 
HORMONE, RECEPTOR, 
HORMONE/GROWTH FACTOR 


HORMONE/GROWTH FACTOR 
HORMONE, RECEPTOR, 


Compound


MOLECULE; CHAIN: A, B, C, D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B, C, D; 
FIBROBLAST GROWTH
FACTOR RECEPTOR 2; CHAIN: 
E, F, G, H; 


T LYMPHOCYTE ADHESION 
GLYCOPROTEIN CD2 (RAT) 
1HNG 3


TITIN, I27; CHAIN: NULL;


FC GAMMA RIIB; CHAIN: A;




NICOTINATE 
MONONUCLEOTIDE:5,6- 
CHAIN: A; 


NICOTINATE 
MONONUCLEOTIDE:5,6- 
CHAIN: A; 


SIALIDASE; CHAIN: NULL; 






GROWTH HORMONE; CHAIN: 
A; PROLACTIN RECEPTOR; 
CHAIN: B; 


GROWTH HORMONE; CHAIN: 
A; PROLACTIN RECEPTOR; 


SEQFOLD
score
PDB annotation


RNA BINDING PROTEIN 


RNA BINDING PROTEIN D3 CORE 
SNRNP PROTEIN; B CORE SNRNP 
PROTEIN SNRNP, SPLICING, SM, 
CORE SNRNP DOMAIN, SYSTEMIC 
LUPUS 2 ERYTHEMATOSUS, SLE, 
RNA BINDING PROTEIN 


RNA BINDING PROTEIN D3 CORE 
SNRNP PROTEIN; B CORE SNRNP 
PROTEIN SNRNP, SPLICING, SM, 
CORE SNRNP DOMAIN, SYSTEMIC 
LUPUS 2 ERYTHEMATOSUS, SLE, 
RNA BINDING PROTEIN 


OXIDOREDUCTASE PDZ DOMAIN,
NNOS, NITRIC OXIDE SYNTHASE 


OXIDOREDUCTASE PDZ DOMAIN, 
NNOS, NITRIC OXIDE SYNTHASE 


PEPTIDE RECOGNITION PEPTIDE 
RECOGNITION, PROTEIN 
LOCALIZATION


CYTOKINE LCF; CYTOKINE,
LYMPHOCYTE 

CHEMOATTRACTANT FACTOR, PDZ 
DOMAIN 


KINASE HCASK, GLGF REPEAT, 
DHR; PDZ DOMAIN, NEUREXIN,
SYNDECAN, RECEPTOR 
CLUSTERING, KINASE 


Compound


ASSOCIATED CHAIN: B, D, F, H, 


SMALL NUCLEAR 
RIBONUCLEOPROTEIN SM D3; 
CHAIN: A, C, E, G, I, K; SMALL 
NUCLEAR 

RIBONUCLEOPROTEIN 
ASSOCIATED CHAIN: B, D, F, H, 
J,L; 


SMALL NUCLEAR
RIBONUCLEOPROTEIN SM D3;
CHAIN: A, C, E, G, I, K; SMALL
NUCLEAR
RIBONUCLEOPROTEIN
ASSOCIATED CHAIN: B, D, F, H,
J, L;


NEURONAL NITRIC OXIDE 
SYNTHASE; CHAIN: A; 
HEPTAPEPTIDE; CHAIN: B; 


NEURONAL NITRIC OXIDE 
SYNTHASE; CHAIN: A; 
HEPTAPEPTIDE; CHAIN: B;


PSD-95; CHAIN: A; CRIPT; 
CHAIN: B; 


INTERLEUKIN 16; CHAIN: 
NULL; 


HCASK/LIN-2 PROTEIN; 
CHAIN: A, B; 


SEQFOLD
score


















PMF 
score 




0.99 
Verify
score
PDB annotation


SIGNAL TRANSDUCTION HDLG, 
DHR3 DOMAIN; SIGNAL 
TRANSDUCTION, SH3 DOMAIN, 
REPEAT 


OXIDOREDUCTASE BETA-FINGER 


MEMBRANE 

PROTEIN/OXIDOREDUCTASE BETA- 
FINGER, HETERODIMER 


PEPTIDE RECOGNITION PSD-95; PDZ 
DOMAIN, NEURONAL NITRIC 
OXIDE SYNTHASE, NMDA 
RECEPTOR 2 BINDING 


HYDROLASE PDZ DOMAIN, HUMAN 
PHOSPHATASE, HPTP1E, PTP-BAS,
SPECIFICITY 2 OF BINDING 




PROTEIN BINDING GATE-16; 
UBIQUITIN FOLD 




LIPID TRANSPORT APO A-I; 
LIPOPROTEIN, LIPID TRANSPORT, 
CHOLESTEROL METABOLISM, 2 
ATHEROSCLEROSIS, HDL, LCAT-
ACTIVATION


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


Compound


HUMAN DISCS LARGE 
PROTEIN; CHAIN: NULL; 


NEURONAL NITRIC OXIDE 
SYNTHASE (RESIDUES 1-130); 
CHAIN: A; 


ALPHA-1 SYNTROPHIN
(RESIDUES 77-171); CHAIN: A; 
NEURONAL NITRIC OXIDE 
SYNTHASE (RESIDUES 1-130); 
CHAIN: B; 


POSTSYNAPTIC DENSITY 
PROTEIN 95; CHAIN: A; 


TYROSINE PHOSPHATASE 
(PTP-BAS, TYPE 1); CHAIN: A; 




GOLGI-ASSOCIATED ATPASE 
ENHANCER OF 16 KD; CHAIN: 

A, B;




APOLIPOPROTEIN A-I; CHAIN: 
A,B,C,D; 


ALPHA SPECTRIN; CHAIN: A, 
B,C; 


SEQFOLD
score


















54.19 


61.70 


PMF 
score 


1.00 


0.95 
Verify
score
PDB annotation


ENDOCYTOSIS/EXOCYTOSIS
NSEC1; PROTEIN-PROTEIN
COMPLEX, MULTI-SUBUNIT


ENDOCYTOSIS/EXOCYTOSIS 
NSEC1; PROTEIN-PROTEIN
COMPLEX, MULTI-SUBUNIT


MEMBRANE PROTEIN FOUR HELIX 
BUNDLE, ALPHA HELIX


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN


TRANSPORT PROTEIN VAMP 2; 
MEMBRANE FUSION PROTEIN 
COMPLEX, TRANSPORT PROTEIN 


DESIGNED HELICAL BUNDLE 
DESIGNED HELICAL BUNDLE


DESIGNED HELICAL BUNDLE 
DESIGNED HELICAL BUNDLE 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA;
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR)


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA;
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN,


Compound


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A; 
CHAIN: B; 


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A;
CHAIN: B;


SSO1 PROTEIN; CHAIN: A;


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A;


SYNAPTOBREVIN 2; CHAIN: A,
E, I; SYNTAXIN 1A; CHAIN: B,
F, J; SNAP-25B; CHAIN: C, G, K;
SNAP-25B; CHAIN: D, H, L;


DHP1; CHAIN: NULL;


DHP1; CHAIN: NULL;


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO-MAI; 
CHAIN: P; 


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO-MAI; 
CHAIN: P; 


SEQFOLD
score








57.26 


55.15 










PMF 
score 


0.45 


0.95 


0.04 






0.16 


0.17 


0.06 


0.47 


Verify 
score 


-0.00 


0.16 


0.23 






0.49 


0.17 


0.21 


0.32 
PDB annotation


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 


SURFACE PROTEIN MEROZOITE 
SURFACE ANTIGEN 1, MAJOR 
BLOOD-STAGE EGF-LIKE DOMAIN, 
EXTRACELLULAR, MODULAR 
PROTEIN, SURFACE 2 ANTIGEN, 
MALARIA VACCINE COMPONENT, 
SURFACE PROTEIN 


SURFACE PROTEIN MEROZOITE 
SURFACE ANTIGEN 1, MAJOR 
BLOOD-STAGE EGF-LIKE DOMAIN, 
EXTRACELLULAR, MODULAR 
PROTEIN, SURFACE 2 ANTIGEN, 
MALARIA VACCINE COMPONENT, 
SURFACE PROTEIN 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


Compound




MEROZOITE SURFACE 
PROTEIN 1; CHAIN: A; 


MEROZOITE SURFACE 
PROTEIN 1; CHAIN: A; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C; 


SEQFOLD
score














PMF 
score 




o 

d 


0.04 


0.09 


-0.19 


0.16 


Verify 
score 




0.18 


0.06 


0.44 


0.17 
PDB annotation


HYDROLASE/HYDROLASE
INHIBITOR PROTEIN-PEPTIDE 
COMPLEX 


HYDROLASE/HYDROLASE 
INHIBITOR PROTEIN-PEPTIDE 
COMPLEX 


SERINE PROTEINASE 
COAGULATION FACTOR II;
COAGULATION FACTOR II;
FETOMODULIN, TM, CD141
ANTIGEN; EGR-CMK SERINE
PROTEINASE, EGF-LIKE DOMAINS,
ANTICOAGULANT COMPLEX, 2
ANTIFIBRINOLYTIC COMPLEX


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN; TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE DOMAINS, 
ANTICOAGULANT COMPLEX, 2 
ANTIFIBRINOLYTIC COMPLEX


MATRIX PROTEIN 
EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE


Compound


DES-GLA FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H, I; 
DES-GLA FACTOR VIIA (LIGHT 
CHAIN); CHAIN: L, M; (DPN)- 
PHE-ARG; CHAIN: C, D; 
PEPTIDE E-76; CHAIN: X, Y;


DES-GLA FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H, I; 
DES-GLA FACTOR VIIA (LIGHT 
CHAIN); CHAIN: L, M; (DPN)- 
PHE-ARG; CHAIN: C, D; 
PEPTIDE E-76; CHAIN: X, Y;


THROMBIN LIGHT CHAIN; 
CHAIN: A, B, C, D; THROMBIN 
HEAVY CHAIN; CHAIN: M, N, 
O, P; THROMBOMODULIN; 
CHAIN: I, J, K, L; THROMBIN
INHIBITOR L-GLU-L-GLY-L- 
ARM; CHAIN: E, F, G, H; 


THROMBIN LIGHT CHAIN; 
CHAIN: A, B, C, D; THROMBIN 
HEAVY CHAIN; CHAIN: M, N, 
O, P; THROMBOMODULIN; 
CHAIN: I, J, K, L; THROMBIN 
INHIBITOR L-GLU-L-GLY-L- 
ARM; CHAIN: E, F, G, H; 


FIBRILLIN; CHAIN: NULL; 


SEQFOLD
score












PMF 
score 


-0.03 


-0.18 


0.01 


d 


-0.05 


Verify 
score 


0.24 


0.13 


-0.18 
PDB annotation




FRAGMENT, MATRIX PROTEIN 


MATRIX PROTEIN 
EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE 
DOMAIN, HUMAN FIBRILLIN-1 
FRAGMENT, MATRIX PROTEIN 


MATRIX PROTEIN 
EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE 
DOMAIN, HUMAN FIBRILLIN-1 
FRAGMENT, MATRIX PROTEIN 


MATRIX PROTEIN 
EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE 
DOMAIN, HUMAN FIBRILLIN-1 
FRAGMENT, MATRIX PROTEIN 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, SIGNALLING 
PROTEIN 


BLOOD CLOTTING 
COMPLEX(SERINE 
PROTEASE/COFACTOR/LIGAND), 
BLOOD COAGULATION, 2 SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 


Compound




FIBRILLIN; CHAIN: NULL;




FIBRILLIN; CHAIN: NULL; 


FIBRILLIN; CHAIN: NULL; 


TUMOR NECROSIS FACTOR 
RECEPTOR; CHAIN: A, B; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L; 
BLOOD COAGULATION 
FACTOR VIIA; CHAIN: H; 
SOLUBLE TISSUE FACTOR; 


SEQFOLD
score










55.46 




PMF 
score 






0.55 


0.13 




0.03 


Verify 
score 






0.33 


0.36 
INHIBITOR, HEMOPHILIA/EGF,
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN 














PDB annotation 


SERINE PROTEASE FVIIA; FVII
BLOOD COAGULATION, SERINE
PROTEASE


SERINE PROTEASE FVIIA; FVII
BLOOD COAGULATION, SERINE
PROTEASE


SERINE PROTEASE FVIIA; FVII
BLOOD COAGULATION, SERINE
PROTEASE


SERINE PROTEASE FVIIA; FVII
BLOOD COAGULATION, SERINE
PROTEASE


PLASMINOGEN ACTIVATION 


BLOOD COAGULATION FACTOR
STUART FACTOR; BLOOD 


Compound



COAGULATION FACTOR VIIA
(LIGHT CHAIN); CHAIN: L;
COAGULATION FACTOR VIIA
(HEAVY CHAIN); CHAIN: H;
TRIPEPTIDYL INHIBITOR;
CHAIN: C;


COAGULATION FACTOR VIIA
(LIGHT CHAIN); CHAIN: L;
COAGULATION FACTOR VIIA
(HEAVY CHAIN); CHAIN: H;
TRIPEPTIDYL INHIBITOR;
CHAIN: C;


COAGULATION FACTOR VIIA
(LIGHT CHAIN); CHAIN: L;
COAGULATION FACTOR VIIA
(HEAVY CHAIN); CHAIN: H;
TRIPEPTIDYL INHIBITOR;
CHAIN: C;


COAGULATION FACTOR VIIA
(LIGHT CHAIN); CHAIN: L;
COAGULATION FACTOR VIIA
(HEAVY CHAIN); CHAIN: H;
TRIPEPTIDYL INHIBITOR;
CHAIN: C;


T-PLASMINOGEN ACTIVATOR
F1-G; 1TPG 7 CHAIN: NULL;
1TPG 8


BLOOD COAGULATION
FACTOR XA; CHAIN

O 2 
































PMF 
score 




-0.09 


-0.17 


-0.18 


0.71 


-0.05 


-0.12 


Verify 
score 




0.55 
PDB annotation


COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN


BLOOD COAGULATION FACTOR 
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN


BLOOD COAGULATION FACTOR
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN


BLOOD COAGULATION FACTOR 
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN










CHAPERONE HSP40; CHAPERONE, 
HEAT SHOCK, PROTEIN FOLDING, 
DNAK 


Compound




BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


LECTIN (AGGLUTININ) WHEAT 
GERM AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


LECTIN (AGGLUTININ) WHEAT 
GERM AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


LECTIN (AGGLUTININ) WHEAT 
GERM AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


LECTIN (AGGLUTININ) WHEAT 
GERM AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


DNAJ; CHAIN: NULL; 


SEQFOLD
score








53.26 




60.84 






81.78 


PMF 
score 




-0.03 


0.46 




-0.12




-0.19 


8 

d 
i 




Verify 
score 




0.04 


0.43 




0.22




0.04 
PDB annotation


CHAPERONE HSP40; CHAPERONE, 
HEAT SHOCK, PROTEIN FOLDING, 
DNAK 


MOLECULAR CHAPERONE HDJ-1; 
MOLECULAR CHAPERONE 


MOLECULAR CHAPERONE HDJ-1; 
MOLECULAR CHAPERONE 


MOLECULAR CHAPERONE HDJ-1; 
MOLECULAR CHAPERONE 






COMPLEX (DNA-BINDING 
PROTEIN/DNA) 




GENE REGULATION/DNA LEF-1
HMG; LEF1, HMG, TCR-A,
TRANSCRIPTION FACTOR, DNA 
BINDING, DNA 2 BENDING, 
COMPLEX (HMG DOMAIN/DNA), 
GENE REGULATION/DNA 


GENE REGULATION/DNA LEF-1
HMG; LEF1, HMG, TCR-A,


Compound


DNAJ; CHAIN: NULL; 


HUMAN HSP40; CHAIN: NULL; 


HUMAN HSP40; CHAIN: NULL; 


HUMAN HSP40; CHAIN: NULL; 




DNA-BINDING HIGH 
MOBILITY GROUP PROTEIN 
FRAGMENT-B (HMGB) (DNA-
BINDING 1HME 3 HMG-BOX
DOMAIN B OF RAT HMG1)
(NMR, 1 STRUCTURE) 1HME 4


HUMAN SRY; 1HRY 6 CHAIN:
A; 1HRY 7 DNA; 1HRY 9
CHAIN: B; 1HRY 10


DNA-BINDING HIGH 
MOBILITY GROUP PROTEIN 1 
(HMG1) BOX 2, COMPLEXED
WITH 1HSM 3
MERCAPTOETHANOL (NMR,
MINIMIZED AVERAGE
STRUCTURE) 1HSM 4


LYMPHOID ENHANCER- 
BINDING FACTOR; CHAIN: A; 
DNA (5'- CHAIN: B; DNA (5'- 
CHAIN: C; 


LYMPHOID ENHANCER- 
BINDING FACTOR; CHAIN: A; 


SEQFOLD
score




86.98 














125.69 




PMF 
score 


1.00 




1.00 


1.00 




0.43 


0.24 


0.75 




1.00 


Verify 
score 


0.84 
0.97
PDB annotation


TRANSCRIPTION FACTOR, DNA 
BINDING, DNA 2 BENDING, 
COMPLEX (HMG DOMAIN/DNA), 
GENE REGULATION/DNA 


GENE REGULATION/DNA LEF-1 
HMG; LEF1, HMG, TCR-A,
TRANSCRIPTION FACTOR, DNA 
BINDING, DNA 2 BENDING, 
COMPLEX (HMG DOMAIN/DNA), 
GENE REGULATION/DNA 


DNA BINDING PROTEIN HMG BOX, 
DNA BENDING, DNA RECOGNITION, 
CHROMATIN, NMR, DNA 2 BINDING 
PROTEIN 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) 


GENE REGULATION/DNA LEF-1 
HMG; LEF1, HMG, TCR-A,
TRANSCRIPTION FACTOR, DNA 
BINDING, DNA 2 BENDING, 
COMPLEX (HMG DOMAIN/DNA), 
GENE REGULATION/DNA


GENE REGULATION/DNA LEF-1 
HMG; LEF1, HMG, TCR-A,
TRANSCRIPTION FACTOR, DNA
BINDING, DNA 2 BENDING, 


Compound


DNA (5'- CHAIN: B; DNA (5'-
CHAIN: C; 


LYMPHOID ENHANCER- 
BINDING FACTOR; CHAIN: A; 
DNA (5'- CHAIN: B; DNA (5'- 
CHAIN: C;




NON HISTONE PROTEIN 6 A; 
CHAIN: A; 


HUMAN SRY; 1HRY 6 CHAIN:
A; 1HRY 7 DNA; 1HRY 9
CHAIN: B; 1HRY 10


HUMAN SRY; 1HRY 6 CHAIN:
A; 1HRY 7 DNA; 1HRY 9
CHAIN: B; 1HRY 10


HUMAN SRY; 1HRY 6 CHAIN:
A; 1HRY 7 DNA; 1HRY 9
CHAIN: B; 1HRY 10


LYMPHOID ENHANCER- 
BINDING FACTOR; CHAIN: A; 
DNA (5'- CHAIN: B; DNA (5'- 
CHAIN: C; 


LYMPHOID ENHANCER- 
BINDING FACTOR; CHAIN: A; 
DNA (5'- CHAIN: B; DNA (5'- 
CHAIN: C; 


SEQFOLD
score










83.75 




57.47 




PMF 
score 




1.00 


0.89 


0.78




0.81 




0.74 


Verify 
score 




0.40 


0.10 


-0.10 




0.04 




-0.29 
PDB annotation


PYRIDOXAL 5'-PHOSPHATE, 2
TETRAHYDROFOLATE, 
ASYMMETRIC DIMER 


COMPLEX (TRANSCRIPTION 
FACTOR/DNA) TRANSCRIPTION 
FACTOR, PROTEIN-DNA COMPLEX, 
CYTOKINE 2 ACTIVATION, 
COMPLEX (TRANSCRIPTION 
FACTOR/DNA)


CHAPERONE ARCHAEAL PROTEIN 


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN 




LIGASE CBL, UBCH7, ZAP-70, E2, 
UBIQUITIN, E3, 

PHOSPHORYLATION, 2 TYROSINE 
KINASE, UBIQUITINATION,
PROTEIN DEGRADATION, 


TRANSFERASE RECEPTOR 
TYROSINE KINASE, PROTEIN 
INTERACTION MODULE, 2 
DIMERIZATION DOMAIN,
TRANSFERASE 


Compound


ASE; CHAIN: A, B, C, D; 


STAT3B; CHAIN: A; 18-MER
DESOXYOLIGONUCLEOTIDE; 
CHAIN: B; 


PREFOLDIN; CHAIN: A;
PREFOLDIN; CHAIN: B; 
PREFOLDIN; CHAIN: C; 


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


VIRUS EQUINE HERPES VIRUS-
1 (C3HC4, OR RING DOMAIN)
1CHC 3 (NMR, 1 STRUCTURE)
1CHC 4


SIGNAL TRANSDUCTION 
PROTEIN CBL; CHAIN: A; ZAP- 
70 PEPTIDE; CHAIN: B; 
UBIQUITIN-CONJUGATING 
ENZYME E2-18 KDA UBCH7;
CHAIN: C; 


EPHA4 RECEPTOR TYROSINE 
KINASE; CHAIN: A; 
PDB annotation


DEGRADATION, 2 TRANSDUCER, 
CALCIUM-BINDING, 
PHOSPHOLIPASE C, 3 
PHOSPHOINOSITIDE-SPECIFIC 


ENDOCYTOSIS/EXOCYTOSIS BETA 
SANDWICH, CALCIUM ION, C2
DOMAIN 


TRANSFERASE CALCIUM++, 
PHOSPHOLIPID BINDING PROTEIN, 
CALCIUM-BINDING 2 PROTEIN, 
PHOSPHATIDYLSERINE, PROTEIN 
KINASE C 


TRANSFERASE CALCIUM++, 
PHOSPHOLIPID BINDING PROTEIN, 
CALCIUM-BINDING 2 PROTEIN, 
PHOSPHATIDYLSERINE, PROTEIN 
KINASE C 


TRANSFERASE CALCIUM++, 
PHOSPHOLIPID BINDING PROTEIN, 
CALCIUM-BINDING 2 PROTEIN, 
PHOSPHATIDYLSERINE, PROTEIN 
KINASE C 


HYDROLASE CALB DOMAIN; 
HYDROLASE, C2 DOMAIN, CALB 
DOMAIN 






ENDOCYTOSIS/EXOCYTOSIS C2- 1 


Compound




SYNAPTOTAGMIN III; CHAIN:
A; 


PROTEIN KINASE C, ALPHA 
TYPE; CHAIN: A; 


PROTEIN KINASE C, ALPHA 
TYPE; CHAIN: A; 


PROTEIN KINASE C, ALPHA 
TYPE; CHAIN: A; 


PHOSPHOLIPASE A2; CHAIN: 
NULL; 


CALCIUM/PHOSPHOLIPID 
BINDING PROTEIN 
SYNAPTOTAGMIN I (FIRST C2 
DOMAIN) (CALB) 1RSY 3


CALCIUM/PHOSPHOLIPID 
BINDING PROTEIN 
SYNAPTOTAGMIN I (FIRST C2 
DOMAIN) (CALB) 1RSY 3


RABPHILIN 3-A; CHAIN: A;


SEQFOLD
score
PDB annotation


ENDOCYTOSIS/EXOCYTOSIS BETA 
SANDWICH, CALCIUM ION, C2 
DOMAIN 


TRANSFERASE CALCIUM++,
PHOSPHOLIPID BINDING PROTEIN,
CALCIUM-BINDING 2 PROTEIN,
PHOSPHATIDYLSERINE, PROTEIN 


TRANSFERASE CALCIUM++,
PHOSPHOLIPID BINDING PROTEIN,
CALCIUM-BINDING 2 PROTEIN,
PHOSPHATIDYLSERINE, PROTEIN 


TRANSFERASE CALCIUM++,
PHOSPHOLIPID BINDING PROTEIN, 
CALCIUM-BINDING 2 PROTEIN, 
PHOSPHATIDYLSERINE, PROTEIN 


HYDROLASE CALB DOMAIN; 
HYDROLASE, C2 DOMAIN, CALB
DOMAIN 








Compound


SYNAPTOTAGMIN III; CHAIN: 
A; 


PROTEIN KINASE C, ALPHA 
TYPE; CHAIN: A; 


PROTEIN KINASE C, ALPHA 
TYPE; CHAIN: A; 


PROTEIN KINASE C, ALPHA 
TYPE; CHAIN: A; 


PHOSPHOLIPASE A2; CHAIN: 
NULL; 


CALCIUM/PHOSPHOLIPID 
BINDING PROTEIN 
SYNAPTOTAGMIN I (FIRST C2
DOMAIN) (CALB) 1RSY 3


CALCIUM/PHOSPHOLIPID 
BINDING PROTEIN 
SYNAPTOTAGMIN I (FIRST C2 
DOMAIN) (CALB) 1RSY 3


CALCIUM/PHOSPHOLIPID 
BINDING PROTEIN 
SYNAPTOTAGMIN I (FIRST C2 
DOMAIN) (CALB) 1RSY 3
CALCIUM/PHOSPHOLIPID 


SEQFOLD score
PDB annotation


ENDOCYTOSIS/EXOCYTOSIS C2-
DOMAINS, C2B-DOMAIN,
RABPHILIN, 

ENDOCYTOSIS/EXOCYTOSIS 


ENDOCYTOSIS/EXOCYTOSIS 
SYNAPTOTAGMIN, C2-DOMAIN,
EXOCYTOSIS, 

NEUROTRANSMITTER 2 RELEASE, 
ENDOCYTOSIS/EXOCYTOSIS 


TRANSFERASE CALCIUM++,
PHOSPHOLIPID BINDING PROTEIN, 
CALCIUM-BINDING 2 PROTEIN, 
PHOSPHATIDYLSERINE, PROTEIN 
KINASE C 




POTASSIUM CHANNELS 
POTASSIUM CHANNELS, 
TETRAMERIZATION DOMAIN, X-
RAY 2 STRUCTURE, APLYSIA KV1.1


GENE REGULATION POL DOMAIN; 
PROTEIN-PROTEIN INTERACTION 
DOMAIN, TRANSCRIPTIONAL 2 


Compound


BINDING PROTEIN 
SYNAPTOTAGMIN I (FIRST C2 
DOMAIN) (CALB) 1RSY 3


RABPHILIN 3-A; CHAIN: A; 


RABPHILIN 3-A; CHAIN: A; 


SYNAPTOTAGMIN I; CHAIN: A; 


PROTEIN KINASE C, ALPHA
TYPE; CHAIN: A; 


CALCIUM/PHOSPHOLIPID 
BINDING PROTEIN 
SYNAPTOTAGMIN I (FIRST C2 
DOMAIN) (CALB) 1RSY 3


POTASSIUM CHANNEL KV1.1;
CHAIN: NULL; 


PROMYELOCYTIC LEUKEMIA 
ZINC FINGER PROTEIN PLZF; 
CHAIN: A; 


SEQFOLD score
score
PDB annotation


REPRESSOR, ZINC-FINGER 
PROTEIN, X-RAY 
CRYSTALLOGRAPHY, 3 PROTEIN 
STRUCTURE, PROMYELOCYTIC 
LEUKEMIA, GENE REGULATION 


SIGNALING PROTEIN VOLTAGE- 
GATED POTASSIUM CHANNEL, 
ASSEMBLY DOMAIN, TETRAMER


METAL TRANSPORT ION CHANNEL, 
OXIDOREDUCTASE, BETA SUBUNIT 


SIGNALING PROTEIN VOLTAGE- 
GATED POTASSIUM CHANNEL, 
TETRAMERIZATION DOMAIN, 2 
INTRACELLULAR GATE, 
TETRAMER 


PROTON TRANSPORT POTASSIUM 
CHANNELS, TETRAMERIZATION 
DOMAIN, X-RAY STRUCTURE, 2 
APLYSIA KV1.1, PROTON
TRANSPORT 


POTASSIUM CHANNEL POTASSIUM 
CHANNEL, TETRAMERIZATION 
DOMAIN, MOLECULAR 2 
RECOGNITION, ZINC-BINDING




STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS,
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 


Compound




KV1.2 VOLTAGE-GATED 
POTASSIUM CHANNEL; 
CHAIN: A, B, C, D, E, F, G, H;


KV BETA2 PROTEIN; CHAIN: A; 
POTASSIUM CHANNEL KV1.1;
CHAIN: E; 


KV1.2 VOLTAGE-GATED
POTASSIUM CHANNEL; 
CHAIN: A, B, C, D; 


POTASSIUM CHANNEL KV1.1;
CHAIN: A; 


POTASSIUM CHANNEL 
PROTEIN SHAW; CHAIN: NULL; 




ALPHA SPECTRIN; CHAIN: A, 
B, C; 


ALPHA SPECTRIN; CHAIN: A, 
B,C; 


SEQFOLD score
PDB annotation


TANDEM 3-HELIX COILED-COILS,
STRUCTURAL PROTEIN 




CELL CYCLE REGULATION CELL 
CYCLE REGULATION, 
HERPESVIRUS SAIMIRI, VIRAL 
CYCLIN 


CELL DIVISION RCYCLIN H 
(RECOMBINANT); CYCLIN, CELL 
CYCLE, CELL DIVISION, NUCLEAR 
PROTEIN 


COMPLEX (PROTEIN 
KINASE/CYCLIN) CYCLIN-
DEPENDENT KINASE-2, CDK2, P33
PROTEIN KINASE; CCNA, CCN1;
COMPLEX (PROTEIN 
KINASE/CYCLIN), CYCLIN, CDK, 2 
PHOSPHORYLATION, SUBSTRATE 
COMPLEX 


BINDING PROTEIN CYCLIN, CELL
CYCLE, KINASE-REGULATORY- 
SUBUNIT, 2 BINDING PROTEIN 




COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA)
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 


Compound






CYCLIN HOMOLOG; CHAIN: A; 


CYCLIN H; CHAIN: NULL; 


CELL DIVISION PROTEIN 
KINASE 2; CHAIN: A, C; 
G2/MITOTIC-SPECIFIC CYCLIN
A; CHAIN: B, D; SUBSTRATE 
PEPTIDE; CHAIN: E, F; 


CYCLIN A; CHAIN: NULL; 




QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B,C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 


SEQFOLD score
PDB annotation


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA
INTERACTION, PROTEIN DESIGN, 2
CRYSTAL STRUCTURE, COMPLEX
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA
INTERACTION, PROTEIN DESIGN, 2
CRYSTAL STRUCTURE, COMPLEX
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 


Compound


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


SEQFOLD score








58.17








PMF score
PDB annotation


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA)


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ADR1, ZINC FINGER, NMR




COMPLEX (DNA-BINDING
PROTEIN/DNA) FIVE-FINGER GLI;
GLI, ZINC FINGER, COMPLEX (DNA-
BINDING PROTEIN/DNA)


OXIDOREDUCTASE PDZ DOMAIN, 
NNOS, NITRIC OXIDE SYNTHASE 


OXIDOREDUCTASE PDZ DOMAIN, 
NNOS, NITRIC OXIDE SYNTHASE 


PEPTIDE RECOGNITION PEPTIDE 
RECOGNITION PROTEIN 


Compound


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


ADR1; CHAIN: NULL;


COMPLEX(TRANSCRIPTION 
REGULATION/DNA) 
TRAMTRACK PROTEIN (TWO 
ZINC-FINGER PEPTIDE) 
COMPLEXED WITH 2DRP 3 
DNA 2DRP 4 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


NEURONAL NITRIC OXIDE 
SYNTHASE; CHAIN: A; 
HEPTAPEPTIDE; CHAIN: B;


NEURONAL NITRIC OXIDE 
SYNTHASE; CHAIN: A; 
HEPTAPEPTIDE; CHAIN: B;


PSD-95; CHAIN: A; CRIPT; 
CHAIN: B; 


SEQFOLD score












54.14 




52.53 


PMF score


0.42
0.09
Verify
SPECIFICITY 2 OF BINDING
HYDROLASE PDZ DOMAIN, HUMAN 
PHOSPHATASE, HPTP1E, PTP-BAS,
SPECIFICITY 2 OF BINDING 


LIPID TRANSPORT APO A-I; 
LIPOPROTEIN, LIPID TRANSPORT, 
CHOLESTEROL METABOLISM, 2 
ATHEROSCLEROSIS, HDL, LCAT-
ACTIVATION


DESIGNED HELICAL BUNDLE 
DESIGNED HELICAL BUNDLE 


RNA-BINDING PROTEIN/RNA TRA 
PRE-MRNA; SPLICING 
REGULATION, RNP DOMAIN, RNA 
COMPLEX 


RNA-BINDING PROTEIN/RNA TRA 
PRE-MRNA; SPLICING 
REGULATION, RNP DOMAIN, RNA 


RNA-BINDING PROTEIN/RNA TRA 
PRE-MRNA; SPLICING 
REGULATION, RNP DOMAIN, RNA


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE


Compound


TYROSINE PHOSPHATASE 
(PTP-BAS, TYPE 1); CHAIN: A; 


APOLIPOPROTEIN A-I; CHAIN: 
A, B, C, D; 


DHP1; CHAIN: NULL;
POLYADENYLATE BINDING
PROTEIN 1; CHAIN: A, B, C, D,
SEQFOLD score
PDB annotation


REGULATION/RNA 


GENE REGULATION/RNA POLY(A)
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A)
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 



GENE REGULATION/RNA POLY(A)
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A)
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A)
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A)


Compound
PDB annotation


BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


NUCLEAR PROTEIN 
HETEROGENEOUS NUCLEAR 
RIBONUCLEOPROTEIN A1,
NUCLEAR PROTEIN, HNRNP, RBD, 
RRM, RNP, RNA BINDING, 2 


NUCLEAR PROTEIN 
HETEROGENEOUS NUCLEAR 
RIBONUCLEOPROTEIN A1,
NUCLEAR PROTEIN, HNRNP, RBD,
RRM, RNP, RNA BINDING, 2


Compound
POLYADENYLATE BINDING
PROTEIN 1; CHAIN: A, B, C, D, 
E, F, G, H; RNA (5'- 
R(*AP*AP*AP*AP*AP*AP*AP* 
AP*AP*AP*A)-3'); CHAIN: M, N,
O, P, Q, R, S, T; 


HNRNP A1; CHAIN: NULL;


HNRNP A1; CHAIN: NULL;


SEQFOLD score




53.52
PDB annotation


RIBONUCLEOPROTEIN 


NUCLEAR PROTEIN 
HETEROGENEOUS NUCLEAR 
RIBONUCLEOPROTEIN A1,
NUCLEAR PROTEIN, HNRNP, RBD, 
RRM, RNP, RNA BINDING, 2 
RIBONUCLEOPROTEIN 


RNA BINDING PROTEIN RNA- 
BINDING DOMAIN 




RNA BINDING PROTEIN RNA- 
BINDING DOMAIN 


OUTER MEMBRANE PROTEIN 
OSMOPORIN; OUTER MEMBRANE 
PROTEIN, NON-SPECIFIC PORIN, 
OSMOPORIN, 2 BETA-BARREL, 
TRANSMEMBRANE 




RIBONUCLEOPROTEIN PTB, PTB- 
C198, HETEROGENEOUS NUCLEAR 
POLYPYRIMIDINE TRACT BINDING
PROTEIN, RNP, RNA, SPLICING, 2
TRANSLATION 






Compound




HNRNP A1; CHAIN: NULL;


HETEROGENEOUS NUCLEAR 
RIBONUCLEOPROTEIN D0;
CHAIN: A;




OMPK36; CHAIN: A, B, C; 


OUTER MEMBRANE PROTEIN 
PHOSPHOPORIN (PHOE) 1PHO 3


POLYPYRIMIDINE TRACT-
BINDING PROTEIN; CHAIN: A;


RNA-BINDING PROTEIN SEX-
LETHAL PROTEIN (C-
TERMINUS, OR SECOND RNA-
BINDING DOMAIN 1SXL 3
(RBD-2), RESIDUES 199 - 294
PLUS N-TERMINAL MET) 1SXL
4 (NMR, 17 STRUCTURES) 1SXL
5


SEQFOLD score
PDB annotation




INTEGRAL MEMBRANE PROTEIN 
PORIN MATRIX PORIN, OMPF 
PORIN; 2OMF 7 PORIN, MEMBRANE






COMPLEX 

(RIBONUCLEOPROTEIN/DNA) 
HNRNP A1, UP1; COMPLEX
(RIBONUCLEOPROTEIN/DNA),
HETEROGENEOUS NUCLEAR 2 


RNA BINDING DOMAIN RNA 
BINDING DOMAIN, RBD, RNA 
RECOGNITION MOTIF, RRM, 2 
SPLICING INHIBITOR, 
TRANSLATIONAL INHIBITOR, SEX 3


Compound


LETHAL PROTEIN (C-
TERMINUS, OR SECOND RNA-
BINDING DOMAIN 1SXL 3
(RBD-2), RESIDUES 199 - 294
PLUS N-TERMINAL MET) 1SXL
4 (NMR, 17 STRUCTURES) 1SXL
5 


MATRIX PORIN OUTER 
MEMBRANE PROTEIN F; 2OMF
5 CHAIN: NULL; 2OMF 6


HETEROGENEOUS NUCLEAR 
RIBONUCLEOPROTEIN A1;
CHAIN: A; 12-NUCLEOTIDE 
SINGLE-STRANDED 
TELOMETRIC DNA; CHAIN: B; 


HETEROGENEOUS NUCLEAR 
RIBONUCLEOPROTEIN A1;
CHAIN: A; 12-NUCLEOTIDE 
SINGLE-STRANDED 
TELOMETRIC DNA; CHAIN: B; 


HETEROGENEOUS NUCLEAR 
RIBONUCLEOPROTEIN A1;
CHAIN: A; 12-NUCLEOTIDE 
SINGLE-STRANDED 
TELOMETRIC DNA; CHAIN: B; 


SEX-LETHAL; CHAIN: A, B, C; 


SEQFOLD score










165.44
PDB annotation


DETERMINATION, X CHROMOSOME
DOSAGE COMPENSATION 


RNA BINDING DOMAIN RNA 
BINDING DOMAIN, RBD, RNA 
RECOGNITION MOTIF, RRM, 2 
SPLICING INHIBITOR, 
TRANSLATIONAL INHIBITOR, SEX 3 
DETERMINATION, X CHROMOSOME 
DOSAGE COMPENSATION 




COAGULATION/CRYSTAL
STRUCTURE/HEPARIN-B FACTOR
IIA; COAGULATION/CRYSTAL
STRUCTURE/HEPARIN-BINDING
SITE/ 2 HIRUDIN/THROMBIN
INHIBITOR




CONTRACTILE PROTEIN MIXED
ALPHA-BETA STRUCTURE, ACTIN- 
BINDING PROTEIN, PROTEIN 2 
DOMAIN PACKING, CONTRACTILE 
PROTEIN 


CONTRACTILE PROTEIN MIXED
ALPHA-BETA STRUCTURE, ACTIN- 
BINDING PROTEIN, PROTEIN 2 
DOMAIN PACKING, CONTRACTILE 
PROTEIN 




COMPLEX (TWO DNA-BINDING
PROTEINS/DNA) MAT ALPHA-2; 
COMPLEX (TWO DNA-BINDING 
PROTEINS/DNA), COMPLEX, 2 DNA- 
BINDING PROTEIN, DNA, 
TRANSCRIPTION REGULATION 


Compound




SEX-LETHAL; CHAIN: A, B, C; 




THROMBIN; CHAIN: A,B,C; 
THROMBIN; CHAIN: D,E,F; 
HAEMADIN; CHAIN: I,J,K; 




HORSE PLASMA GELSOLIN; 
CHAIN: A, B 


HORSE PLASMA GELSOLIN; 
CHAIN: A, B 




MATING-TYPE PROTEIN A-1; 
CHAIN: A; MATING-TYPE 
PROTEIN ALPHA-2; CHAIN: B; 
DNA; CHAIN: C; 


SEQFOLD score














876.96
PDB annotation


COMPLEX (DNA-BINDING 
PROTEIN/DNA) GHF-1; COMPLEX 
(DNA-BINDING PROTEIN/DNA), 
PITUITARY, CPHD, 2 POU DOMAIN, 
TRANSCRIPTION FACTOR 


PROTEIN/DNA HOMEODOMAIN, 
DNA, COMPLEX, DNA-BINDING 
PROTEIN, PROTEIN/DNA




COMPLEX (DNA-BINDING 
PROTEIN/DNA) DNA-BINDING 
PROTEIN, DNA, PAIRED BOX, 
TRANSCRIPTION REGULATION




DNA-BINDING PROTEIN 




Compound


PIT-1; CHAIN: A, B; DNA;
CHAIN: C, D; 


HOMEOBOX PROTEIN HOX-B1;
CHAIN: A; PBX1; CHAIN: B;
DNA CHAIN: D; DNA CHAIN: E;


INSULIN GENE ENHANCER 
PROTEIN ISL-1; CHAIN: NULL;


PAIRED PROTEIN; CHAIN: A, B, 
C; DNA; CHAIN: D, E, F 


PAIRED PROTEIN; CHAIN: A, B, 
C; DNA; CHAIN: D, E, F 


DNA-BINDING PROTEIN OCT-2 
POU HOMEODOMAIN (NMR, 
AVERAGE STRUCTURE) 1HDP
3


OCT-3; 1OCP 5 CHAIN: NULL;
1OCP 6


DNA BINDING PROTEIN OCT-1 
POU HOMEODOMAIN DNA- 
BINDING PROTEIN MUTANT 
WITH 1POG 3 ARG GLY SER
HIS MET INSERTED AT THE N-
TERMINUS AND ASP ILE 1POG
4 INSERTED AT THE C-
TERMINUS (INS(RGSHM-
R6),INS(I66-DD 1POG 5 (NMR,
SEQFOLD score
Compound


13 STRUCTURES) 1POG 6




HYDROLASE(CARBOXYLIC 
ESTERASE) LIPASE (E.C.3.1.1.3) 
COMPLEXED WITH COLIPASE 
AND INHIBITED 1LPB 3 BY
UNDECANE PHOSPHONATE
METHYL ESTER (TWO
CONFORMATIONS) 1LPB 4


HYDROLASE(CARBOXYLIC 
ESTERASE) LIPASE (E.C.3.1.1.3) 
COMPLEXED WITH COLIPASE 
AND INHIBITED 1LPB 3 BY
UNDECANE PHOSPHONATE
METHYL ESTER (TWO
CONFORMATIONS) 1LPB 4


HYDROLASE(CARBOXYLIC 
ESTERASE) LIPASE (E.C.3.1.1.3) 
COMPLEXED WITH COLIPASE 
AND INHIBITED 1LPB 3 BY
UNDECANE PHOSPHONATE
METHYL ESTER (TWO
CONFORMATIONS) 1LPB 4


LIPASE PROTEIN COFACTOR 
PORCINE PANCREATIC 
PROCOLIPASE B 1PCN 3 (NMR,
MINIMIZED AVERAGE
STRUCTURE) 1PCN 4


LIPASE PROTEIN COFACTOR 
PORCINE PANCREATIC 
PROCOLIPASE B 1PCN 3 (NMR,
MINIMIZED AVERAGE
STRUCTURE) 1PCN 4


LIPASE PROTEIN COFACTOR


SEQFOLD score










50.95
COMPLEX

(TRANSFERASE/PEPTIDE) 
COMPLEX 

(TRANSFERASE/PEPTIDE) 




PHOSPHORYLATION SIGNAL 
TRANSDUCTION, TYROSINE 
KINASE, TRANSFERASE, 2 
PHOSPHOTRANSFERASE, 
PHOSPHORYLATION


SIGNALING 

PROTEIN/TRANSFERASE NAK;
COMPLEX, SIGNAL 
TRANSDUCTION, 
PHOSPHOTYROSINE BINDING 2 
DOMAIN (PTB), ASYMMETRIC CELL 
DIVISION 


PHOSPHOTRANSFERASE C-SRC, 
P60-SRC; SRC, TYROSINE KINASE,
PHOSPHORYLATION, SH2, SH3, 2 
PHOSPHOTYROSINE, PROTO- 
ONCOGENE, 

PHOSPHOTRANSFERASE


1 


Compound


PORCINE PANCREATIC 
PROCOLIPASE B 1PCN 3 (NMR,
MINIMIZED AVERAGE
STRUCTURE) 1PCN 4


C-SRC TYROSINE KINASE; 
CHAIN: A, B; ACE-FORMYL
PHOSPHOTYR-GLU-(N,N- 
DIPENTYL AMINE); CHAIN: C, 
D; 


PP60 V-SRC TYROSINE KINASE 
TRANSFORMING PROTEIN; 
CHAIN: NULL; 


P55 BLK PROTEIN TYROSINE
KINASE; CHAIN: NULL;


NUMB PROTEIN; CHAIN: A; 
NUMB ASSOCIATE KINASE; 
CHAIN: B; 


TYROSINE-PROTEIN KINASE 
SRC; CHAIN: NULL; 


PHOSPHOTRANSFERASE V- 
SRC TYROSINE KINASE 


SEQFOLD score
















PMF score
0.11


0.95
Verify score
PDB annotation

REGULATION 


TRANSLATION INITIATION FACTOR 
SUI1; TRANSLATION INITIATION
FACTOR 


TRANSLATION INITIATION FACTOR 
SUI1; TRANSLATION INITIATION
FACTOR 


HEXAMERIZATION DOMAIN 
HEXAMERIZATION DOMAIN, 
ATPASE, TRANSPORT


CHAPERONE AAA-ATPASE, CLPY, 
ATP-DEPENDENT PROTEOLYSIS 


PHOSPHOTRANSFERASE C-SRC, 
P60-SRC; SRC, TYROSINE KINASE, 
PHOSPHORYLATION, SH2, SH3, 2 
PHOSPHOTYROSINE, PROTO- 
ONCOGENE, 

PHOSPHOTRANSFERASE


COMPLEX (SIGNAL 
TRANSDUCTION/PEPTIDE) 
COMPLEX (SIGNAL 
TRANSDUCTION/PEPTIDE), SH3 
DOMAIN 




Compound


EIF1; CHAIN: NULL;


EIF1; CHAIN: NULL;


N-ETHYLMALEIMIDE- 
SENSITIVE FUSION PROTEIN; 
CHAIN: A; 


HEAT SHOCK PROTEIN HSLU; 
CHAIN: A; 


TYROSINE-PROTEIN KINASE 
SRC; CHAIN: NULL; 


GRB2; CHAIN: A; SOS-1; 
CHAIN: B; 


SIGNAL TRANSDUCTION 
PROTEIN GROWTH FACTOR 
RECEPTOR-BOUND PROTEIN 2 
(GRB2, N-TERMINAL 1GBR 3
SH3 DOMAIN) COMPLEXED
WITH SOS-A PEPTIDE 1GBR 4
(NMR, 29 STRUCTURES) 1GBR
5


SEQFOLD score
PDB annotation




SIGNAL TRANSDUCTION ADAPTOR 
SH2, SH3 1GRI 14


COMPLEX (KINASE/PEPTIDE) 


TYROSINE KINASE TYROSINE 
KINASE-INHIBITOR COMPLEX, 
DOWN-REGULATED KINASE, 2 
ORDERED ACTIVATION LOOP 


SIGNAL TRANSDUCTION PROTEIN 
SRC-HOMOLOGY 3 (SH3) DOMAIN, 
PEPTIDE-BINDING PROTEIN, 1SEM
18 2 GUANINE NUCLEOTIDE
EXCHANGE FACTOR 1SEM 19


TRANSFERASE TRANSFERASE, 
TYROSINE KINASE, SH3, SH2, 
ONCOPROTEIN 




TRANSPORT PROTEIN SERINE-RICH 
RNA POLYMERASE I SUPPRESSOR
PROTEIN; ARM REPEAT 


STRUCTURAL PROTEIN 
ARMADILLO REPEAT, BETA- 


Compound


ADAPTOR PROTEIN 
CONTAINING SH2 AND SH3 
GROWTH FACTOR RECEPTOR- 
BOUND PROTEIN 2 (GRB2) 
1GFC 3 (C-TERMINAL SH3
DOMAIN) (NMR, MINIMIZED
MEAN STRUCTURE) 1GFC 4


GROWTH FACTOR BOUND 
PROTEIN 2; 1GRI 5 CHAIN: A, B;
1GRI 6


P56==LCK== TYROSINE 
KINASE; 1LCK 7 CHAIN: A;
1LCK 8 TAIL PHOSPHOPEPTIDE
TEGQ(PHOSPHO)YQPQPA;
1LCK 14 CHAIN: B; 1LCK 15


HAEMATOPOETIC CELL 
KINASE (HCK); CHAIN: A; 


SEM-5; 1SEM 3 CHAIN: A, B;
1SEM 5 10-RESIDUE PROLINE-
RICH PEPTIDE FROM MSOS
1SEM 8 CHAIN: C, D 1SEM 10


ABL TYROSINE KINASE; 
CHAIN: NULL; 




KARYOPHERIN ALPHA;
CHAIN: A, B; MYC PROTO- 
ONCOGENE PROTEIN; CHAIN: 
C,D,E,F; 


BETA-CATENIN; CHAIN: NULL; 


SEQFOLD score
PDB annotation

CATENIN, STRUCTURAL PROTEIN


COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION,
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP, RIBONUCLEOPROTEIN


Compound


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D; 


SEQFOLD score
















PMF score


0.27 


1.00 


o 
d 


0.74 


0.37 


CO 

d 


0.82 


Verify score


0.16 


0.54 


oro 


0.36 


0.41 


CO 

o 

d 
i 


00 
CN 

d 


Psi Blast 


CN 
VD 


cn 

6 

«n 


on 

1 

«n 


i> 
o 
d> 

VO 
ON 


1e-23


CN 
CN 

<6 

«n 
«— i 


o 

CN 

6 


8* 


Ol 

VO 

m 


ro 


o 
ro 


m 




o 

a 


© ■ 

CO 


START AA


CN 


G\ 
CN 


VO 


CN 


o 
cn 


CO 

wo 


VO 
CN 


CHAIN ID


< 


< 




< 


< 


< 






1 


* 


— < 


a 

— « 


1 


i - 

— < 


1 


gal 


1671 


— » 


2 v 


^ i 

O v 


^ i 
2 ; 




O 






WO 02/070539 PCT/US02/05095



PDB annotation 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP, RIBONUCLEOPROTEIN


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP, RIBONUCLEOPROTEIN


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, CELL 
ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, CELL 
ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, CELL 
ADHESION 


CELL ADHESION LEUCINE RICH


Compound


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B";
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B,D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


U2 RNA HAIRPIN IV; CHAIN: Q,
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


INTERNALIN B; CHAIN: A; 


INTERNALIN B; CHAIN: A; 


INTERNALIN B; CHAIN: A; 


INTERNALIN B; CHAIN: A; 


SEQFOLD score
PDB annotation


SUBUNIT, BETA SUBUNIT 


CONTRACTILE PROTEIN LEUCINE-
RICH REPEAT, BETA-BETA-ALPHA
CYLINDER, DYNEIN, 2 
CHLAMYDOMONAS, FLAGELLA 


CONTRACTILE PROTEIN LEUCINE-
RICH REPEAT, BETA-BETA-ALPHA
CYLINDER, DYNEIN, 2 
CHLAMYDOMONAS, FLAGELLA 


CONTRACTILE PROTEIN LEUCINE-
RICH REPEAT, BETA-BETA-ALPHA
CYLINDER, DYNEIN, 2 
CHLAMYDOMONAS, FLAGELLA 


CONTRACTILE PROTEIN LEUCINE-
RICH REPEAT, BETA-BETA-ALPHA
CYLINDER, DYNEIN, 2 
CHLAMYDOMONAS, FLAGELLA 


CONTRACTILE PROTEIN LEUCINE- 
RICH REPEAT, BETA-BETA-ALPHA 
CYLINDER, DYNEIN, 2 
CHLAMYDOMONAS, FLAGELLA 


SIGNALING PROTEIN 
PHOTORECEPTOR, G PROTEIN- 
COUPLED RECEPTOR, MEMBRANE 
PROTEIN, 2 RETINAL PROTEIN, 
VISUAL PIGMENT 


RNA BINDING PROTEIN TAP (NFX1);
RIBONUCLEOPROTEIN (RNP,RBD 
OR RRM) AND LEUCINE-RICH- 
REPEAT 2 (LRR) 


RNA BINDING PROTEIN TAP (NFX1);
RIBONUCLEOPROTEIN (RNP,RBD 
OR RRM) AND LEUCINE-RICH- 


Compound


RASE BETA SUBUNIT; CHAIN:
B, D;


OUTER ARM DYNEIN; CHAIN:
A;


OUTER ARM DYNEIN; CHAIN:
A;


OUTER ARM DYNEIN; CHAIN:
A;


OUTER ARM DYNEIN; CHAIN:
A;


OUTER ARM DYNEIN; CHAIN:
A;


RHODOPSIN; CHAIN: A, B


NUCLEAR RNA EXPORT
FACTOR 1; CHAIN: A, B;

SEQFOLD score


TRANSCRIPTION RNA1P; RANGAP;
GTPASE-ACTIVATING PROTEIN
FOR SPI1, GTPASE-ACTIVATING
PROTEIN, GAP, RNA1P, RANGAP,
LRR, LEUCINE- 2 RICH REPEAT
PROTEIN, TWINNING, 
HEMIHEDRAL TWINNING, 3 
MEROHEDRAL TWINNING, 


ACETYLATION RNASE INHIBITOR, 
RIBONUCLEASE/ANGIOGENIN 
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS


ACETYLATION RNASE INHIBITOR, 
RIBONUCLEASE/ANGIOGENIN 
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS


ACETYLATION RNASE INHIBITOR,
RIBONUCLEASE/ANGIOGENIN 
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS 




Compound


NUCLEAR RNA EXPORT 
FACTOR 1; CHAIN: A, B;


GTPASE-ACTIVATING 
PROTEIN RNA1 SCHPO; 
CHAIN: A, B; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 


CYCLIN A/CDK2-ASSOCIATED
P19; CHAIN: A, C; CYCLIN
A/CDK2-ASSOCIATED P45;
CHAIN: B, D;


CYCLIN A/CDK2-ASSOCIATED
P19; CHAIN: A, C; CYCLIN
A/CDK2-ASSOCIATED P45;


SEQFOLD score
PDB annotation


UBIQUITIN PROTEIN LIGASE


LIGASE CYCLIN A/CDK2- 
ASSOCIATED P45; CYCLIN A/CDK2- 
ASSOCIATED P19; SKP1, SKP2, F-
BOX, LRRS, LEUCINE-RICH 
REPEATS, SCF, 2 UBIQUITIN, E3, 
UBIQUITIN PROTEIN LIGASE 




TRANSFERASE ALLOSTERY, 
COOPERATIVITY, GLYCEROL 
KINASE, CRYSTAL 2 STRUCTURE, 
TRANSFERASE 


TRANSFERASE ALLOSTERY, 
COOPERATIVITY, GLYCEROL 
KINASE, CRYSTAL 2 STRUCTURE, 
TRANSFERASE 




ANTIFUNGAL PROTEIN OSMOTIN- 
LIKE PROTEIN; ANTIFUNGAL 
PROTEIN, PATHOGENESIS- 
RELATED PROTEIN, PR-5D, 2 
OSMOTIN, THAUMATIN-LIKE 
PROTEIN 


HYDROLASE INHIBITOR ALL-BETA 
STRUCTURE, HYDROLASE 
INHIBITOR 


HYDROLASE INHIBITOR ALL-BETA 
STRUCTURE, HYDROLASE 
INHIBITOR 


HYDROLASE INHIBITOR ALL-BETA 
STRUCTURE, HYDROLASE 
INHIBITOR 


HYDROLASE INHIBITOR ALL-BETA 
STRUCTURE, HYDROLASE 










d 


6 














Compound


CHAIN: B, D; 


SKP2; CHAIN: A, C; SKP1;
CHAIN: B, D; 




GLYCEROL KINASE; CHAIN: 
Y,Z,X; 


GLYCEROL KINASE; CHAIN: 
Y,Z,X; 




PR-5D; CHAIN: NULL; 


BOWMAN-BIRK TRYPSIN 
INHIBITOR; CHAIN: A 


BOWMAN-BIRK TRYPSIN 
INHIBITOR; CHAIN: A 


BOWMAN-BIRK TRYPSIN 
INHIBITOR; CHAIN: A 


BOWMAN-BIRK TRYPSIN 
INHIBITOR; CHAIN: A
INHIBITOR

PLANT PROTEIN TWO 
HOMOLOGOUS HEVEIN-LIKE 
DOMAINS 


PLANT PROTEIN TWO 
HOMOLOGOUS HEVEIN-LIKE 
DOMAINS 


PLANT PROTEIN TWO 
HOMOLOGOUS HEVEIN-LIKE 
DOMAINS 


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA, 
SUPERANTIGEN


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA, 
SUPERANTIGEN 


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA, 
SUPERANTIGEN 


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA, 
SUPERANTIGEN


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA, 
SUPERANTIGEN


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA, 
SUPERANTIGEN, SACCHARIDE 
BINDING 


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA,
SUPERANTIGEN, SACCHARIDE 
BINDING 

SUGAR BINDING PROTEIN UDA;


Compound


AGGLUTININ ISOLECTIN VI; 
CHAIN: A 


AGGLUTININ ISOLECTIN VI; 
CHAIN: A 


AGGLUTININ ISOLECTIN VI; 
CHAIN: A 


AGGLUTININ ISOLECTIN 
VI/AGGLUTININ ISOLECTIN V; 
CHAIN: A; 


AGGLUTININ ISOLECTIN 
VI/AGGLUTININ ISOLECTIN V; 
CHAIN: A; 


AGGLUTININ ISOLECTIN 
VI/AGGLUTININ ISOLECTIN V; 
CHAIN: A; 


AGGLUTININ ISOLECTIN 
VI/AGGLUTININ ISOLECTIN V; 
CHAIN: A; 


AGGLUTININ ISOLECTIN 
VI/AGGLUTININ ISOLECTIN V; 
CHAIN: A; 


AGGLUTININ ISOLECTIN 
I/AGGLUTININ ISOLECTIN V/ 
CHAIN: A; 


AGGLUTININ ISOLECTIN 
I/AGGLUTININ ISOLECTIN V/ 
CHAIN: A; 

AGGLUTININ ISOLECTIN
SEQFOLD score
LECTIN, HEVEIN DOMAIN, UDA,
SUPERANTIGEN, SACCHARIDE 
BINDING 


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA, 
SUPERANTIGEN, SACCHARIDE 
BINDING 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, SIGNALLING 
PROTEIN 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, SIGNALLING 
PROTEIN 


HORMONE RECEPTOR HORMONE 
RECEPTOR, INSULIN RECEPTOR 
FAMILY 


GLYCOPROTEIN GLYCOPROTEIN 


GLYCOPROTEIN GLYCOPROTEIN


GLYCOPROTEIN GLYCOPROTEIN 


SIGNALLING PROTEIN TYPE I 
RECEPTOR, STNFR1; 1NCF 8
BINDING PROTEIN, CYTOKINE
1NCF 19


OUTER MEMBRANE PROTEIN 
OSMOPORIN; OUTER MEMBRANE 
PROTEIN, NON-SPECIFIC PORIN, 
OSMOPORIN, 2 BETA-BARREL, 
TRANSMEMBRANE 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 


Compound


I/AGGLUTININ ISOLECTIN V/ 
CHAIN: A; 


AGGLUTININ ISOLECTIN 
I/AGGLUTININ ISOLECTIN V/ 
CHAIN: A; 


TUMOR NECROSIS FACTOR 
RECEPTOR; CHAIN: A, B; 


TUMOR NECROSIS FACTOR 
RECEPTOR; CHAIN: A, B; 


INSULIN-LIKE GROWTH
FACTOR RECEPTOR 1; CHAIN: 

A; 








TUMOR NECROSIS FACTOR 
RECEPTOR; 1NCF 4 CHAIN: A,
B; 1NCF 5


OMPK36; CHAIN: A, B, C; 


FACTOR IXA; CHAIN: C, L; D-
PHE-PRO-ARG; CHAIN: I;


LAMININ; CHAIN: NULL;


LAMININ; CHAIN: NULL;


LAMININ; CHAIN: NULL;


SEQFOLD score
BINDING, HYDROLASE, 3
GLYCOPROTEIN


MEMBRANE ADHESION SHORT
CONSENSUS REPEAT, SUSHI, 
COMPLEMENT CONTROL PROTEIN, 
2 N-GLYCOSYLATION, MULTI-
DOMAIN, MEMBRANE ADHESION


SERINE PROTEASE INHIBITOR 
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS


SERINE PROTEASE INHIBITOR 
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS


SERINE PROTEASE INHIBITOR
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS


SERINE PROTEASE INHIBITOR 
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS 




Compound




HUMAN BETA2- 
GLYCOPROTEIN I; CHAIN: A; 


ANTISTASIN; CHAIN: NULL; 


ANTISTASIN; CHAIN: NULL; 


ANTISTASIN; CHAIN: NULL; 


ANTISTASIN; CHAIN: NULL; 


MEMBRANE PROTEIN 
VITELLINE MEMBRANE 
OUTER LAYER PROTEIN I 


SEQFOLD score
OXIDOREDUCTASE ALPHA-BETA-
ALPHA MOTIF, FLAVIN
CONTAINING PROTEIN, OXIDASE 


FLAVOENZYME FLAVOENZYME, 
NAD BIOSYNTHESIS, FAD, 
OXIDOREDUCTASE 


OXIDOREDUCTASE ADR, NADPH: 
ADRENODOXIN 
OXIDOREDUCTASE; 
FLAVOENZYME, MAD ANALYSIS, 
ELECTRON TRANSFERASE 


OXIDOREDUCTASE TETRAHEME 
FLAVOCYTOCHROME C 
FUMARATE REDUCTASE, 2 
OXIDOREDUCTASE 


OXIDOREDUCTASE IRON-SULFUR 
FLAVOPROTEIN, ELECTRON 
TRANSFER, OXIDOREDUCTASE 


OXIDOREDUCTASE IRON-SULFUR 
FLAVOPROTEIN, ELECTRON 
TRANSFER, OXIDOREDUCTASE 


OXIDOREDUCTASE LIPOAMIDE 
DEHYDROGENASE, L PROTEIN, E3, 
DLDH, DIHYDROLIPOAMIDE 
DEHYDROGENASE, MULTIENZYME 
COMPLEX 2 PROTEIN, PYRUVATE 
DEHYDROGENASE COMPLEX, 
GLYCINE 3 DECARBOXYLASE 
COMPLEX, FLAVOPROTEIN 


Compound


1VMO 3


LECTIN (AGGLUTININ) WHEAT 
GERM AGGLUTININ 
(ISOLECTIN 2) 9WGA 3




D-AMINO ACID OXIDASE; 
CHAIN: A; 


L-ASPARTATE OXIDASE; 
CHAIN: A; 


ADRENODOXIN REDUCTASE; 
CHAIN: A; 


FLAVOCYTOCHROME C 
FUMARATE REDUCTASE; 
CHAIN: A; 


TRIMETHYLAMINE 
DEHYDROGENASE; CHAIN: A, 
B; 


TRIMETHYLAMINE 
DEHYDROGENASE; CHAIN: A, 
B; 


DIHYDROLIPOAMIDE 
DEHYDROGENASE; CHAIN: A, 
B, C, D; 


SEQFOLD score
OXIDOREDUCTASE LIPOAMIDE
DEHYDROGENASE, L PROTEIN, E3, 
DLDH, DIHYDROLIPOAMIDE 
DEHYDROGENASE, MULTIENZYME 
COMPLEX 2 PROTEIN, PYRUVATE 
DEHYDROGENASE COMPLEX, 
GLYCINE 3 DECARBOXYLASE 
COMPLEX, FLAVOPROTEIN


COMPLEX 

(OXIDOREDUCTASE/TRANSFERASE)
E3BD; REDOX-ACTIVE CENTER,
GLYCOLYSIS, OXIDOREDUCTASE 


OXIDOREDUCTASE 
FLAVOENZYME, OXIDASE, 
ENANTIOMERIC SPECIFICITY, O- 2 
AMINOBENZOATE, ACTIVE SITE
FUNNEL, HELICAL DOMAIN, FAD- 3
BINDING DOMAIN




OXIDOREDUCTASE REDOX-ACTIVE
CENTER, OXIDOREDUCTASE, 


Compound


DIHYDROLIPOAMIDE 
DEHYDROGENASE; CHAIN: A, 
B, C, D; 


DIHYDROLIPOAMIDE 
DEHYDROGENASE; CHAIN: A, 
B; DIHYDROLIPOAMIDE 
ACETYLTRANSFERASE; 
CHAIN: C;


L-AMINO ACID OXIDASE; 
CHAIN: A, B, C, D, E, F, G, H; 


ELECTRON 

TRANSPORT(FLAVOCYTOCHR 
OME) FLAVOCYTOCHROME C 
SULFIDE DEHYDROGENASE 
(FCSD) 1FCD 3


ELECTRON 

TRANSPORT(FLAVOCYTOCHR 
OME) FLAVOCYTOCHROME C 
SULFIDE DEHYDROGENASE 
(FCSD) 1FCD 3


TRYPANOTHIONE 
REDUCTASE; CHAIN: A, B; 


TRYPANOTHIONE 
REDUCTASE; CHAIN: A, B; 


SEQFOL 
D score 
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0 £ 0 E 
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owoc 



OXIDOREDUCTASE 
OXIDOREDUCTASE 




TRANSCRIPTION INHIBITOR BETA- 
PROPELLER 


TRANSCRIPTION INHIBITOR BETA- 
PROPELLER 


COMPLEX (GTP- 
BINDING/TRANSDUCER) BETA1,
TRANSDUCIN BETA SUBUNIT;


Compound


DIHYDROLIPOAMIDE 
DEHYDROGENASE (E.C.1.8.1.4) 
COMPLEX WITH 1LVL 3
NICOTINAMIDE-ADENINE-
DINUCLEOTIDE (NAD+) 1LVL 4


OXIDOREDUCTASE (H2O2(A))
NADH PEROXIDASE (NPX)
(E.C.1.11.1.1) MUTANT WITH
CYS 42 1NHP 3 REPLACED BY
ALA (C42A) 1NHP 4


SURFACE PROTEIN; CHAIN: 
NULL; 


L-ALANINE 

DEHYDROGENASE; CHAIN: A; 


FLAVOCYTOCHROME C3 
FUMARATE REDUCTASE;
CHAIN: A, D; 


OXIDOREDUCTASE 
DIHYDROLIPOAMIDE 
DEHYDROGENASE (E.C.1.8.1.4)
3LAD 3 


TRANSCRIPTIONAL 
REPRESSOR TUP1; CHAIN: A,
B,C; 


TRANSCRIPTIONAL 
REPRESSOR TUP1; CHAIN: A,
B,C; 


GT-ALPHA/GI-ALPHA 
CHIMERA; CHAIN: A; GT- 
BETA; CHAIN: B; GT-GAMMA;


SEQFOL 
D score 
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PDB annotation 


INTEGRAL MEMBRANE PROTEIN 
PORIN MATRIX PORIN, OMPF 
PORIN; 2OMF 7 PORIN, MEMBRANE
PROTEIN 2OMF 12


STRUCTURAL PROTEIN TWO
REPEATS OF SPECTRIN, ALPHA
HELICAL LINKER REGION, 2 2
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 






TRANSFERASE PYRUVATE KINASE, 
GLYCOLYTIC ENZYME, 
HOMOTETRAMER, 2 TRANSFERASE




Compound


MATRIX PORIN OUTER 
MEMBRANE PROTEIN F; 2OMF
5 CHAIN: NULL; 2OMF 6


ALPHA SPECTRIN; CHAIN: A, 
B,C; 


HYDROLASE CYTIDINE 
DEAMINASE (CDA) (E.C.3.5.4.5) 
COMPLEXED WITH 1CTT 3 3,4-
DIHYDROZEBULARINE (DHZ)
1CTT 4


THERMOSOME; CHAIN: A, B; 


THERMOSOME; CHAIN: A, B; 




PYRUVATE KINASE; CHAIN: A, 
B,C,D,E, F, H, G; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 


SEQFOL 
D score 
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PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) TFIIIA; 5S 
GENE; NMR, TFIIIA, PROTEIN, DNA, 
TRANSCRIPTION FACTOR, 5S RNA 2 
GENE, DNA BINDING PROTEIN, 
ZINC FINGER, COMPLEX 3 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 


Compound


SITE; CHAIN: B, C;


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C;


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


TRANSCRIPTION FACTOR IIIA; 
CHAIN: A; 5S RNA GENE; 
CHAIN: E, F; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A,B; 


SEQFOL 
D score 
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PDB annotation 


SUBUNIT, BETA SUBUNIT 


CONTRACTILE PROTEIN LEUCINE- 
RICH REPEAT, BETA-BETA- ALPHA 
CYLINDER, DYNEIN, 2 
CHLAMYDOMONAS, FLAGELLA 


TRANSCRIPTION RNA1P; RANGAP;
GTPASE-ACTIVATING PROTEIN
FOR SPI1, GTPASE-ACTIVATING
PROTEIN, GAP, RNA1P, RANGAP,
LRR, LEUCINE- 2 RICH REPEAT 
PROTEIN, TWINNING, 
HEMIHEDRAL TWINNING, 3 
MEROHEDRAL TWINNING, 
MEROHEDRY 


ACETYLATION RNASE INHIBITOR, 
RIBONUCLEASE/ANGIOGENIN 
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS 




ENDOCYTOSIS/EXOCYTOSIS 
SUPERHELIX OF HELICES 


ENDOCYTOSIS/EXOCYTOSIS 
SUPERHELIX OF HELICES 




RNA-BINDING PROTEIN/RNA TRA 
PRE-MRNA; SPLICING 
REGULATION, RNP DOMAIN, RNA 
COMPLEX 


RNA-BINDING PROTEIN/RNA TRA 
PRE-MRNA; SPLICING 
REGULATION, RNP DOMAIN, RNA 
COMPLEX 


GENE REGULATION/RNA POLY(A)


Compound


RASE BETA SUBUNIT; CHAIN: 
B, D; 


OUTER ARM DYNEIN; CHAIN: 
A; 


GTPASE-ACTIVATING 
PROTEIN RNA1_SCHPO;
CHAIN: A, B; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 




TARGET OF MYB1; CHAIN: A,
B; 


TARGET OF MYB1; CHAIN: A,
B; 
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PDB annotation 


BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM,
PROTEIN-RNA COMPLEX, GENE
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM,
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA


Compound
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PDB annotation 




GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM,
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


RNA BINDING PROTEIN RNA- 
BINDING DOMAIN 


RNA BINDING PROTEIN RNA- 
BINDING DOMAIN 


RNA BINDING PROTEIN RNA- 
BINDING DOMAIN 


RIBONUCLEOPROTEIN U1A117;
RIBONUCLEOPROTEIN, RNP 
DOMAIN, SPLICEOSOME 


STRUCTURAL PROTEIN PROTEIN 
C23; RNP, RBD, RRM, RNA BINDING 
DOMAIN, NUCLEOLUS 


NUCLEAR PROTEIN 
HETEROGENEOUS NUCLEAR 
RIBONUCLEOPROTEIN A1,


Compound
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HU ANTIGEN C; CHAIN: A; 


HU ANTIGEN C; CHAIN: A; 


HU ANTIGEN C; CHAIN: A; 


U1 SMALL NUCLEAR
RIBONUCLEOPROTEIN A; 
CHAIN: NULL; 


NUCLEOLIN RBD2; CHAIN: A; 


HNRNP A1; CHAIN: NULL;


SEQFOL 
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PDB annotation 


ANKYRIN MOTIF 
COMPLEX (INHIBITOR
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 


COMPLEX (INHIBITOR
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 


Compound


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN:
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


SEQFOL 
D score 
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score 
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PDB annotation 


KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 


HORMONE/GROWTH FACTOR P18-
INK4C; CELL CYCLE INHIBITOR, 
P18INK4C, TUMOR, SUPPRESSOR, 
CYCLIN- 2 DEPENDENT KINASE, 
HORMONE/GROWTH FACTOR 


HORMONE/GROWTH FACTOR P18-
INK4C; CELL CYCLE INHIBITOR, 
P18INK4C, TUMOR, SUPPRESSOR, 
CYCLIN- 2 DEPENDENT KINASE, 
HORMONE/GROWTH FACTOR 


SIGNALING PROTEIN HELIX-TURN-
HELIX, ANKYRIN REPEAT 


SIGNALING PROTEIN HELIX-TURN- 
HELIX, ANKYRIN REPEAT


CELL CYCLE INHIBITOR P18-
INK4C(INK6); CELL CYCLE 
INHIBITOR, P18-INK4C(INK6), 
ANKYRIN REPEAT, 2 CDK 4/6 
INHIBITOR 


INK4C(INK6); CELL CYCLE 
INHIBITOR, P18-INK4C(INK6), 
ANKYRIN REPEAT, 2 CDK 4/6 
INHIBITOR 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX


Compound




CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A; 


CYCLIN-DEPENDENT KINASE 
4 INHIBITOR B; CHAIN: A; 


CYCLIN-DEPENDENT KINASE 
4 INHIBITOR B; CHAIN: A; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A, B; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A, B; 


NF-KAPPA-B P65 SUBUNIT;
CHAIN: A; NF-KAPPA-B P50D
SUBUNIT; CHAIN: C; I-KAPPA-
B-ALPHA; CHAIN: D;


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 


NF-KAPPA-B P65 SUBUNIT;
CHAIN: A; NF-KAPPA-B P50D
SUBUNIT; CHAIN: C; I-KAPPA-
B-ALPHA; CHAIN: D;


SEQFOL 
D score 
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PDH annotation 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR,
IKB/NFKB COMPLEX 




COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 


Compound




NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D;


NF-KAPPA-B P65 SUBUNIT;
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D;


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D;


MYOTROPHIN; CHAIN: NULL 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D;
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D score 
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PDB annotation 


(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ANKYRIN REPEATS, CELL-CYCLE


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ANKYRIN REPEATS, CELL-CYCLE


Compound


I-KAPPA-B-ALPHA; CHAIN: E, 
F;


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 



NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


REGULATORY PROTEIN SWI6; 
CHAIN: A, B; 


REGULATORY PROTEIN SWI6; 
CHAIN: A, B; 


SEQFOL 
D score 
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PDB; annotation 


COMPLEX (ANTI-
ONCOGENE/ANKYRIN REPEATS) 
P53BP2; ANKYRIN REPEATS, SH3, 
P53, TUMOR SUPPRESSOR, 
MULTIGENE 2 FAMILY, NUCLEAR 
PROTEIN, PHOSPHORYLATION, 
DISEASE MUTATION, 3 
POLYMORPHISM, COMPLEX (ANTI- 
ONCOGENE/ANKYRIN REPEATS)


COMPLEX (ANTI-
ONCOGENE/ANKYRIN REPEATS)
P53BP2; ANKYRIN REPEATS, SH3,
P53, TUMOR SUPPRESSOR,
MULTIGENE 2 FAMILY, NUCLEAR 
PROTEIN, PHOSPHORYLATION, 
DISEASE MUTATION, 3 
POLYMORPHISM, COMPLEX (ANTI- 
ONCOGENE/ANKYRIN REPEATS) 


KINASE KINASE, SIGNAL 

TRANSDUCTION, 

CALCIUM/CALMODULIN


PROTEIN KINASE CDK2; PROTEIN 


Compound


P53; CHAIN: A; 53BP2; CHAIN: 
B; 


P53; CHAIN: A; 53BP2; CHAIN: 
B; 


CALCIUM/CALMODULIN- 
DEPENDENT PROTEIN KINASE; 
CHAIN: NULL;


TRANSFERASE(PHOSPHOTRA 
NSFERASE) $C-/AMP$- 
DEPENDENT PROTEIN KINASE 
(E.C.2.7.1.37) ($C/APK$) 1APM 3
(CATALYTIC SUBUNIT) ALPHA
ISOENZYME MUTANT WITH
SER 139 1APM 4 REPLACED BY
ALA (/S139A$) COMPLEX WITH
THE PEPTIDE 1APM 5
INHIBITOR PKI(5-24) AND THE
DETERGENT MEGA-8 1APM 6
CYCLIN-DEPENDENT PROTEIN 
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PDI5 annotation 


SIGNAL TRANSDUCTION
COMPLEX (GTP- 

BINDING/TRANSDUCER) BETA1,
TRANSDUCIN BETA SUBUNIT;
GAMMA1, TRANSDUCIN GAMMA
SUBUNIT; COMPLEX (GTP-
BINDING/TRANSDUCER), G
PROTEIN, HETEROTRIMER 2
SIGNAL TRANSDUCTION




COMPLEX 

(TRANSFERASE/PEPTIDE)
COMPLEX 

(TRANSFERASE/PEPTIDE) 




COMPLEX (PROTO- 
ONCOGENE/EARLY PROTEIN) SRC
HOMOLOGY 2 DOMAIN; SH2 
DOMAIN, SIGNAL TRANSDUCTION. 


Compound


GT-ALPHA/GI-ALPHA
CHIMERA; CHAIN: A; GT-
BETA; CHAIN: B; GT-GAMMA; 
CHAIN: G; 


GT-ALPHA/GI-ALPHA
CHIMERA; CHAIN: A; GT- 
BETA; CHAIN: B; GT-GAMMA; 
CHAIN: G; 


C-SRC TYROSINE KINASE; 
CHAIN: A, B; ACE-FORMYL
PHOSPHOTYR-GLU-(N,N- 
DIPENTYL AMINE); CHAIN: C, 


TRANSFERASE(PHOSPHOTRA 
NSFERASE) PROTO- 
ONCOGENE TYROSINE 
KINASE (E.C.2.7.1.112) 1AB2 3
(SRC HOMOLOGY 2 DOMAIN) 
(ABELSON, SH2 ABL) 1AB2 4 
(NMR, 20 STRUCTURES) 1AB2 5 


FYN PROTEIN-TYROSINE 
KINASE; CHAIN: F; 
PHOSPHOTYROSYL PEPTIDE; 
CHAIN: P 


SEQFOL 
D score 


104.96 










PMF 
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PDB annotation 


o 


HALOPEROXIDASE 
CHLOROPEROXIDASE A1,
HALOPEROXIDASE A1;
HALOPEROXIDASE, 
OXIDOREDUCTASE 


HALOPEROXIDASE 
HALOPEROXIDASE F; 
HALOPEROXIDASE, 
OXIDOREDUCTASE, PROPIONATE 
COMPLEX 


HALOPEROXIDASE 
HALOPEROXIDASE F; 
HALOPEROXIDASE, 
OXIDOREDUCTASE, PROPIONATE 
COMPLEX 


AMINOPEPTIDASE 
AMINOPEPTIDASE, PROLINE 
IMINOPEPTIDASE, SERINE 
PROTEASE, 2 XANTHOMONAS 
CAMPESTRIS


HYDROLASE HYDROLASE, 
HALOALKANE DEHALOGENASE, 
ALPHA/BETA-HYDROLASE 


HYDROLASE HYDROLASE, 
HALOALKANE DEHALOGENASE, 
ALPHA/BETA-HYDROLASE 


HALOPEROXIDASE 
HALOPEROXIDASE A2, 
CHLOROPEROXIDASE A2; 
HALOPEROXIDASE, 
OXIDOREDUCTASE, PEROXIDASE, 
ALPHA/BETA 2 HYDROLASE FOLD, 
MUTANT M99T 


Compound




BROMOPEROXIDASE A1;
CHAIN: NULL;


CHLOROPEROXIDASE F; 
CHAIN: NULL; 


CHLOROPEROXIDASE F; 
CHAIN: NULL; 


PROLINE IMINOPEPTIDASE; 
CHAIN: A, B; 


HALOALKANE 
DEHALOGENASE; CHAIN: 
NULL; 


HALOALKANE 
DEHALOGENASE; CHAIN: 
NULL; 


BROMOPEROXIDASE A2; 
CHAIN: NULL; 




SEQFOL 
D score 




95.47 


83.84 




67.83 


94.11




89.86 


PMF 
score 








1.00 






1.00 




Verify 
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0.61 






0.45 
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PDB annotation 


HYDROLASE BPHD; HYDROLASE, 
PCB DEGRADATION 


HYDROLASE A/B HYDROLASE
FOLD, DEHALOGENASE I-S BOND 


HYDROLASE A/B HYDROLASE 
FOLD, DEHALOGENASE I-S BOND 


HYDROLASE LINB, 1,3,4,6- 
TETRACHLORO-1,4-
CYCLOHEXADIENE 
DEHALOGENASE, LINDANE, 
BIODEGRADATION, ALPHA/BETA- 
HYDROLASE


HYDROLASE TRIACYLGLYCEROL- 
HYDROLASE, X-RAY 
CRYSTALLOGRAPHY, 2 
PSEUDOMONADACEAE, 
OXYANION, CIS-PEPTIDE, 
HYDROLASE 


HYDROLASE TRIACYLGLYCEROL- 
HYDROLASE, X-RAY 
CRYSTALLOGRAPHY, 2 
PSEUDOMONADACEAE, 
OXYANION, CIS-PEPTIDE,
HYDROLASE 


HYDROLASE HYDROLASE, 
ALPHA/BETA HYDROLASE FOLD, 
EPOXIDE DEGRADATION, 2 
EPICHLOROHYDRIN 


HYDROLASE HOMODIMER,
ALPHA/BETA HYDROLASE FOLD,
DISUBSTITUTED UREA 2 INHIBITOR


Compound




HALOALKANE 
DEHALOGENASE; 1- 
CHLOROHEXANE CHAIN: A;


HALOALKANE 
DEHALOGENASE; 1- 
CHLOROHEXANE CHAIN: A;


HALOALKANE 
DEHALOGENASE; CHAIN: A; 


TRIACYLGLYCEROL
HYDROLASE; CHAIN: NULL;


TRIACYLGLYCEROL 
HYDROLASE; CHAIN: NULL; 


SOLUBLE EPOXIDE 
HYDROLASE; CHAIN: A, B, C, 
D; 


EPOXIDE HYDROLASE; CHAIN: 
A,B; 


SEQFOL 
D score 


77.34 


122.01 










119.90 




PMF 
score 
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PDi; annotation 


ANALOGUE, ENANTIOSELECTIVITY 








METAL BINDING PROTEIN RING 
FINGER PROTEIN MAT1; RING
FINGER (C3HC4)


DNA-BINDING PROTEIN V(D)J 
RECOMBINATION ACTIVATING
PROTEIN 1; RAG1, V(D)J
RECOMBINATION, ANTIBODY,
MAD, RING FINGER, 2 ZINC
BINUCLEAR CLUSTER, ZINC
FINGER, DNA-BINDING PROTEIN


DNA-BINDING PROTEIN V(D)J 
RECOMBINATION ACTIVATING 
PROTEIN 1; RAG1, V(D)J
RECOMBINATION, ANTIBODY,
MAD, RING FINGER, 2 ZINC
BINUCLEAR CLUSTER, ZINC
FINGER, DNA-BINDING PROTEIN




TOXIN BINDING PROTEIN TWO 
DOMAINS: BETA PROPELLER AND
ALPHA/BETA FOLD 


TRANSCRIPTION INHIBITOR BETA- 
PROPELLER 


Compound






VIRUS EQUINE HERPES VIRUS- 
1 (C3HC4, OR RING DOMAIN) 
1CHC 3 (NMR, 1 STRUCTURE)
1CHC 4


VIRUS EQUINE HERPES VIRUS-
1 (C3HC4, OR RING DOMAIN)
1CHC 3 (NMR, 1 STRUCTURE)
1CHC 4


CDK-ACTIVATING KINASE
ASSEMBLY FACTOR MAT1;
CHAIN: A; 


RAG1; CHAIN: NULL;


RAG1; CHAIN: NULL;




TOLB PROTEIN; CHAIN: A; 


TRANSCRIPTIONAL 
REPRESSOR TUP1; CHAIN: A,
B, C; 


SEQFOL 
D score 






















PMF 
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PDB annotation 


METHYLTRANSFERASE, 

NEUROTRANSMITTER 

DEGRADATION 




REPLICATION DNA NUCLEOTIDE 
EXCISION REPAIR, UVRABC, 
HELICASE, 2 

HYPERTHERMOSTABLE PROTEIN 


REPLICATION DNA NUCLEOTIDE 
EXCISION REPAIR, UVRABC, 
HELICASE, 2 

HYPERTHERMOSTABLE PROTEIN 


HYDROLASE HEPATITIS C VIRUS, 
BIFUNCTIONAL, PROTEASE-
HELICASE 


HYDROLASE UVRB; MULTIDOMAIN
PROTEIN 


HYDROLASE UVRB; MULTIDOMAIN 
PROTEIN 


GENE REGULATION APO PROTEIN 


GENE REGULATION APO PROTEIN 


TRANSLATION YEAST INITIATION 
FACTOR 4A, EIF4A; HELICASE,
INITIATION FACTOR 4A, DEAD-BOX 
PROTEIN 


TRANSLATION EUKARYOTIC 
INITIATION FACTOR 4A; IF4A, 
HELICASE, DEAD-BOX PROTEIN


TRANSLATION EUKARYOTIC 
INITIATION FACTOR 4A; IF4A,
HELICASE, DEAD-BOX PROTEIN
TRANSLATION EUKARYOTIC 


Compound






DNA NUCLEOTIDE EXCISION 
REPAIR ENZYME UVRB; 
CHAIN: A; 


DNA NUCLEOTIDE EXCISION 
REPAIR ENZYME UVRB; 
CHAIN: A; 


PROTEASE/HELICASE NS3;
CHAIN: A, B; 


EXCINUCLEASE ABC SUBUNIT 
B; CHAIN: A; 


EXCINUCLEASE ABC SUBUNIT 
B; CHAIN: A;


EXCINUCLEASE UVRABC 
COMPONENT UVRB; CHAIN: A;


EXCINUCLEASE UVRABC 
COMPONENT UVRB; CHAIN: A;


EUKARYOTIC INITIATION 
FACTOR 4A; CHAIN: A; 


YEAST INITIATION FACTOR 
4A; CHAIN: A, B; 


YEAST INITIATION FACTOR 
4A; CHAIN: A, B; 

YEAST INITIATION FACTOR 


SEQFOL 
D score 


























PMF 
score 
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PDlt annotation 


i ipe in 

CONNECTIN A71, CONNECTIN; 
TITIN, CONNECTIN, FIBRONECTIN 
TYPE III 


SIGNALING PROTEIN CYTOKINE 
RECEPTOR, GLYCOPROTEIN 130, 
GP130, INTERLEUKINE 6 2 
RECEPTOR BETA SUBUNIT, 
SIGNALING PROTEIN 






STRUCTURAL PROTEIN I-DOMAIN, 
METAL BINDING, COLLAGEN, 
ADHESION


STRUCTURAL PROTEIN I-DOMAIN, 
METAL BINDING, COLLAGEN, 
ADHESION 


Compound


TITIN; CHAIN: NULL; 


GP130; CHAIN: A, B; 


NEURAL ADHESION 
MOLECULE DROSOPHILA 
NEUROGLIAN 

(CHYMOTRYPTIC FRAGMENT 
CONTAINING THE 1CFB 3 TWO
AMINO PROXIMAL 
FIBRONECTIN TYPE III 
REPEATS 1CFB 4 (RESIDUES
610-814)) 1CFB 5


NEURAL ADHESION 
MOLECULE DROSOPHILA 
NEUROGLIAN 

(CHYMOTRYPTIC FRAGMENT 
CONTAINING THE 1CFB 3 TWO
AMINO PROXIMAL 
FIBRONECTIN TYPE III 
REPEATS 1CFB 4 (RESIDUES
610-814)) 1CFB 5


INTEGRIN ALPHA-1; CHAIN: A,
B; 


INTEGRIN ALPHA-1; CHAIN: A, 
B; 

CELL ADHESION PROTEIN 
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PDB annotation 


CELL ADHESION LFA-1, ALPHA- 
L\,BETA-2 INTEGRIN, A-DOMAIN; 
1LFA 8


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN, RGD, 
EXTRACELLULAR MATRIX, 2
HEPARIN-BINDING, 
GLYCOPROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN, RGD, 
EXTRACELLULAR MATRIX, 2 
HEPARIN-BINDING,
GLYCOPROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN, RGD, 
EXTRACELLULAR MATRIX, 2 
HEPARIN-BINDING,
GLYCOPROTEIN 


CELL ADHESION INTEGRIN, CELL 
ADHESION 


CELL ADHESION INTEGRIN, CELL 
ADHESION 


STRUCTURAL PROTEIN INTEGRIN,
HEMIDESMOSOME, FIBRONECTIN, 
CARCINOMA, STRUCTURAL 2 
PROTEIN 


STRUCTURAL PROTEIN TENASCIN, 
FIBRONECTIN TYPE-III, HEPARIN,
EXTRACELLULAR 2 MATRIX, 
ADHESION, FUSION PROTEIN, 
STRUCTURAL PROTEIN 


Compound


CD11A; 1LFA 5 CHAIN: A, B;
1LFA 6


FIBRONECTIN; CHAIN: NULL; 


FIBRONECTIN; CHAIN: NULL; 


FIBRONECTIN; CHAIN: NULL; 


ALPHA1 BETA1 INTEGRIN;
CHAIN: A; ALPHA1 BETA1
INTEGRIN; CHAIN: B;


ALPHA1 BETA1 INTEGRIN;
CHAIN: A; ALPHA1 BETA1
INTEGRIN; CHAIN: B; 


INTEGRIN BETA-4 SUBUNIT; 
CHAIN: A, B; 


TENASCIN; CHAIN: A, B; 


SEQFOL 
D score 




75.51 
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PDB annotation 


STRUCTURAL PROTEIN TENASCIN, 
FIBRONECTIN TYPE-III, HEPARIN,
EXTRACELLULAR 2 MATRIX, 
ADHESION, FUSION PROTEIN, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TENASCIN, 
FIBRONECTIN TYPE-III, HEPARIN,
EXTRACELLULAR 2 MATRIX, 
ADHESION, FUSION PROTEIN, 
STRUCTURAL PROTEIN 








PROTEIN BINDING ED-B, 
FIBRONECTIN, TYPEIII DOMAIN,
ANGIOGENESIS, PROTEIN 2 
BINDING 


PROTEIN BINDING ED-B, 
FIBRONECTIN, TYPEIII DOMAIN,
ANGIOGENESIS, PROTEIN 2 
BINDING 


COAGULATION FACTOR 




Compound


TENASCIN; CHAIN: A, B; 


TENASCIN; CHAIN: A, B;


CELL ADHESION PROTEIN 
TENASCIN (THIRD 
FIBRONECTIN TYPE III 
REPEAT) 1TEN 3


CELL ADHESION PROTEIN 
TENASCIN (THIRD 
FIBRONECTIN TYPE III 
REPEAT) 1TEN 3


GLYCOPROTEIN FIBRONECTIN 
(TENTH TYPE III MODULE) 
(NMR, 36 STRUCTURES) 1TTF 3


FIBRONECTIN; CHAIN: A; 


FIBRONECTIN; CHAIN: A; 


HUMAN TISSUE FACTOR; 2HFT 
4 CHAIN: NULL; 2HFT 5 


HORMONE/RECEPTOR HUMAN 
GROWTH HORMONE 
COMPLEXED WITH ITS 


SEQFOL 
D score 


77.31 
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PDB annotation 


CDK6; P19INK4D; CYCLIN 
DEPENDENT KINASE, CYCLIN 
DEPENDENT KINASE INHIBITORY 2 
PROTEIN, CDK, INK4, CELL CYCLE, 
COMPLEX (KINASE/INHIBITOR) 
HEADER HELIX 










TRANSFERASE KINASE DOMAIN, 
AUTOINHIBITORY FRAGMENT, 
HOMODIMER 


SERINE/THREONINE-PROTEIN
KINASE CSBP, RK, P38; PROTEIN 
SER/THR-KINASE, 
SERINE/THREONINE-PROTEIN 
KINASE 


Compound


6; CHAIN: A, C; CYCLIN- 
DEPENDENT KINASE
INHIBITOR; CHAIN: B, D; 


PHOSPHOTRANSFERASE 
CAMP-DEPENDENT PROTEIN 
KINASE CATALYTIC SUBUNIT 
1CMK 3 (E.C.2.7.1.37) 1CMK 4


PHOSPHOTRANSFERASE 
CAMP-DEPENDENT PROTEIN 
KINASE CATALYTIC SUBUNIT 
1CMK 3 (E.C.2.7.1.37) 1CMK 4


TRANSFERASE(PHOSPHOTRA 
NSFERASE) CAMP- 
DEPENDENT PROTEIN KINASE 
(E.C.2.7.1.37) (CAPK) 1CTP 3
(CATALYTIC SUBUNIT) 1CTP 4


TRANSFERASE(PHOSPHOTRA 
NSFERASE) CAMP- 
DEPENDENT PROTEIN KINASE 
(E.C.2.7.1.37) (CAPK) 1CTP 3
(CATALYTIC SUBUNIT) 1CTP 4


SERINE/THREONINE-PROTEIN 
KINASE PAK-ALPHA; CHAIN: 


A, B; SERINE/THREONINE- 
PROTEIN KINASE PAK-ALPHA; 
CHAIN: C, D; 


P38 MAP KINASE; CHAIN: 
NULL; 


SEQFOL 
D score 
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PDH annotation 




TRANSFERASE MITOGEN
ACTIVATED PROTEIN KINASE;
TRANSFERASE, MAP KINASE,
SERINE/THREONINE-PROTEIN
KINASE, 2 P38


KINASE RABBIT MUSCLE 
PHOSPHORYLASE KINASE; 
GLYCOGEN METABOLISM, 
TRANSFERASE, 

SERINE/THREONINE-PROTEIN, 2 
KINASE, ATP-BINDING, 
CALMODULIN-BINDING


TRANSFERASE MAP KINASE, 
SERINE/THREONINE PROTEIN 
KINASE, TRANSFERASE


TRANSFERASE MAP KINASE, 
SERINE/THREONINE PROTEIN 
KINASE, TRANSFERASE


SERINE KINASE SERINE KINASE, 
TITIN, MUSCLE, AUTOINHIBITION 


AMINE/CARBOXYLATE LIGASE
AMINE/CARBOXYLATE LIGASE 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP, RIBONUCLEOPROTEIN


RNA-BINDING PROTEIN/RNA TRA 
PRE-MRNA: SPLICING 


Compound


TWITCHIN; CHAIN: NULL; 


TWITCHIN; CHAIN: A, B; 


MAP KINASE P38; CHAIN: 
NULL; 


PHOSPHORYLASE KINASE; 
CHAIN: NULL; 


ERK2; CHAIN: NULL; 


ERK2; CHAIN: NULL; 


TITIN; CHAIN: A, B; 


GLUTATHIONE SYNTHETASE; 
CHAIN: A; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


SXL-LETHAL PROTEIN; CHAIN: 
A, B; RNA (5'- 
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60.82 






58.82 










PMF 
score 


d 


0.55 




0.81 


0.13 




0.54 


0.52 


0.52 


«n 

Os 

d 


Verify 
score 


r- 

CN 

d 


-0.31 




0.03 


-0.10 




r- 

CN 

d 
• 


-0.23 


0.40 


0.26 


Psi Blast 


m 
i 

CO 


6.4e-34 


CN 
VO 


3 

CN 
CO 


1.1e-24


1.1e-24


3.2e-27 


0.0088 


r- 
o 

<& 

«n 


2 
On 




VO ' 

in 
m 


vO 

«n 
rn 


m 
o 


VO 

in 
m 




CN 

as 

CO 


CM 

VO 

m 


VO 

\o 


»n 
o> 


VO 

i— * 

VO' 


STAR 
TAA 


00 

si- 


o 




LZl 


CN 


-n 

50 


z> 
n 


m 


vo 

00 
00 


rs 
n 


CHAI 

■ 




< 










< 


< 


m 


< 


: 


lKoa 


s 


o 


3 

a 


ipme 


ipme 




2hgs 


C t 

S ; 


M 

*«» 

o 


lag 


N < 
^ 1 


N ( 

1 


N ( 

^ 1 


N ( 

1 




d* * 
N t 
^ 1 


M 


1726 


1728 


o 

N 






WO 02/070539 



PCT/US02/05095 






PDB annotation 


RIBONUCLEOPROTEIN PTB, PTB- 
C198, HETEROGENEOUS NUCLEAR 
POLYPYRIMIDINE TRACT BINDING 
PROTEIN, RNP, RNA, SPLICING, 2
TRANSLATION 




COMPLEX 

(RIBONUCLEOPROTEIN/RNA) 


RNA BINDING PROTEIN RNA- 
BINDING DOMAIN 


RNA-BINDING DOMAIN RNA- 
BINDING DOMAIN, ALTERNATIVE 
SPLICING 


NUCLEAR PROTEIN U1 SNRNP A
PROTEIN; RNA BINDING DOMAIN, 
NUCLEAR PROTEIN 


NUCLEAR PROTEIN U1 SNRNP A
PROTEIN; RNA BINDING DOMAIN, 
NUCLEAR PROTEIN 


COMPLEX 

(RIBONUCLEOPROTEIN/DNA) 
HNRNP A1, UP1; COMPLEX
(RIBONUCLEOPROTEIN/DNA), 


Compound


POLYPYRIMIDINE TRACT- 
BINDING PROTEIN; CHAIN: A; 


RNA-BINDING PROTEIN SEX- 
LETHAL PROTEIN (C- 
TERMINUS, OR SECOND RNA- 
BINDING DOMAIN 1SXL 3
(RBD-2), RESIDUES 199 - 294
PLUS N-TERMINAL MET) 1SXL
4 (NMR, 17 STRUCTURES) 1SXL
5


U1A SPLICEOSOMAL PROTEIN;
1URN 5 CHAIN: A, B, C; 1URN 6
RNA 21MER HAIRPIN (5'-
(AP*AP*UP*CP*CP*AP*UP*UP*
1URN 11 CHAIN: P, Q, R 1URN
13


MUSASHI1; CHAIN: A;


SEX-LETHAL PROTEIN; CHAIN: 
NULL; 


U1 SMALL NUCLEAR
RIBONUCLEOPROTEIN A; 
CHAIN: NULL; 


U1 SMALL NUCLEAR
RIBONUCLEOPROTEIN A; 
CHAIN: NULL; 


HETEROGENEOUS NUCLEAR 
RIBONUCLEOPROTEIN A1;
CHAIN: A; 12-NUCLEOTIDE 
SINGLE-STRANDED 


SEQFOL 
D score 
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PDH annotation 


HETEROGENEOUS NUCLEAR 2 
RIBONUCLEOPROTEIN A1


COMPLEX 

(RIBONUCLEOPROTEIN/DNA) 
HNRNP A1, UP1; COMPLEX
(RIBONUCLEOPROTEIN/DNA),
HETEROGENEOUS NUCLEAR 2
RIBONUCLEOPROTEIN A1


RNA BINDING DOMAIN RNA 
BINDING DOMAIN, RBD, RNA
RECOGNITION MOTIF, RRM, 2 
SPLICING INHIBITOR, 
TRANSLATIONAL INHIBITOR, SEX 3 
DETERMINATION, X CHROMOSOME 
DOSAGE COMPENSATION


RNA BINDING DOMAIN RNA
BINDING DOMAIN, RBD, RNA 
RECOGNITION MOTIF, RRM, 2 
SPLICING INHIBITOR, 
TRANSLATIONAL INHIBITOR, SEX 3
DETERMINATION, X CHROMOSOME 
DOSAGE COMPENSATION 


PLASMA PROTEIN PLASMA 
PROTEIN, METAL-BINDING, LIPID- 
BINDING 




Compound


TELOMETRIC DNA; CHAIN: B; 


HETEROGENEOUS NUCLEAR 
RIBONUCLEOPROTEIN A1;
CHAIN: A; 12-NUCLEOTIDE 
SINGLE-STRANDED 
TELOMETRIC DNA; CHAIN: B; 


SEX-LETHAL; CHAIN: A, B, C; 


SEX-LETHAL; CHAIN: A, B, C; 


SERUM ALBUMIN; CHAIN: A; 


DNA-BINDING PROTEIN 
ANTENNAPEDIA PROTEIN 
(HOMEODOMAIN) MUTANT 
WITH CYS 39 1AHD 3
REPLACED BY SER (C39S)
COMPLEX WITH DNA (NMR,
1AHD 4 16 STRUCTURES) 1AHD


SEQFOL 
D score 
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score 
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PDB annotation 




PROTEIN/DNA HOMEODOMAIN, 
DNA, COMPLEX, DNA-BINDING 
PROTEIN, PROTEIN/DNA 


TRANSCRIPTION/DNA 
ULTRABITHORAX; PBX PROTEIN; 
DNA BINDING, HOMEODOMAIN, 
HOMEOTIC PROTEINS, 
DEVELOPMENT, 2 SPECIFICITY 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) DNA-BINDING 
PROTEIN, DNA, PAIRED BOX, 
TRANSCRIPTION 2 REGULATION 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) DNA-BINDING 
PROTEIN, DNA, PAIRED BOX, 
TRANSCRIPTION 2 REGULATION 










Compound




HOMEOBOX PROTEIN HOX-B1;
CHAIN: A; PBX1; CHAIN: B;
DNA CHAIN: D; DNA CHAIN: E; 


ULTRABITHORAX HOMEOTIC 
PROTEIN IV; CHAIN: A; 
HOMEOBOX PROTEIN 
EXTRADENTICLE; CHAIN: B; 
DNA (5'- CHAIN: C; DNA (5'-
CHAIN: D; 


PAIRED PROTEIN; CHAIN: A, B, 
C; DNA; CHAIN: D, E, F 


PAIRED PROTEIN; CHAIN: A, B, 
C; DNA; CHAIN: D, E, F 


TRANSCRIPTION REGULATION 
TRANSCRIPTION FACTOR 
LFB1 (HOMEODOMAIN) 1LFB 3


GENE REGULATING PROTEIN 
REPRESSOR PROTEIN FROM 
BACTERIOPHAGE 434 (DNA- 
BINDING 1PRA 3 DOMAIN,
RESIDUES 1-69) (NMR, 20
STRUCTURES) 1PRA 4


GENE REGULATING PROTEIN 
REPRESSOR (AMINO- 
TERMINAL DOMAIN) (R1-69)
1R69 4 


DNA-BINDING PROTEIN 
ANTENNAPEDIA PROTEIN 
(HOMEODOMAIN) MUTANT 


SEQFOL 
D score 
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PjDE1 annotation 




TRANSCRIPTION REGULATION 
GENE REGULATING PROTEIN, 
TRANSCRIPTION REGULATION




COMPLEX (DNA-BINDING 
PROTEIN/DNA) HD; 
HOMEODOMAIN, COMPLEX (DNA- 
BINDING PROTEIN/DNA)






PROTEIN/DNA HOMEODOMAIN, 
DNA, COMPLEX, DNA-BINDING 
PROTEIN, PROTEIN/DNA


TRANSCRIPTION/DNA 
ULTRABITHORAX; PBX PROTEIN; 
DNA BINDING, HOMEODOMAIN,
HOMEOTIC PROTEINS, 
DEVELOPMENT, 2 SPECIFICITY 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) DNA-BINDING 


Compound


WITH CYS 39 1SAN 3
REPLACED BY SER AND
RESIDUES 1-6 DELETED
(C39S,DEL 1-6) 1SAN 4 (NMR, 20
STRUCTURES) 1SAN 5


PHAGE 434 CRO PROTEIN; 
CHAIN: NULL; 


GENE REGULATING PROTEIN 
CRO PROTEIN 2CRO 4 


ANTENNAPEDIA PROTEIN; 
CHAIN: A, B; DNA; CHAIN: C, 
D, E, F; 




DNA-BINDING PROTEIN 
ANTENNAPEDIA PROTEIN 
(HOMEODOMAIN) MUTANT 
WITH CYS 39 1AHD 3
REPLACED BY SER (C39S)
COMPLEX WITH DNA (NMR,
1AHD 4 16 STRUCTURES) 1AHD
5 


HOMEOBOX PROTEIN HOX-B1;
CHAIN: A; PBX1; CHAIN: B;
DNA CHAIN: D; DNA CHAIN: E;


ULTRABITHORAX HOMEOTIC 
PROTEIN IV; CHAIN: A; 
HOMEOBOX PROTEIN 
EXTRADENTICLE; CHAIN: B; 
DNA (5'- CHAIN: C; DNA (5'- 
CHAIN: D; 


PAIRED PROTEIN; CHAIN: A, B, 
C; DNA; CHAIN: D, E, F


SEQFOLD score
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score 
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FDB annotation 


PROTEIN, DNA, PAIRED BOX, 
TRANSCRIPTION 2 REGULATION 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) DNA-BINDING 
PROTEIN, DNA, PAIRED BOX, 
TRANSCRIPTION 2 REGULATION 










TRANSCRIPTION REGULATION 
GENE REGULATING PROTEIN, 
TRANSCRIPTION REGULATION 




COMPLEX (DNA-BINDING 
PROTEIN/DNA) HD; 


Compound




PAIRED PROTEIN; CHAIN: A, B, 
C; DNA; CHAIN: D, E, F 


TRANSCRIPTION REGULATION 
TRANSCRIPTION FACTOR 
LFB1 (HOMEODOMAIN) 1LFB 3


GENE REGULATING PROTEIN 
REPRESSOR PROTEIN FROM 
BACTERIOPHAGE 434 (DNA-
BINDING 1PRA 3 DOMAIN,
RESIDUES 1-69) (NMR, 20
STRUCTURES) 1PRA 4


GENE REGULATING PROTEIN 
REPRESSOR (AMINO-
TERMINAL DOMAIN) (R1-69)
1R69 4 


DNA-BINDING PROTEIN 
ANTENNAPEDIA PROTEIN 
(HOMEODOMAIN) MUTANT


WITH CYS 39 1SAN 3
REPLACED BY SER AND
RESIDUES 1-6 DELETED
(C39S,DEL 1-6) 1SAN 4 (NMR, 20
STRUCTURES) 1SAN 5


PHAGE 434 CRO PROTEIN; 
CHAIN: NULL; 


GENE REGULATING PROTEIN 
CRO PROTEIN 2CRO 4 


ANTENNAPEDIA PROTEIN; 
CHAIN: A, B; DNA; CHAIN: C, 


SEQFOLD score




















PMF 
score 




1.00 


0.99 


0.51 


0.78 


0.19 
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0.29 
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PD3 annotation 


HOMEODOMAIN, COMPLEX (DNA-
BINDING PROTEIN/DNA)


MUSCLE PROTEIN CTNC; CARDIAC, 
MUSCLE PROTEIN, REGULATORY, 
CALCIUM BINDING




MUSCLE PROTEIN CTNC; CARDIAC, 
MUSCLE PROTEIN, REGULATORY, 
CALCIUM BINDING


CALCIUM-BINDING PROTEIN 
CALMODULIN CERIUM TR1C- 
DOMAIN, RESIDUES 1 - 75; CERIUM- 
LOADED, CALCIUM-BINDING 
PROTEIN 


CALCIUM-BINDING CNTNC; 
CALCIUM-BINDING, REGULATION, 
TROPONIN C, CARDIAC MUSCLE 2 
CONTRACTION 


HYDROLASE CALCINEURIN; 
HYDROLASE, PHOSPHATASE, 
IMMUNOSUPPRESSION




CALCIUM-BINDING PROTEIN 
SNTNC; CALCIUM-BINDING, 
REGULATION, TROPONIN C, 
SKELETAL MUSCLE, 2
CONTRACTION 

MUSCLE PROTEIN MDE: MUSCLE 


Compound


Q 


TROPONIN C; CHAIN: NULL; 


TROPONIN C; CHAIN: NULL; 


TROPONIN C; CHAIN: NULL; 


CALMODULIN; CHAIN: NULL; 


CARDIAC N-TROPONIN C; 
CHAIN: NULL; 


SERINE/THREONINE 
PHOSPHATASE 2B; CHAIN: A, 

B;


TROPONIN C; CHAIN: A, B; 


N-TROPONIN C; CHAIN: NULL; 
MYOSIN; CHAIN: A, B, C, D, E, 


SEQFOLD score






82.97 
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70.90 
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PDB annotation 


PROTEIN I 


CALCIUM BINDING CALCIUM 
BINDING 














CALCIUM-BINDING PROTEIN 
CALMODULIN APO TR2C-DOMAIN; 
1CMF 9

STRUCTURAL PROTEIN HELIX- 


Compound


X 

o 


CALCIUM-BINDING PROTEIN; 
CHAIN: NULL; 


CALCIUM-BINDING PROTEIN 
CALMODULIN COMPLEXED 
WITH CALMODULIN-BINDING 
DOMAIN OF 1CDM 3
CALMODULIN-DEPENDENT
PROTEIN KINASE II 1CDM 4


CALCIUM-BINDING PROTEIN 
CALMODULIN COMPLEXED 
WITH CALMODULIN-BINDING 
DOMAIN OF 1CDM 3
CALMODULIN-DEPENDENT
PROTEIN KINASE II 1CDM 4


CALCIUM-BINDING PROTEIN 
CALMODULIN COMPLEXED 
WITH CALMODULIN-BINDING 
DOMAIN OF 1CDM 3
CALMODULIN-DEPENDENT
PROTEIN KINASE II 1CDM 4


CALCIUM-BINDING PROTEIN 
CALMODULIN (VERTEBRATE) 
1CLL 3


CALCIUM-BINDING PROTEIN 
CALMODULIN (VERTEBRATE) 
1CLL 3


CALCIUM-BINDING PROTEIN 
CALMODULIN (VERTEBRATE) 
1CLL 3


CALMODULIN (VERTEBRATE); 
1CMF 6 CHAIN: NULL; 1CMF 7

CARDIAC TROPONIN C; 
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PD B annotation 


TURN-HELIX 

STRUCTURAL PROTEIN HELIX-
TURN-HELIX






CALCIUM-REGULATED MUSCLE 
CONTRACTION MUSCLE 
CONTRACTION, CALCIUM- 
BINDING, TROPONIN, E-F HAND, 2 
OPEN CONFORMATION 
REGULATORY DOMAIN, CALCIUM- 
REGULATED 3 MUSCLE 
CONTRACTION


CALCIUM-REGULATED MUSCLE 
CONTRACTION MUSCLE 
CONTRACTION, CALCIUM- 
BINDING, TROPONIN, E-F HAND, 2 
OPEN CONFORMATION 
REGULATORY DOMAIN, CALCIUM- 
REGULATED 3 MUSCLE 


CALCIUM-REGULATED MUSCLE 
CONTRACTION MUSCLE 


Compound


CARDIAC TROPONIN C; 
CHAIN: A;


CALMODULIN; CHAIN: A; 


CALMODULIN; CHAIN: A; 


CALCIUM BINDING 
PARVALBUMIN (PIKE, PI 5.0, 
ALPHA COMPONENT) 
COMPLEXED WITH IPVAA I 
TWO CALCIUM IONS 1 
(SYNCHROTRON X-RAY 
DIFFRACTION) 1 PVAA 2 


CALCIUM-BINDING PROTEIN 
ALPHA-PARVALBUMIN 1RTP 3


TROPONIN C; CHAIN: NULL; 


TROPONIN C; CHAIN: NULL; 


TROPONIN C; CHAIN: NULL; 


SEQFOLD score








55.26 
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89.65 
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PDI! annotation 




CALMODULIN, CALCIUM BINDING, 
HELIX-LOOP-HELIX, SIGNALLING, 2 
COMPLEX(CALCIUM-BINDING 




MUSCLE PROTEIN MYOSIN, 
CALCIUM BINDING PROTEIN, 
MUSCLE PROTEIN






TRANSFERASE ATP:AMP- 
PHOSPHOTRANSFERASE, 
TRANSFERASE






Compound




CALMODULIN; CHAIN: A; 
RS20; CHAIN: B; 


SCALLOP MYOSIN; CHAIN: A, 
B, C; 


SCALLOP MYOSIN; CHAIN: A, 
B, C; 


TROPONIN C; CHAIN: NULL; 


CALCIUM BINDING CALCIUM- 
BINDING PARVALBUMIN 
($P*I=4.25) 4CPV 3 


TRANSFERASE URIDYLATE 
KINASE (E.C.2.7.4.-) 
COMPLEXED WITH ADP AND 
AMP 1UKZ 3


ADENYLATE KINASE; CHAIN: 
A, B; 


TRANSFERASE(PHOSPHOTRANSFERASE)
ADENYLATE
KINASE (E.C.2.7.4.3) 3ADK 4 


TRANSFERASE URIDYLATE 
KINASE (E.C.2.7.4.-) 
COMPLEXED WITH ADP AND 
AMP 1UKZ 3


SEQFOLD score






64.97 


54.16 


52.43 


51.70 










PMF 

score 




0.95 
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Verify 
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PDB annotation 


HYDROLASE HYDROLASE, BETA- 
LACTAMASE, ANTIBIOTIC, 
METALLOENZYME 


OXIDOREDUCTASE 
OXIDOREDUCTASE, 
METALLOPROTEIN, FE(II) 
COMPLEX, CATECHOL




HYDROLASE PHOSPHOLIPASE C; 
ZINC PHOSPHOLIPASE C, 
GANGRENE DETERMINANT, C2 
DOMAIN, CA 2 AND MEMBRANE 
BINDING, HYDROLASE


COMPLEX 

(HYDROLASE/COFACTOR) 
TRIACYLGLYCEROL LIPASE; 
COMPLEX 

(HYDROLASE/COFACTOR), LIPID 
DEGRADATION
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DIOXYGENASE L-1; DIOXYGENASE, 
LIPOXYGENASE, 

METALLOPROTEIN, FATTY ACIDS


Compound


METALLO BETA-LACTAMASE 
II; CHAIN: A, B; 


LIPOXYGENASE-3; CHAIN:
NULL; 


LIPOXYGENASE-3; CHAIN:
NULL; 


ALPHA -TOXIN; CHAIN: NULL; 


TRIACYLGLYCEROL ACYL- 
HYDROLASE; CHAIN: A, C;
COLIPASE; CHAIN: B, D 


15-LIPOXYGENASE; CHAIN: 
NULL; 


15-LIPOXYGENASE; CHAIN: 
NULL; 


LIPOXYGENASE-1; CHAIN: 
NULL; 


LIPOXYGENASE-1; CHAIN:
NULL; 


SEQFOLD score




253.58 
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PMF 

score 
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Verify 
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PDB annotation 


SIGNALING PROTEIN GTP-BINDING
PROTEIN RHOA, GTPASE RHOA; 
RHO GDI 1; RHO GTPASE, G- 
PROTEIN, SIGNALING PROTEIN 


SIGNALING PROTEIN GTP-BINDING
PROTEIN RHOA, GTPASE RHOA;
RHO GDI 1; RHO GTPASE, G-
PROTEIN, SIGNALING PROTEIN


CELL CYCLE CDC42; RHO GDI 1;
GTP-BINDING PROTEIN, CDC42, 
RHOGDI, X-RAY 


CELL CYCLE CDC42; RHO GDI 1; 
GTP-BINDING PROTEIN, CDC42, 
RHOGDI, X-RAY 






i 


PROTEIN/DNA HOMEODOMAIN, 
DNA, COMPLEX, DNA-BINDING 
PROTEIN, PROTEIN/DNA 


Compound


TRANSFORMING PROTEIN 
RHOA; CHAIN: A, C; RHO GDP 
DISSOCIATION INHIBITOR 
ALPHA; CHAIN: E, F; 


TRANSFORMING PROTEIN 
RHOA; CHAIN: A, C; RHO GDP 
DISSOCIATION INHIBITOR 
ALPHA; CHAIN: E, F; 


GTP-BINDING PROTEIN; 
CHAIN: A; GDP-DISSOCIATION 
INHIBITOR 1; CHAIN: B; 


GTP-BINDING PROTEIN; 
CHAIN: A; GDP-DISSOCIATION 
INHIBITOR 1; CHAIN: B;




DNA-BINDING PROTEIN 
ANTENNAPEDIA PROTEIN 
(HOMEODOMAIN) MUTANT 
WITH CYS 39 1AHD 3
REPLACED BY SER (C39S)
COMPLEX WITH DNA (NMR,
1AHD 4 16 STRUCTURES) 1AHD
5


DNA-BINDING PROTEIN 
ANTENNAPEDIA PROTEIN 
(HOMEODOMAIN) MUTANT 
WITH CYS 39 1AHD 3
REPLACED BY SER (C39S)
COMPLEX WITH DNA (NMR,
1AHD 4 16 STRUCTURES) 1AHD
5


HOMEOBOX PROTEIN HOX-B1;
CHAIN: A; PBX1; CHAIN: B;
DNA CHAIN: D; DNA CHAIN: E;


SEQFOLD score












70.63 






PMF 
score 


1.00 


1.00 


1.00 


1.00 
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Verify 
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PDI3 annotation 


PROTEIN/DNA HOMEODOMAIN, 
DNA, COMPLEX, DNA-BINDING 
PROTEIN, PROTEIN/DNA




TRANSCRIPTION/DNA
ULTRABITHORAX; PBX PROTEIN; 
DNA BINDING, HOMEODOMAIN, 
HOMEOTIC PROTEINS, 
DEVELOPMENT, 2 SPECIFICITY 










Compound


HOMEOBOX PROTEIN HOX-B1;
CHAIN: A; PBX1; CHAIN: B;
DNA CHAIN: D; DNA CHAIN: E;


HOMEOBOX PROTEIN HOX-B1;
CHAIN: A; PBX1; CHAIN: B;
DNA CHAIN: D; DNA CHAIN: E;


ULTRABITHORAX HOMEOTIC 
PROTEIN IV; CHAIN: A; 
HOMEOBOX PROTEIN 
EXTRADENTICLE; CHAIN: B; 
DNA (5'- CHAIN: C; DNA (5'- 
CHAIN: D; 


ULTRABITHORAX HOMEOTIC 
PROTEIN IV; CHAIN: A; 
HOMEOBOX PROTEIN 
EXTRADENTICLE; CHAIN: B; 
DNA (5'- CHAIN: C; DNA (5'- 
CHAIN: D; 


DNA-BINDING FUSHI TARAZU 
PROTEIN (HOMEODOMAIN) 
(NMR, 20 STRUCTURES) 1FTZ 3


DNA-BINDING FUSHI TARAZU 
PROTEIN (HOMEODOMAIN) 
(NMR, 20 STRUCTURES) 1FTZ 3


DNA-BINDING PROTEIN OCT-1 
(POU DOMAIN) 1OCT 3


DNA-BINDING PROTEIN 
ANTENNAPEDIA PROTEIN 
(HOMEODOMAIN) MUTANT 
WITH CYS 39 1SAN 3
REPLACED BY SER AND
RESIDUES 1-6 DELETED
(C39S,DEL 1-6) 1SAN 4 (NMR, 20
STRUCTURES) 1SAN 5


SEQFOLD score




64.24 


64.34 




66.44 






65.69 


score 


0.99 
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PDB annotation 




COMPLEX (DNA-BINDING 
PROTEIN/DNA) HD;
HOMEODOMAIN, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) HD; 
HOMEODOMAIN, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 




LIPOCALIN LIPOCALIN, 
OLFACTION 


SUGAR BINDING PROTEIN NGAL; 
NEUTROPHIL, NGAL, LIPOCALIN 


ALLERGEN LIPOCALIN, BETA
BARREL




■ 


ODORANT-BINDING PROTEIN OBP 
OLFACTION, NOSE, TRANSPORT, 
LIPOCALIN, ODORANT-BINDING 2 


Compound


DNA-BINDING PROTEIN 
ANTENNAPEDIA PROTEIN 
(HOMEODOMAIN) MUTANT 
WITH CYS 39 1SAN 3
REPLACED BY SER AND
RESIDUES 1-6 DELETED
(C39S,DEL 1-6) 1SAN 4 (NMR, 20
STRUCTURES) 1SAN 5


ANTENNAPEDIA PROTEIN; 
CHAIN: A, B; DNA; CHAIN: C, 
D, E, F; 


ANTENNAPEDIA PROTEIN; 
CHAIN: A, B; DNA; CHAIN: C, 
D, E, F; 




ODORANT BINDING PROTEIN; 
CHAIN: A, B; 


HUMAN NEUTROPHIL 
GELATINASE; CHAIN: A, B; 


ALLERGEN EQU C 1; CHAIN: A;


PHEROMONE-BINDING MAJOR 
URINARY PROTEIN COMPLEX 
WITH 2-(SEC-BUTYL) 1MUP 3
THIAZOLINE 1MUP 4


PHEROMONE-BINDING MAJOR 
URINARY PROTEIN COMPLEX 
WITH 2-(SEC-BUTYL) 1MUP 3
THIAZOLINE 1MUP 4


ODORANT-BINDING PROTEIN; 
CHAIN: A, B; 


SEQFOLD score
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PDB: annotation 


PROTEIN 

SUGAR BINDING PROTEIN NGAL; 
NEUTROPHIL LIPOCALIN, SIGNAL 
protein, nr . vhopr htpim 


i 

; 

) 


LIPID BINDING PROTEIN A2U- 
GLOBULIN, LIPID BINDING 
PROTEIN


LIPID BINDING PROTEIN A2U- 
GLOBULIN, LIPID BINDING 
PROTEIN


LIPID BINDING PROTEIN A2U- 
GLOBULIN, LIPID BINDING 
PROTEIN 


COMPLEX (VIRAL 
CAPSID/IMMUNOGLOBULIN) HIV-1 
CA, HIV CA, HIV P24, P24; FAB, FAB 
LIGHT CHAIN, FAB HEAVY CHAIN 
COMPLEX (VIRAL 
CAPSID/IMMUNOGLOBULIN), HIV, 




IMMUNOGLOBULIN HUMAN FAB, 
ANTI-TETANUS TOXOID, HIGH 
AFFINITY, CRYSTAL 2 PACKING 
MOTIF, PROGRAMMING 


Compound


NEUTROPHIL GELATINASE; 
CHAIN: A; 


RETINOL TRANSPORT 
RETINOL BINDING PROTEIN 
1RBP 3


ALPHA-2U-GLOBULIN; CHAIN:
A, B, C, D 




ALPHA-2U-GLOBULIN; CHAIN:
A, B, C, D


HUMAN IMMUNODEFICIENCY 
VIRUS TYPE 1 CAPSID CHAIN: 
A, B; ANTIBODY FAB25.3 
FRAGMENT; CHAIN: H, K, L, M; 


HLA-A 0201; CHAIN: A; BETA-2
MICROGLOBULIN; CHAIN: B; 
TAX PEPTIDE; CHAIN: C; T 
CELL RECEPTOR ALPHA; 
CHAIN: D; T CELL RECEPTOR 
BETA; CHAIN: E;


FAB B7-15A2; CHAIN: L, H; 


SEQFOLD score










78.30 




295.76 




PMF 
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PDH annotation 


REGION, IMMUNOGLOBULIN 
IMMUNOGLOBULIN 
IMMUNOGLOBULIN


COMPLEX 

(IMMUNOGLOBULIN/PEPTIDE) 
IMMUNOGLOBULIN, IGG1; FAB
FRAGMENT, CROSS-REACTIVITY,
HIV1 PROTEASE, ENZYME 2
INHIBITION, COMPLEX 
(IMMUNOGLOBULIN/PEPTIDE) 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


PLANT PROTEIN TWO 
HOMOLOGOUS HEVEIN-LIKE 
DOMAINS 


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA,
SUPERANTIGEN


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA,
SUPERANTIGEN, SACCHARIDE
BINDING 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, SIGNALLING 
PROTEIN 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, SIGNALLING 
PROTEIN 

BLOOD COAGULATION INHIBITOR 


Compound




MONOCLONAL ANTIBODY 
F11.2.32; CHAIN: L, H, M, N;
HIV-1 PROTEASE PEPTIDE; 
CHAIN: P, Q; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C;


AGGLUTININ ISOLECTIN VI; 
CHAIN: A 


AGGLUTININ ISOLECTIN 
VI/AGGLUTININ ISOLECTIN V; 
CHAIN: A; 


AGGLUTININ ISOLECTIN 
I/AGGLUTININ ISOLECTIN V/ 
CHAIN: A; 


TUMOR NECROSIS FACTOR 
RECEPTOR; CHAIN: A, B; 


TUMOR NECROSIS FACTOR 
RECEPTOR; CHAIN: A, B; 

FLAVORIDIN; 1FVL 4 CHAIN: 


SEQFOLD score






50.68 
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62.34 




PMF 
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1.00 
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PDB annotation 


GP IIB/IIIA ANTAGONIST 1FVL 9


GLYCOPROTEIN GLYCOPROTEIN 


GLYCOPROTEIN GLYCOPROTEIN 


SIGNALLING PROTEIN TYPE I 
RECEPTOR, STNFR1; 1NCF 8
BINDING PROTEIN, CYTOKINE
1NCF 19


SIGNALLING PROTEIN TYPE I 
RECEPTOR, STNFR1; 1NCF 8
BINDING PROTEIN, CYTOKINE
1NCF 19


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN 


SERINE PROTEASE INHIBITOR 
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS 


SERINE PROTEASE INHIBITOR 
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS 


SERINE PROTEASE INHIBITOR
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 


Compound


NULL 1FVL 5


LAMININ; CHAIN: NULL;


LAMININ; CHAIN: NULL;


TUMOR NECROSIS FACTOR 
RECEPTOR; 1NCF 4 CHAIN: A,
B; 1NCF 5


TUMOR NECROSIS FACTOR 
RECEPTOR; 1NCF 4 CHAIN: A,
B; 1NCF 5


FACTOR IXA; CHAIN: C, L; D-
PHE-PRO-ARG; CHAIN: I;


ANTISTASIN; CHAIN: NULL; 


ANTISTASIN; CHAIN: NULL; 


ANTISTASIN; CHAIN: NULL; 


SEQFOLD score






VO 




55.28 


57.36 




65.03 




PMF 
score 
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0.23 
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Verify 
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PDB annotation 


ENDOCYTOSIS/EXOCYTOSIS


ENDOCYTOSIS/EXOCYTOSIS BETA 
SANDWICH, CALCIUM ION, C2 
DOMAIN 


SIGNALING PROTEIN DAPP1, PHISH,
BAM32; PLECKSTRIN, 3-
PHOSPHOINOSITIDES, INOSITOL
TETRAKISPHOSPHATE 2 SIGNAL 
TRANSDUCTION PROTEIN, 
ADAPTOR PROTEIN 


SIGNALING PROTEIN DAPP1, PHISH,
BAM32; PLECKSTRIN, 3- 
PHOSPHOINOSITIDES, INOSITOL 
TETRAKISPHOSPHATE 2 SIGNAL 
TRANSDUCTION PROTEIN, 
ADAPTOR PROTEIN 


SIGNALING PROTEIN NF1-333;
NEUROFIBROMIN, TYPE I
NEUROFIBROMATOSIS, NF1, RAS,
GAP, 2 SIGNAL TRANSDUCTION, 
CANCER, GROWTH REGULATION, 
GTP 3 HYDROLYSIS, PATIENT 
MUTATION, ARGININE FINGER 


SIGNALING PROTEIN NF1-333;
NEUROFIBROMIN, TYPE I
NEUROFIBROMATOSIS, NF1, RAS,
GAP, 2 SIGNAL TRANSDUCTION, 
CANCER, GROWTH REGULATION, 
GTP 3 HYDROLYSIS, PATIENT 
MUTATION, ARGININE FINGER 




Compound




SYNAPTOTAGMIN III; CHAIN: 


DUAL ADAPTOR OF 
PHOSPHOTYROSINE AND 3- 
CHAIN: A; 


DUAL ADAPTOR OF 
PHOSPHOTYROSINE AND 3- 
CHAIN: A; 


NEUROFIBROMIN; CHAIN: A; 


NEUROFIBROMIN; CHAIN: A; 


PHOSPHORYLATION 
PLECKSTRIN (N-TERMINAL 
PLECKSTRIN HOMOLOGY 
DOMAIN) MUTANT 1PLS 3


SEQFOLD score










108.36 






PMF 
score 




0.54 


0.36 


0.70 




0.99 


0.37 


Verify 
score 




0.47 
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PDB annotation 


COMPLEX 


RNA-BINDING PROTEIN/RNA TRA 
PRE-MRNA; SPLICING 
REGULATION, RNP DOMAIN, RNA 
COMPLEX 


RNA-BINDING PROTEIN/RNA TRA 
PRE-MRNA; SPLICING 
REGULATION, RNP DOMAIN, RNA 
COMPLEX 


RNA-BINDING PROTEIN/RNA TRA 
PRE-MRNA; SPLICING 
REGULATION, RNP DOMAIN, RNA 
COMPLEX


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM,
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM,
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A)
BINDING PROTEIN 1, PABP 1; RRM,
PROTEIN-RNA COMPLEX, GENE 
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POLYDENYLATE BINDING 
PROTEIN 1; CHAIN: A, B, C, D, 
E,F, G, H; RNA (5- 
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PDB annotation 


REGULATION/RNA


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


RNA BINDING PROTEIN RNA- 
BINDING DOMAIN 


RNA BINDING PROTEIN RNA-
BINDING DOMAIN 


RNA BINDING PROTEIN RNA-
BINDING DOMAIN 


RNA BINDING PROTEIN RNA-


Compound


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< O 


POLYADENYLATE BINDING
PROTEIN 1; CHAIN: A, B, C, D,
E, F, G, H; RNA (5'-
R(*AP*AP*AP*AP*AP*AP*AP*
AP*AP*AP*A)-3'); CHAIN: M, N,
O, P, Q, R, S, T;
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HU ANTIGEN C; CHAIN: A; 


HU ANTIGEN C; CHAIN: A; 


HU ANTIGEN C; CHAIN: A; 


HU ANTIGEN C; CHAIN: A; 


SEQFOLD score
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PDB annotation 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 


Compound


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G;


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


SEQFOLD score












102.60 




PMF 
score 
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1.00 


Verify 
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PDB annotation 




COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 






DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 


SEQFOLD score














100.49 


PMF 
score 




0.05 
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0.29 


0.54 
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0.99 




Verify 
score 




0.55 


-0.73 


0.24 


0.28 


0.07 
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PDtt annotation 


REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 




Compound




YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-


SEQFOLD score














PMF 
score 




1.00


1.00 
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LOO 


1.00


Verify 
score 




0.13 
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PDB annotation 


GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 




COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 




TRANSCRIPTION REGULATION
PROTO-ONCOGENE, NUCLEAR 
BODIES (PODS), LEUKEMIA, 2 
TRANSCRIPTION REGULATION 






LIGASE CBL, UBCH7, ZAP-70, E2, 
UBIQUITIN, E3, 

PHOSPHORYLATION, 2 TYROSINE 
KINASE, UBIQUITINATION, 
PROTEIN DEGRADATION, 


LIGASE CBL, UBCH7, ZAP-70, E2,


Compound




ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 






VIRUS EQUINE HERPES VIRUS-
1 (C3HC4, OR RING DOMAIN)
1CHC 3 (NMR, 1 STRUCTURE)
1CHC 4


VIRUS EQUINE HERPES VIRUS- 
1 (C3HC4, OR RING DOMAIN) 
1CHC 3 (NMR, 1 STRUCTURE)
1CHC 4


SIGNAL TRANSDUCTION 
PROTEIN CBL; CHAIN: A; ZAP- 
70 PEPTIDE; CHAIN: B; 
UBIQUITIN-CONJUGATING 
ENZYME E2-18 KDA UBCH7;
CHAIN: C; 


SIGNAL TRANSDUCTION


SEQFOLD score




88.77 


















PMF 
score 






0.84 
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0.39 


0.41 
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PBB annotation 


UBIQUITIN, E3,

PHOSPHORYLATION, 2 TYROSINE 
KINASE, UBIQUITINATION,
PROTEIN DEGRADATION, 


ZINC-BINDING PROTEIN ZINC- 
BINDING PROTEIN, XNF7, BBOX, 
DEVELOPMENT, 3 MID-BLASTULA-
TRANSITION


METAL BINDING PROTEIN RING 
FINGER PROTEIN MAT1; RING
FINGER (C3HC4)


CONTRACTILE PROTEIN TRIPLE-
HELIX COILED COIL, 
CONTRACTILE PROTEIN 


DNA-BINDING PROTEIN V(D)J
RECOMBINATION ACTIVATING
PROTEIN 1; RAG1, V(D)J
RECOMBINATION, ANTIBODY,
MAD, RING FINGER, 2 ZINC
BINUCLEAR CLUSTER, ZINC
FINGER, DNA-BINDING PROTEIN


DNA-BINDING PROTEIN V(D)J
RECOMBINATION ACTIVATING
PROTEIN 1; RAG1, V(D)J
RECOMBINATION, ANTIBODY, 
MAD, RING FINGER, 2 ZINC 
BINUCLEAR CLUSTER, ZINC 
FINGER, DNA-BINDING PROTEIN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS.


Compound


PROTEIN CBL; CHAIN: A; ZAP- 
70 PEPTIDE; CHAIN: B; 
UBIQUITIN-CONJUGATING 
ENZYME E2-18 KDA UBCH7;
CHAIN: C; 


NUCLEAR FACTOR XNF7; 
CHAIN: NULL; 


CDK-ACTIVATING KINASE 
ASSEMBLY FACTOR MAT1;
CHAIN: A; 


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 




RAG1; CHAIN: NULL;


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


SEQFOLD score








54.69 








PMF 
score 




0.36 


0.74 
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0.01 


0.19 


Verify 
score 
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PD B annotation 






SIGNAL TRANSDUCTION SRC 
HOMOLOGY DOMAIN; SIGNAL 
TRANSDUCTION, SH3 DOMAIN, 
EPS8, PROLINE RICH PEPTIDE 


TRANSFERASE IL-2-INDUCIBLE T-
CELL KINASE; TRANSFERASE,
REGULATORY INTRAMOLECULAR 
COMPLEX, KINASE 


KINASE KINASE, SH3 DOMAIN, 
TRANSFERASE, 

PHOSPHOTRANSFERASE, 2 PROTO-
ONCOGENE, MULTIPLE DOMAIN,
LEUKEMIA 


COMPLEX 

(TRANSFERASE/PEPTIDE) 
COMPLEX 

(TRANSFERASE/PEPTIDE), SIGNAL 
TRANSDUCTION, 2 SH3 DOMAIN


TRANSFERASE TYROSINE-PROTEIN
KINASE, TRANSFERASE, SIGNAL 
TRANSDUCTION. 2 SH3 


SIGNALING 

PROTEIN/TRANSFERASE NAK; 
COMPLEX, SIGNAL 
TRANSDUCTION, 
PHOSPHOTYROSINE BINDING 2 
DOMAIN (PTB), ASYMMETRIC CELL 
DIVISION 


COMPLEX (SH3 DOMAIN/VIRAL 
ENHANCER) SRC-HOMOLOGY 3 


Compound


(TFIIS, NUCLEIC-ACID 1TFI 3
BINDING DOMAIN) (NMR, 12
STRUCTURES) 1TFI 4




EPS 8; CHAIN: A, B; 


ITK; CHAIN: NULL; 


ABL TYROSINE KINASE; 
CHAIN: NULL; 


ABL TYROSINE KINASE; 
CHAIN: A, C, E, G; PEPTIDE P41;
CHAIN: B, D, F, H; 


HEMOPOIETIC CELL KINASE; 
CHAIN: A, B, C, D, E, F; 


NUMB PROTEIN; CHAIN: A; 
NUMB ASSOCIATE KINASE; 
CHAIN: B; 


FYN TYROSINE KINASE; 
CHAIN: A, C; HIV-1 NEF 


SEQFOLD score




















PMF 
score 






0.01 


0.82 


0.92 


0.84 


1.00


0.63 


1.00 


Verify
score 
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PDB annotation 




COMPLEX 

(TRANSFERASE/PEPTIDE) SRC, SH3 
DOMAIN, LIGANDS, NON-PEPTIDE
ELEMENTS, 2 COMPLEX
(TRANSFERASE/PEPTIDE)


CIRCULAR PERMUTANT P WT; 
CIRCULAR PERMUTANT, SH3 
DOMAIN, CYTOSKELETON


TYROSINE KINASE TYROSINE 
KINASE-INHIBITOR COMPLEX, 
DOWN-REGULATED KINASE, 2
ORDERED ACTIVATION LOOP 


CYTOSKELETON CYTOSKELETON, 
MEMBRANE, SH3 DOMAIN


TYROSINE-PROTEIN KINASE 
BRUTONS TYROSINE KINASE, B 
CELL PROGENITOR KINASE, 
TRANSFERASE, TYROSINE- 
PROTEIN KINASE, 
PHOSPHORYLATION, 2 SH3 
DOMAIN 




CELL CYCLE/GENE REGULATION 
COMPLEX, SIGNAL 
TRANSDUCTION, 
PHOSPHOTYROSINE BINDING 2 
DOMAIN (PTB), ASYMMETRIC CELL
DIVISION, CELL CYCLE/GENE 3 
REGULATION 


TRANSFERASE HCK; SH3, PROTEIN


Compound


1LCK 14 CHAIN: B; 1LCK 15


C-SRC; CHAIN: C; NL1 (MN7-
MN2-MN1-PLPPLP); CHAIN: N;


ALPHA SPECTRIN; CHAIN: 
NULL; 


HAEMATOPOETIC CELL 
KINASE (HCK); CHAIN: A; 


ALPHA II SPECTRIN; CHAIN: A; 


TYROSINE-PROTEIN KINASE
BTK; CHAIN: A;


PHOSPHOTRANSFERASE FYN 
PROTO-ONCOGENE TYROSINE 
KINASE (E.C.2.7.1.112) 1SHF 3
(SH3 DOMAIN) 1SHF 4


NUMB PROTEIN; CHAIN: A; 
GPPY PEPTIDE; CHAIN: B; 


HEMATOPOIETIC CELL
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PDB annotation 






COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA

INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ADR1, ZINC FINGER, NMR




COMPLEX (TRANSCRIPTION 
REGULATION/DNA) GABP ALPHA;
GABPBETA1; COMPLEX
(TRANSCRIPTION
REGULATION/DNA), DNA-BINDING,
2 NUCLEAR PROTEIN, ETS DOMAIN, 
ANKYRIN REPEATS, 


Compound




DNA-BINDING PROTEIN 
HUMAN ENHANCER-BINDING 
PROTEIN MBP-1 MUTANT 
WITH CYS 11 1BBO 3
REPLACED BY ABU (C11ABU)
(NMR, 60 STRUCTURES) 1BBO


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 




DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


ADR1; CHAIN: NULL;




GA BINDING PROTEIN ALPHA; 
CHAIN: A; GA BINDING 
PROTEIN BETA 1; CHAIN: B; 
DNA; CHAIN: D, E; 
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D score 
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sill 

e o, u K 
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a t 
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2 <£S 



ANK-REPEAT MYOTROPHIN, 
ACETYLATION, NMR. ANK-REPEAT 


SIGNALING PROTEIN RUB1,
UBIQUITIN-LIKE PROTEIN, 
ARABIDOPSIS, SIGNALING 


DE NOVO PROTEIN PROTEIN 
DESIGN, HYDROPHOBIC CORE, 
PACKING, ROTAMERS, ROC, 2 
UBIQUITIN, DE NOVO PROTEIN, 
UBIQUITIN


SIGNALING PROTEIN NEDD8; 
NEDD8, NEDD-8, UBIQUITIN-LIKE, 
PROTEOLYSIS, SIGNALING 2 




Compound


6 INHIBITOR; CHAIN: A; 


CYCLIN-DEPENDENT KINASE 
4 INHIBITOR B; CHAIN: A;


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A, B; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A, B; 


MYOTROPHIN; CHAIN: NULL 


UBIQUITIN-LIKE PROTEIN 7, 
RUB1; CHAIN: A;


1D8 UBIQUITIN; CHAIN: A; 


UBIQUITIN-LIKE PROTEIN 
NEDD8; CHAIN: A, B, C, D; 


UBIQUITIN TETRAUBIQUITIN 
1TBE 3
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D score 
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LOADED, CALCIUM-BINDING 
PROTEIN 


CALCIUM-BINDING CNTNC; 
CALCIUM-BINDING, REGULATION, 
TROPONIN C, CARDIAC MUSCLE 2
CONTRACTION 


HYDROLASE CALCINEURIN; 
HYDROLASE, PHOSPHATASE, 
IMMUNOSUPPRESSION 


HYDROLASE CALCINEURIN;
HYDROLASE, PHOSPHATASE,
IMMUNOSUPPRESSION


CALCIUM-BINDING CALCIUM- 
BINDING, MYRISTOYLATION, 
NEURONAL SPECIFIC GUANYLATE 
2 CYCLASE ACTIVATOR


CALCIUM-BINDING CALCIUM- 
BINDING, MYRISTOYLATION, 
NEURONAL SPECIFIC GUANYLATE 
2 CYCLASE ACTIVATOR


CALCIUM-BINDING CALCIUM- 
BINDING, MYRISTOYLATION, 
NEURONAL SPECIFIC GUANYLATE 
2 CYCLASE ACTIVATOR






Compound




CARDIAC N-TROPONIN C; 
CHAIN: NULL; 


SERINE/THREONINE 
PHOSPHATASE 2B; CHAIN: A, 
B; 


SERINE/THREONINE 
PHOSPHATASE 2B; CHAIN: A, 
B; 


NEUROCALCIN DELTA; 
CHAIN: A, B; 


NEUROCALCIN DELTA; 
CHAIN: A, B; 


NEUROCALCIN DELTA; 
CHAIN: A, B; 


CALCIUM-BINDING PROTEIN 
CALMODULIN COMPLEXED 
WITH CALMODULIN-BINDING 
DOMAIN OF 1CDM 3
CALMODULIN-DEPENDENT
PROTEIN KINASE II 1CDM 4


CALCIUM-BINDING PROTEIN 
CALMODULIN (VERTEBRATE) 
1CLL 3

CALCIUM-BINDING PROTEIN
D score
CALCIUM-BINDING PROTEIN
CALMODULIN APO TR2C-DOMAIN;
1CMF 9


CALCIUM-BINDING PROTEIN 
CALMODULIN APO TR2C-DOMAIN; 
1CMF 9


BLOOD CLOTTING HELICAL, EF- 
HAND, BLOOD CLOTTING 


STRUCTURAL PROTEIN HELIX- 
TURN-HELIX


STRUCTURAL PROTEIN HELIX- 
TURN-HELIX 


METAL TRANSPORT CALMODULIN, 
HIGH RESOLUTION, DISORDER 


TRANSPORT PROTEIN CALCIUM
BINDING, EF HAND, FOUR-HELIX
BUNDLE


TRANSPORT PROTEIN CALCIUM 
BINDING, EF HAND, FOUR-HELIX 
BUNDLE 


CONTRACTILE PROTEIN TROPONIN 
C-TROPONIN I INTERACTION, 
CARDIAC, MUSCLE PROTEIN, 2 
CALCIUM BINDING PROTEIN 


CONTRACTILE PROTEIN TROPONIN 
C-TROPONIN I INTERACTION, 
CARDIAC, MUSCLE PROTEIN, 2 
CALCIUM BINDING PROTEIN 


METAL BINDING PROTEIN YEAST 
FREQUENIN EF-HAND, CALCIUM 


LYASE GCAP-2; EF-HAND, 
CALCIUM-BINDING PROTEIN, 


Compound


CALMODULIN (VERTEBRATE) 
1CLL 3 


CALMODULIN (VERTEBRATE);
1CMF 6 CHAIN: NULL; 1CMF 7


CALMODULIN (VERTEBRATE);
1CMF 6 CHAIN: NULL; 1CMF 7


APO CIB; CHAIN: A


CARDIAC TROPONIN C; 
CHAIN: A; 


CARDIAC TROPONIN C;
CHAIN: A;


CALMODULIN; CHAIN: A; 


CALMODULIN; CHAIN: A; 


CALMODULIN; CHAIN: A; 


TROPONIN C; CHAIN: A; 


TROPONIN C; CHAIN: A; 


CALCIUM-BINDING PROTEIN 
NCS-1; CHAIN: A;


GUANYLATE CYCLASE 
ACTIVATING PROTEIN 2; 


SEQFOL 
D score
GUANYLYL CYCLASE 2
REGULATION


CALCIUM-REGULATED MUSCLE 
CONTRACTION MUSCLE 
CONTRACTION, CALCIUM- 
BINDING, TROPONIN, E-F HAND, 2 
OPEN CONFORMATION 
REGULATORY DOMAIN, CALCIUM- 
REGULATED 3 MUSCLE 


CALCIUM-REGULATED MUSCLE 
CONTRACTION MUSCLE 
CONTRACTION, CALCIUM- 
BINDING, TROPONIN, E-F HAND, 2 
OPEN CONFORMATION 
REGULATORY DOMAIN, CALCIUM- 
REGULATED 3 MUSCLE 


CALCIUM-BINDING PROTEIN EF- 

HAND 1TNY 14 


CALCIUM-BINDING PROTEIN EF-
HAND 1TNY 14










Compound


CHAIN: A;


TROPONIN C; CHAIN: NULL; 


TROPONIN C; CHAIN: NULL;




TROPONIN C; 1TNX 4 CHAIN:
NULL; 1TNX 5


TROPONIN C; 1TNX 4 CHAIN:
NULL; 1TNX 5


CONTRACTILE SYSTEM 
PROTEIN TROPONIN C 1TOP 3


CONTRACTILE SYSTEM 
PROTEIN TROPONIN C 1TOP 3


CALCIUM BINDING PROTEIN 
CALMODULIN (/TR=2=C$
FRAGMENT COMPRISING
RESIDUES 78 - 148 1TRC 3 OF
THE INTACT MOLECULE) 1TRC
4


CALCIUM BINDING PROTEIN 
CALMODULIN (/TR=2=C$


SEQFOL 
D score
CALMODULIN, CALCIUM BINDING,
HELIX-LOOP-HELIX, SIGNALLING, 2 
COMPLEX(CALCIUM-BINDING 
PROTEIN/PEPTIDE)


CALMODULIN, CALCIUM BINDING, 
HELIX-LOOP-HELIX, SIGNALLING, 2 
COMPLEX(CALCIUM-BINDING 
PROTEIN/PEPTIDE) 


CALMODULIN, CALCIUM BINDING, 
HELIX-LOOP-HELIX, SIGNALLING, 2 
COMPLEX(CALCIUM-BINDING 
PROTEIN/PEPTIDE) 


MUSCLE PROTEIN MYOSIN, 
CALCIUM BINDING PROTEIN, 
MUSCLE PROTEIN 


CALCIUM-BINDING PROTEIN CTNC; 
CARDIAC, MUSCLE, REGULATORY, 
CALCIUM-BINDING PROTEIN 


CALCIUM-BINDING PROTEIN CTNC; 
CARDIAC, MUSCLE, REGULATORY, 
CALCIUM-BINDING PROTEIN 




COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


Compound


FRAGMENT COMPRISING 
RESIDUES 78 - 148 1TRC 3 OF
THE INTACT MOLECULE) 1TRC


CALMODULIN; CHAIN: A; 
RS20; CHAIN: B; 


CALMODULIN; CHAIN: A; 
RS20; CHAIN: B; 

... 


CALMODULIN; CHAIN: A; 
RS20; CHAIN: B; 


SCALLOP MYOSIN; CHAIN: A, 
B,C; 


TROPONIN C; CHAIN: NULL; 


TROPONIN C; CHAIN: NULL; 




QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C;


QGSR ZINC FINGER PEPTIDE;
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


SEQFOL 
D score 






56.76
PDB annotation


ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2
CRYSTAL STRUCTURE, COMPLEX 


Compound


OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C;


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER
PROTEIN; CHAIN: C, F, G;


SEQFOL 
D score 


















PMF 
score 




1.00
0.36
PDB annotation


CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) TFIIIA; 5S 
GENE; NMR, TFIIIA, PROTEIN, DNA, 
TRANSCRIPTION FACTOR, 5S RNA 2 
GENE, DNA BINDING PROTEIN, 
ZINC FINGER, COMPLEX 3 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN 


Compound




DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


TRANSCRIPTION FACTOR IIIA;
CHAIN: A; 5S RNA GENE; 
CHAIN: E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F;


TFIIIA; CHAIN: A, D; 5S
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


SEQFOL 
D score 












114.65 


PMF
Verify
PDB annotation


REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION,


Compound


ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA;
CHAIN: A, B;


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA;
CHAIN: A, B;


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA;
CHAIN: A, B;


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA;


SEQFOL 
D score 












PMF
Verify
PDB annotation




COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING
PROTEIN/DNA) FIVE-FINGER GLI;
GLI, ZINC FINGER, COMPLEX (DNA-
BINDING PROTEIN/DNA)


COMPLEX (DNA-BINDING
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 
COMPLEX (DNA-BINDING 


Compound


1 DNA 2DRP 4 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;

ZINC FINGER PROTEIN GLI1;


SEQFOL 
D score 




100.58 
















PMF 
score
PDB annotation


BINDING PROTEIN 
NK CELL NK CELL, RECEPTOR, C- 
TYPE LECTIN, C-TYPE LECTIN- 
LIKE, NKD


NK CELL NK CELL, RECEPTOR, C- 
TYPE LECTIN, C-TYPE LECTIN- 
LIKE, NKD


COLLAGEN BINDING PROTEIN IX- 
BP; IX-BP; COAGULATION FACTOR 
IX-BINDING, HETERODIMER, 
VENOM, HABU 2 SNAKE, C-TYPE 
LECTIN SUPERFAMILY, COLLAGEN 

BINDING PROTEIN


COLLAGEN BINDING PROTEIN IX-
BP; IX-BP; COAGULATION FACTOR
IX-BINDING, HETERODIMER,
VENOM, HABU 2 SNAKE, C-TYPE
LECTIN SUPERFAMILY, COLLAGEN
BINDING PROTEIN


MEMBRANE PROTEIN C-TYPE 
LECTIN-LIKE DOMAINS 


MEMBRANE PROTEIN SP-A; SP- 
A:PHOSPHOLHTD MOLOLAYER 
COMPT 




Compound


CD94; CHAIN: NULL; 


CD94; CHAIN: NULL; 


COAGULATION FACTOR IX- 
BINDING PROTEIN A; CHAIN: 
A; COAGULATION FACTOR IX- 
BINDING PROTEIN B; CHAIN: 
B; 


COAGULATION FACTOR IX-
BINDING PROTEIN A; CHAIN:
A; COAGULATION FACTOR IX-
BINDING PROTEIN B; CHAIN:
B;


FLAVOCETIN-A: ALPHA 
SUBUNIT; CHAIN: A; 
FLAVOCETIN-A: BETA 
SUBUNIT; CHAIN: B 


SURFACTANT PROTEIN A; 
CHAIN: A; 


ASIALOGLYCOPROTEIN 
RECEPTOR 1; CHAIN: A;


ASIALOGLYCOPROTEIN 
RECEPTOR 1; CHAIN: A;


EARLY ACTIVATION ANTIGEN 
CD69; CHAIN: A; 


SEQFOL 
D score 


73.97 




68.01
score
PDB annotation
LECTIN-LIKE, 2 NKD, KLR


SUGAR BINDING PROTEIN C-TYPE 
LECTIN, MANNOSE RECEPTOR


COAGULATION FACTOR BINDING
IX/X-BP COAGULATION FACTOR 
BINDING, C-TYPE LECTIN, GLA- 
DOMAIN 2 BINDING, C-TYPE CRD 
MOTIF, LOOP EXCHANGED DIMER 


COAGULATION FACTOR BINDING
IX/X-BP COAGULATION FACTOR 
BINDING, C-TYPE LECTIN, GLA- 
DOMAIN 2 BINDING, C-TYPE CRD 
MOTIF, LOOP EXCHANGED DIMER 


COAGULATION FACTOR BINDING
IX/X-BP COAGULATION FACTOR 
BINDING, C-TYPE LECTIN, GLA- 
DOMAIN 2 BINDING, C-TYPE CRD 
MOTIF, LOOP EXCHANGED DIMER 


COAGULATION FACTOR BINDING
IX/X-BP COAGULATION FACTOR
BINDING, C-TYPE LECTIN, GLA- 
DOMAIN 2 BINDING, C-TYPE CRD 
MOTIF, LOOP EXCHANGED DIMER 




Compound


MACROPHAGE MANNOSE 
RECEPTOR; CHAIN: A, B; 


MANNOSE-BINDING PROTEIN; 
1HUP 4 CHAIN: NULL; 1HUP 5


MANNOSE-BINDING PROTEIN; 
1HUP 4 CHAIN: NULL; 1HUP 5


COAGULATION FACTORS 
IX/X-BINDING PROTEIN; 
CHAIN: A, B, C, D, E, F; 


COAGULATION FACTORS 
IX/X-BINDING PROTEIN; 
CHAIN: A, B, C, D, E, F;


COAGULATION FACTORS 
IX/X-BINDING PROTEIN; 
CHAIN: A, B, C, D, E, F; 


COAGULATION FACTORS 
IX/X-BINDING PROTEIN; 
CHAIN: A, B, C, D, E, F; 


LITHOSTATHINE; CHAIN: 
NULL 


LITHOSTATHINE; CHAIN: 
NULL 

LITHOSTATHINE; CHAIN: A; 


SEQFOL 
D score 




63.43 




65.74 




61.53 






67.30
0.95
PDB annotation


ANTIFREEZE PROTEIN 
RECOMBINANT SEA RAVEN 
PROTEIN, SOLUTION BACKBONE 
FOLD, C- 2 TYPE LECTIN, 
ANTIFREEZE PROTEIN 






HYDROLASE CALCINEURIN; 
HYDROLASE, PHOSPHATASE, 
IMMUNOSUPPRESSION 






CALCIUM-BINDING PROTEIN
CALMODULIN APO TR2C-DOMAIN;
1CMF 9

METAL TRANSPORT CALMODULIN, 
HIGH RESOLUTION, DISORDER 


Compound


SEA RAVEN TYPE II 
ANTIFREEZE PROTEIN; CHAIN: 
A; 


LECTIN MANNOSE-BINDING 
PROTEIN A (LECTIN DOMAIN) 
COMPLEX WITH 2MSB 3 
CALCIUM AND 
GLYCOPEPTIDE 2MSB 4 


LECTIN MANNOSE-BINDING 
PROTEIN A (LECTIN DOMAIN) 
COMPLEX WITH 2MSB 3 
CALCIUM AND 
GLYCOPEPTIDE 2MSB 4 


SERINE/THREONINE 
PHOSPHATASE 2B; CHAIN: A, 
B; 


CALCIUM-BINDING PROTEIN 
CALMODULIN COMPLEXED 
WITH CALMODULIN-BINDING 
DOMAIN OF 1CDM 3
CALMODULIN-DEPENDENT
PROTEIN KINASE II 1CDM 4


CALCIUM-BINDING PROTEIN 
CALMODULIN (VERTEBRATE) 
1CLL 3


CALMODULIN (VERTEBRATE); 
1CMF 6 CHAIN: NULL; 1CMF 7

CALMODULIN; CHAIN: A; 


SEQFOL 
D score 




61.86 












PMF 
score 


0.41 




0.54 


0.87 


0.36
PDB annotation


TRANSPORT PROTEIN CALCIUM
BINDING, EF HAND, FOUR-HELIX 


CALCIUM-REGULATED MUSCLE 
CONTRACTION MUSCLE 
CONTRACTION, CALCIUM- 
BINDING, TROPONIN, E-F HAND, 2 
OPEN CONFORMATION 
REGULATORY DOMAIN, CALCIUM- 
REGULATED 3 MUSCLE 






CALMODULIN, CALCIUM BINDING, 
HELIX-LOOP-HELIX, SIGNALLING, 2 
COMPLEX(CALCIUM-BINDING
PROTEIN/PEPTIDE)


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 

STMT? "MP PTRfYMTT^T TJOBB rvrcixT 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 




Compound


CALMODULIN; CHAIN: A; 


TROPONIN C; CHAIN: NULL; 


CONTRACTILE SYSTEM 
PROTEIN TROPONIN C 1TOP 3


CALCIUM BINDING PROTEIN 
CALMODULIN (/TR=2=C$
FRAGMENT COMPRISING
RESIDUES 78 - 148 1TRC 3 OF
THE INTACT MOLECULE) 1TRC
4


CALMODULIN; CHAIN: A; 
RS20; CHAIN: B; 


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D;


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 


INTERNALIN B; CHAIN: A;


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL,
CONTRACTILE PROTEIN 




HYDROLASE TARTRATE- 
RESISTANT ACID PHOSPHATASE; 
METAL PHOSPHATASE, 
HYDROLASE


HYDROLASE TARTRATE- 
RESISTANT ACID PHOSPHATASE;
METAL PHOSPHATASE,
HYDROLASE


HYDROLASE UTEROFERRIN, TRAP, 
PAP, TARTRATE RESISTANT ACID 
PURPLE ACID PHOSPHATASE, 
TARTRATE RESISTANT ACID 2 
PHOSPHATASE, METALLOENZYME, 


u i C/ivvjrcivtvilN, n X UKUl-rAbli 

HYDROLASE (PHOSPHORIC 
MONOESTER) PURPLE ACID 
PHOSPHATASE, HYDROLASE 
(PHOSPHORIC MONOESTER)


Compound


DNAK; CHAIN: D; 


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A;
CHAIN: B; 


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


LIGASE(SYNTHETASE) SERYL- 
TRNA SYNTHETASE 
(E.C.6.1.1.11) (SERINE-TRNA
LIGASE) 1SES 3 COMPLEXED
WITH SERYL-HYDROXAMATE-
AMP 1SES 4


PURPLE ACID PHOSPHATASE; 
CHAIN: A; 


PURPLE ACID PHOSPHATASE; 
CHAIN: A; 


PURPLE ACID
PHOSPHATASE; CHAIN: A; 


PURPLE ACID PHOSPHATASE; 
CHAIN: A, B, C, D; 


SEQFOL 
D score
RNA-BINDING PROTEIN/RNA TRA
PRE-MRNA; SPLICING 
REGULATION, RNP DOMAIN, RNA 
COMPLEX 


£j 1 IvUU 1 uivr\J-» riw i * a ^ 

C23; RNP, RBD, RRM, RNA BINDING 
DOMAIN, NUCLEOLUS 




PROTEIN; RNA BINDING DOMAIN, 
NUCLEAR PROTEIN 


PROTEIN; RNA BINDING DOMAIN, 
NUCLEAR PROTEIN 


BINDING DOMAIN, RBD, RNA 
RECOGNITION MOTIF, RRM, 2 
SPLICING INHIBITOR, 
TRANSLATIONAL INHIBITOR, SEX 3 
DETERMINATION, X CHROMOSOME 


Compound




NUCLEOLIN RBD2; CHAIN: A; 


MIGRATION INHIBITORY 
FACTOR-RELATED PROTEIN 8; 
CHAIN: A, B; 


RNA-BINDING PROTEIN SEX-
LETHAL PROTEIN (C-
TERMINUS, OR SECOND RNA-
BINDING DOMAIN 1SXL 3
(RBD-2), RESIDUES 199 - 294
PLUS N-TERMINAL MET) 1SXL
4 (NMR, 17 STRUCTURES) 1SXL
5


U1 SMALL NUCLEAR
RIBONUCLEOPROTEIN A; 
CHAIN: NULL; 


U1 SMALL NUCLEAR
RIBONUCLEOPROTEIN A; 
CHAIN: NULL; 


SEX-LETHAL; CHAIN: A, B, C;


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO
REPEATS OF SPECTRIN, ALPHA
HELICAL LINKER REGION, 2 2
TANDEM 3-HELIX COILED-COILS,
STRUCTURAL PROTEIN


HELIX COILED COIL,
CONTRACTILE PROTEIN 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA;
HYDROLASE, SERINE
PROTEINASE), PLASMA CALCIUM
BINDING, 2 GLYCOPROTEIN,
COMPLEX (BLOOD
COAGULATION/INHIBITOR)


ANTI-COAGULANT ANTI-
COAGULANT, PEPTIDIC 
INHIBITORS, CONFORMATIONAL 2 
FLEXIBILITY, SERINE PROTEASE 
INHIBITOR 


HYDROLASE INHIBITOR ALL-BETA
STRUCTURE, HYDROLASE 
INHIBITOR 


HYDROLASE INHIBITOR ALL-BETA 
STRUCTURE, HYDROLASE 
INHIBITOR 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND)


APOPTOSIS TRAIL, DR5, COMPLEX 


APOPTOSIS TRAIL, DR5, COMPLEX 


Compound


ALPHA-ACTININ 2; CHAIN: A; 


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO-MAI; 
CHAIN: P; 


HIRUSTASIN; CHAIN: NULL; 


BOWMAN-BIRK TRYPSIN 
INHIBITOR; CHAIN: A 


BOWMAN-BIRK TRYPSIN 
INHIBITOR; CHAIN: A 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C;


DEATH RECEPTOR 5; CHAIN: 
A, B, C, G, H, I; TNF-RELATED 
APOPTOSIS INDUCING 
LIGAND; CHAIN: D, E, F, J, K, L;


DEATH RECEPTOR 5; CHAIN:
A, B, C, G, H, I; TNF-RELATED 


SEQFOL 
D score


SERINE PROTEINASE
COAGULATION FACTOR II;
COAGULATION FACTOR II;
FETOMODULIN, TM, CD141
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE DOMAINS, 
ANTICOAGULANT COMPLEX, 2 
ANTIFIBRINOLYTIC COMPLEX 


PLANT PROTEIN TWO
HOMOLOGOUS HEVEIN-LIKE 
DOMAINS 


PLANT PROTEIN TWO
HOMOLOGOUS HEVEIN-LIKE
DOMAINS


Compound


APOPTOSIS INDUCING 
LIGAND; CHAIN: D, E, F, J, K, L; 


DEATH RECEPTOR 5; CHAIN:
A, B, C, G, H, I; TNF-RELATED
APOPTOSIS INDUCING
LIGAND; CHAIN: D, E, F, J, K, L;


THROMBIN LIGHT CHAIN;
CHAIN: A, B, C, D; THROMBIN
HEAVY CHAIN; CHAIN: M, N,
O, P; THROMBOMODULIN;
CHAIN: I, J, K, L; THROMBIN 
INHIBITOR L-GLU-L-GLY-L- 
ARM; CHAIN: E, F, G, H; 


AGGLUTININ ISOLECTIN VI; 
CHAIN: A 


AGGLUTININ ISOLECTIN VI; 
CHAIN: A 


AGGLUTININ ISOLECTIN VI; 
CHAIN: A 


AGGLUTININ ISOLECTIN 
VI/AGGLUTININ ISOLECTIN V; 
CHAIN: A; 


AGGLUTININ ISOLECTIN 
VI/AGGLUTININ ISOLECTIN V; 
CHAIN: A; 


AGGLUTININ ISOLECTIN 
VI/AGGLUTININ ISOLECTIN V; 
CHAIN: A; 


AGGLUTININ ISOLECTIN
VI/AGGLUTININ ISOLECTIN V; 


SEQFOL 
D score


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA, 
SUPERANTIGEN, SACCHARIDE 
BINDING 


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA, 
SUPERANTIGEN, SACCHARIDE 
BINDING 


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA,
SUPERANTIGEN, SACCHARIDE 
BINDING 


SUGAR BINDING PROTEIN UDA;
LECTIN, HEVEIN DOMAIN, UDA, 
SUPERANTIGEN, SACCHARIDE 
BINDING 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, SIGNALLING 
PROTEIN 


SIGNALLING PROTEIN BINDING
PROTEIN, CYTOKINE, SIGNALLING 
PROTEIN 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, SIGNALLING 
PROTEIN 


HORMONE RECEPTOR HORMONE 
RECEPTOR, INSULIN RECEPTOR 
FAMILY 


GLYCOPROTEIN GLYCOPROTEIN 
GLYCOPROTEIN GLYCOPROTEIN 


GLYCOPROTEIN GLYCOPROTEIN 
SIGNALLING PROTEIN TYPE I 
RECEPTOR, STNFR1; 1NCF 8


Compound


CHAIN: A;


AGGLUTININ ISOLECTIN 
I/AGGLUTININ ISOLECTIN V/ 
CHAIN: A; 


AGGLUTININ ISOLECTIN 
I/AGGLUTININ ISOLECTIN V/ 
CHAIN: A; 


AGGLUTININ ISOLECTIN 
I/AGGLUTININ ISOLECTIN V/
CHAIN: A; 


AGGLUTININ ISOLECTIN 
I/AGGLUTININ ISOLECTIN V/ 
CHAIN: A; 


TUMOR NECROSIS FACTOR 
RECEPTOR; CHAIN: A, B; 


TUMOR NECROSIS FACTOR 
RECEPTOR; CHAIN: A, B; 


TUMOR NECROSIS FACTOR 
RECEPTOR; CHAIN: A, B; 


INSULIN-LIKE GROWTH 
FACTOR RECEPTOR I; CHAIN: 
A; 




«i 


NULL: 
NULL; 


A w * 


LAMININ; CHAIN: 
LAMININ; CHAIN: 


LAMININ; CHAIN: 
TUMOR NECROSI 
RECEPTOR; INCF


SIGNALLING PROTEIN TYPE I
RECEPTOR, STNFR1; 1NCF 8
BINDING PROTEIN, CYTOKINE
1NCF 19


OSTEONECTIN, SPARC, SECRETED 
PROTEIN ACIDIC AND 
EXTRACELLULAR MODULE, 
GLYCOPROTEIN, ANTI-ADHESIVE 
PROTEIN, 2 COLLAGEN BINDING, 
SITE-DIRECTED MUTAGENESIS, 
GLYCOSYLATED 3 PROTEIN 
1 MODRES 


COMPLEX (BLOOD
COAGULATION/INHIBITOR)
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN 


COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN 


COMPLEX (BLOOD
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 


Compound


B; 1NCF 5


TUMOR NECROSIS FACTOR 
RECEPTOR; 1NCF 4 CHAIN: A,
B; 1NCF 5


BASEMENT MEMBRANE 
PROTEIN BM-40; CHAIN: A, B; 


FACTOR IXA; CHAIN: C, L; D-
PHE-PRO-ARG; CHAIN: I;


FACTOR IXA; CHAIN: C, L; D-
PHE-PRO-ARG; CHAIN: I;


FACTOR IXA; CHAIN: C, L; D-
PHE-PRO-ARG; CHAIN: I;
D score
PDB annotation
INHIBITOR, THROMBOSIS






ANTI-COAGULANT ANTI- 
COAGULANT, PEPTIDIC 
INHIBITORS, CONFORMATIONAL 2 
FLEXIBILITY, SERINE PROTEASE 
INHIBITOR


BLOOD COAGULATION FACTOR
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN 


Compound


LECTIN (AGGLUTININ) WHEAT 
GERM AGGLUTININ 
(ISOLECTIN 2) 9WGA 3


LECTIN (AGGLUTININ) WHEAT 
GERM AGGLUTININ 
(ISOLECTIN 2) 9WGA 3


HIRUSTASIN; CHAIN: NULL; 


DEATH RECEPTOR 5; CHAIN: 
A, B, C, G, H, I; TNF-RELATED 
APOPTOSIS INDUCING 
LIGAND; CHAIN: D, E, F, J, K, L; 


METALLOTHIONEIN CD-7 
METALLOTHIONEIN-2 (ALPHA 
DOMAIN) (/NMR$) IMHUA 2 


METALLOTHIONEIN CD-7 
METALLOTHIONEIN-2 (ALPHA 
DOMAIN) (/NMR$) IMHUA 2 


METALLOTHIONEIN CD-7 
METALLOTHIONEIN-2 (ALPHA 
DOMAIN) (/NMR$) IMRTA 2 


METALLOTHIONEIN CD-7 
METALLOTHIONEIN-2 (ALPHA 
DOMAIN) (/NMR$) IMRTA 2


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


SEQFOL
PDB annotation






CALCIUM-BINDING PROTEIN CALB; 
CALCIUM++/PHOSPHOLIPID 
BINDING PROTEIN, 2 CALCIUM- 




TRANSCRIPTION REGULATION
PROTO-ONCOGENE, NUCLEAR 
BODIES (PODS), LEUKEMIA, 2 




HYDROLASE CPLA2; 
PHOSPHOLIPASE, LIPID-BINDING


Compound


METALLOTHIONEIN 
METALLOTHIONEIN ISOFORM 
II 4MT2 3 


METALLOTHIONEIN 
METALLOTHIONEIN ISOFORM 
II 4MT2 3 


PROTEIN KINASE C (BETA); 
CHAIN: A, B; 


PROTEIN KINASE C (BETA); 
CHAIN: A, B; 


TRANSCRIPTION FACTOR
PML; CHAIN: NULL; 


SYNAPTOTAGMIN I; CHAIN: A;


SYNAPTOTAGMIN I; CHAIN: A; 


CYTOSOLIC PHOSPHOLIPASE 
A2; CHAIN: A, B; 


CYTOSOLIC PHOSPHOLIPASE 
A2; CHAIN: A, B;


SEQFOL
D score 




127.57
PDB annotation


PHOSPHOLIPID BINDING PROTEIN, 
CALCIUM-BINDING 2 PROTEIN, 
PHOSPHATIDYLSERINE, PROTEIN 


METAL BINDING PROTEIN RING 
FINGER PROTEIN MAT1; RING


HYDROLASE CALB DOMAIN; 
HYDROLASE, C2 DOMAIN, CALB 
DOMAIN


HYDROLASE CALB DOMAIN; 
HYDROLASE, C2 DOMAIN, CALB 

DOMAIN


TRANSPORT PROTEIN FYVE
DOMAIN, ENDOSOME


Compound


TYPE; CHAIN: A; 


CDK-ACTIVATING KINASE 
ASSEMBLY FACTOR MAT1;
CHAIN: A;


PHOSPHOLIPASE A2; CHAIN: 
NULL; 


PHOSPHOLIPASE A2; CHAIN: 
NULL; 


RAG1; CHAIN: NULL;


CALCIUM/PHOSPHOLIPID 
BINDING PROTEIN 
SYNAPTOTAGMIN I (FIRST C2 
DOMAIN) (CALB) 1RSY 3


CALCIUM/PHOSPHOLIPID
BINDING PROTEIN 
SYNAPTOTAGMIN I (FIRST C2 
DOMAIN) (CALB) 1RSY 3


CALCIUM/PHOSPHOLIPID 
BINDING PROTEIN 
SYNAPTOTAGMIN I (FIRST C2 
DOMAIN) (CALB) 1RSY 3


PHOSPHATIDYLINOSITOL-3- 
PHOSPHATE BINDING FYVE 


SEQFOL 
D score
PDB annotation


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA;
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND)


BLOOD COAGULATION, SERINE
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


HYDROLASE/HYDROLASE 
INHIBITOR PROTEIN-PEPTIDE 
COMPLEX 


Compound


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO-MAI; 
CHAIN: P; 


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO-MAI; 
CHAIN: P; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H;
SOLUBLE TISSUE FACTOR;
CHAIN: T, U; D-PHE-PHE-ARG-
CHLOROMETHYLKETONE
(DFFRCMK) WITH CHAIN: C;


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C;


DES-GLA FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H, I; 
DES-GLA FACTOR VIIA (LIGHT 
CHAIN); CHAIN: L, M; (DPN)-
PHE-ARG; CHAIN: C, D;
PEPTIDE E-76; CHAIN: X, Y;


SEQFOL 
D score 


58.73 






60.57 




PMF
Verify
score
PDB annotation


INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 

GLYCOPROTEIN


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM-
BINDING, HYDROLASE, 3
GLYCOPROTEIN


SERINE PROTEASE FVIIA; FVIIA;
BLOOD COAGULATION, SERINE 
PROTEASE 


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 
PROTEASE 


SERINE PROTEASE FVIIA; FVIIA;
BLOOD COAGULATION, SERINE 
PROTEASE 


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 
PROTEASE 


Compound




FACTOR IXA; CHAIN: C, L; D-
PHE-PRO-ARG; CHAIN: I; 


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 


SEQFOL 
D score 










55.94
score
PDB annotation




RNA-BINDING DOMAIN RNA- 
BINDING DOMAIN, ALTERNATIVE 
SPLICING 


RNA BINDING DOMAIN RNA 
BINDING DOMAIN, RBD, RNA 
RECOGNITION MOTIF, RRM, 2 
SPLICING INHIBITOR,
TRANSLATIONAL INHIBITOR, SEX 3
DETERMINATION, X CHROMOSOME 
DOSAGE COMPENSATION 


GENE REGULATION POZ DOMAIN; 
PROTEIN-PROTEIN INTERACTION 
DOMAIN, TRANSCRIPTIONAL 2 
REPRESSOR, ZINC-FINGER
PROTEIN, X-RAY 
CRYSTALLOGRAPHY, 3 PROTEIN 
STRUCTURE, PROMYELOCYTIC 
LEUKEMIA, GENE REGULATION








BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO-FACTOR






CHAIN: 


SEX-LETHAL; CHAIN: A, B, C; 



PROMYELOCYTIC LEUKEMIA 
ZINC FINGER PROTEIN PLZF; 
CHAIN: A; 


OXIDOREDUCTASE(OXYGEN( 
A)) GALACTOSE OXIDASE 
(E.C.1.1.3.9) (PH 4.5) 1GOF 3


OXIDOREDUCTASE(OXYGEN( 
A)) GALACTOSE OXIDASE 
(E.C.1.1.3.9) (PH 4.5) 1GOF 3


OXIDOREDUCTASE(OXYGEN( 

A)) GALACTOSE OXIDASE 

(E.C.1.1.3.9) (PH 4.5) 1GOF 3




Compound


MUSASHI1; CHAIN: A; 


SEX-LETHAL PROTEIN; 
NULL; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L,


PDB annotation


2 RECEPTOR ENZYME, INHIBITOR,
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


BLOOD COAGULATION, SERINE
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


BLOOD COAGULATION, SERINE
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 




HYDROLASE/HYDROLASE

INHIBITOR PROTEIN-PEPTIDE 
COMPLEX 


HYDROLASE/HYDROLASE
INHIBITOR PROTEIN-PEPTIDE 
COMPLEX 


Compound


SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C; 


DES-GLA FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H, I; 
DES-GLA FACTOR VIIA (LIGHT 
CHAIN); CHAIN: L, M; (DPN)- 
PHE-ARG; CHAIN: C, D;
PEPTIDE E-76; CHAIN: X, Y; 


DES-GLA FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H, I; 
DES-GLA FACTOR VIIA (LIGHT 
CHAIN); CHAIN: L, M; (DPN)- 
PHE-ARG; CHAIN: C, D; 
PEPTIDE E-76; CHAIN: X, Y; 


DES-GLA FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H, I;
DES-GLA FACTOR VIIA (LIGHT
CHAIN); CHAIN: L, M; (DPN)-
PHE-ARG; CHAIN: C, D; 
PEPTIDE E-76; CHAIN: X, Y; 


SEQFOL 
D score
PDB annotation


MATRIX PROTEIN
EXTRACELLULAR MATRIX,
CALCIUM-BINDING,
GLYCOPROTEIN, 2 REPEAT,
SIGNAL, MULTIGENE FAMILY,
DISEASE MUTATION, 3 EGF-LIKE
DOMAIN, HUMAN FIBRILLIN-1
FRAGMENT, MATRIX PROTEIN


MATRIX PROTEIN 
EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE 
DOMAIN, HUMAN FIBRILLIN- 1 
FRAGMENT, MATRIX PROTEIN


MATRIX PROTEIN 
EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE 
DOMAIN, HUMAN FIBRILLIN- 1 
FRAGMENT, MATRIX PROTEIN 


LIPID BINDING PROTEIN LDL 
RECEPTOR; BETA HAIRPIN, 3-10 
HELIX. CALCIUM BINDING 


LIPID BINDING PROTEIN LDL 
RECEPTOR; BETA HAIRPIN, 3-10 
HELIX. CALCIUM BINDING 


BLOOD CLOTTING 
COMPLEX(SERINE 
PROTEASE/COFACTOR/LIGAND), 
BLOOD COAGULATION, 2 SERINE 


Compound


FIBRILLIN; CHAIN: NULL; 


FIBRILLIN; CHAIN: NULL; 


FIBRILLIN; CHAIN: NULL; 


LOW-DENSITY LIPOPROTEIN 
RECEPTOR; CHAIN: A; 


LOW-DENSITY LIPOPROTEIN 
RECEPTOR; CHAIN: A; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L;
BLOOD COAGULATION 
FACTOR VIIA; CHAIN: H; 


SEQFOL 
D score 
PROTEASE, COMPLEX, CO-FACTOR,
RECEPTOR ENZYME, 3 INHIBITOR, 
GLA, EGF, COMPLEX (SERINE 4 
PROTEASE/COFACTOR/LIGAND), 
BLOOD CLOTTING 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 PLASMA, 
SERINE PROTEASE, CALCIUM- 
BINDING, HYDROLASE, 3 
GLYCOPROTEIN 


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 
PROTEASE 


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 
PROTEASE 


SERINE PROTEASE FVIIA; FVIIA;


Compound


SOLUBLE TISSUE FACTOR; 
CHAIN: T; 5L15; CHAIN: I;


FACTOR IXA; CHAIN: C, L,; D- 
PHE-PRO-ARG; CHAIN: I; 


FACTOR IXA; CHAIN: C, L,; D- 
PHE-PRO-ARG; CHAIN: I; 


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


COAGULATION FACTOR VIIA 
(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


COAGULATION FACTOR VIIA


SEQFOL 
D score 
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Verify 
PDB annotation


BLOOD COAGULATION, SERINE 
PROTEASE 




BLOOD COAGULATION FACTOR 
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN


BLOOD COAGULATION FACTOR 
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN






HYDROLASE HYDROLASE, NAD 
BINDING PROTEIN


OXIDOREDUCTASE 
OXIDOREDUCTASE, TROPANE 
ALKALOID BIOSYNTHESIS, 
REDUCTION OF 2 TROPINONE TO 
TROPINE, SHORT-CHAIN 
DEHYDROGENASE 


Compound


(LIGHT CHAIN); CHAIN: L; 
COAGULATION FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H; 
TRIPEPTIDYL INHIBITOR; 
CHAIN: C; 


MEMBRANE PROTEIN 
VITELLINE MEMBRANE 
OUTER LAYER PROTEIN I 
1VM0 3 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 


LECTIN (AGGLUTININ) WHEAT 
GERM AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


LECTIN (AGGLUTININ) WHEAT 
GERM AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


S-ADENOSYLHOMOCYSTEINE 
HYDROLASE; CHAIN: A, B;


TROPINONE REDUCTASE-I; 
CHAIN: A, B; 


SEQFOL 
D score 
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PDB annotation 


ALCOHOL DEHYDROGENASE IDEH
26 


OXIDOREDUCTASE BETA1 ADH;

IDEH 9 NAD+ DEPENDENT

ALCOHOL DEHYDROGENASE IDEH 
26 




OXIDOREDUCTASE 
DEHYDROGENASE, ALCOHOL, 
NICOTINAMIDE COENZYME, 
STEROID 7 RiwnrMr; 


1 


OXIDOREDUCTASE 
OXIDOREDUCTASE, ZINC, NADP 


OXIDOREDUCTASE 
OXIDOREDUCTASE, ZINC, NADP 


OXIDOREDUCTASE 
OXIDOREDUCTASE 






Compound


CHAIN: A, B; IDEH 8 


HUMAN BETA1 ALCOHOL
DEHYDROGENASE; IDEH 7 
CHAIN: A, B; IDEH 8 


ALCOHOL DEHYDROGENASE, 
CLASS II; CHAIN: A, B;


ALCOHOL DEHYDROGENASE; 
CHAIN: A, B; 


OXIDOREDUCTASE 3-ALPHA, 
20-BETA-HYDROXYSTEROID 
DEHYDROGENASE 
(E.C. 1.1.1.53) 1HDC 3
COMPLEXED WITH
CARBENOXOLONE 1HDC 4




NADP-DEPENDENT ALCOHOL 
DEHYDROGENASE; CHAIN: A, 
B, C, D; 


FLAVOCYTOCHROME C3 
FUMARATE REDUCTASE; 
CHAIN: A, D;


OXIDOREDUCTASE QUINONE 
OXIDOREDUCTASE 
COMPLEXED WITH NADPH 
1QOR3 


OXIDOREDUCTASE QUINONE 
OXIDOREDUCTASE 
COMPLEXED WITH NADPH 


SEQFOL 
D score 
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PDB annotation 


TYROSINE KINASE, SIGNAL 
TRANSDUCTION, 
PHOSPHOTRANSFERASE, 2 
COMPLEX (KINASE/PEPTIDE 
SUBSTRATE/ATP ANALOG), 
ENZYME, 3 COMPLEX 
(TRANSFERASE/SUBSTRATE)


TYROSINE KINASE TYROSINE 
KINASE-INHIBITOR COMPLEX, 
DOWN-REGULATED KINASE, 2 
ORDERED ACTIVATION LOOP 


TRANSFERASE KDR; TYROSINE 
KINASE




CONTRACTILE PROTEIN 
TROPOMYOSIN COILED-COIL 
ALPHA-HELICAL, CONTRACTILE
PROTEIN 


ISOMERASE ISOMERASE, MUTASE, 
INTRAMOLECULAR TRANSFERASE 


COMPLEX 

(TRANSDUCER/TRANSDUCTION)
GT BETA-GAMMA; MEKA, PP33; 
PHOSDUCIN, TRANSDUCIN, BETA- 
GAMMA, SIGNAL TRANSDUCTION, 
2 REGULATION, 

PHOSPHORYLATION, G PROTEINS, 
THIOREDOXIN, 3 VISION, MEKA,
COMPLEX 

(TRANSDUCER/TRANSDUCTION)




COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 


Compound




HAEMATOPOETIC CELL
KINASE (HCK); CHAIN: A; 


VASCULAR ENDOTHELIAL 
GROWTH FACTOR RECEPTOR 
CHAIN: A; 




TROPOMYOSIN; CHAIN: A, B, 
C,D 


METHYLMALONYL-COA 
MUTASE; CHAIN: A, B, C, D; 


TRANSDUCIN; CHAIN: B, G; 
PHOSDUCIN; CHAIN: P; 




QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 


SEQFOL 
D score 
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PDB annotation 


ZINC FINGER, DNA-BINDING
PROTEIN 




COMPLEX (ZINC FINGER/DNA)
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 




COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 


Compound


OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C;


TRANSCRIPTION REGULATION 
YEAST TRANSCRIPTION 
FACTOR ADR1 (RESIDUES 102 -
130) 1ARD 3 (AMINO
TERMINAL ZINC FINGER
DOMAIN) (NMR, 10
STRUCTURES) 1ARD 4 (ADR1B)
1ARD 5


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


SEQFOL 
D score 
PDB annotation


(ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2
CRYSTAL STRUCTURE, COMPLEX
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA
INTERACTION, PROTEIN DESIGN, 2
CRYSTAL STRUCTURE, COMPLEX
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA
INTERACTION, PROTEIN DESIGN, 2
CRYSTAL STRUCTURE, COMPLEX
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2


Compound


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G;


SEQFOL 
D score
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PDB annotation 


POLYMERASE III, 2 
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2
TRANSCRIPTION INITIATION, ZINC
FINGER PROTEIN


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 
COMPLEX (TRANSCRIPTION 


Compound




TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA;
CHAIN: A, B;


YY1; CHAIN: C; ADENO-


SEQFOL 
D score 
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PDB annotation 


REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 




COMPLEX (DNA-BINDING
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 
COMPLEX (DNA-BINDING 


Compound


ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


COMPLEX(TRANSCRIPTION 
REGULATION/DNA) 
TRAMTRACK PROTEIN (TWO 
ZINC-FINGER PEPTIDE) 
COMPLEXED WITH 2DRP 3 
DNA 2DRP 4 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 

ZINC FINGER PROTEIN GLI1;


SEQFOL 
D score 










76.50 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA)




STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 




ENDOCYTOSIS/EXOCYTOSIS 
NSEC1; PROTEIN-PROTEIN
COMPLEX, MULTI-SUBUNIT


ENDOCYTOSIS/EXOCYTOSIS 
NSEC1; PROTEIN-PROTEIN
COMPLEX, MULTI-SUBUNIT


ENDOCYTOSIS/EXOCYTOSIS
SYNAPTOTAGMIN ASSOCIATED 35 
KDA PROTEIN, P35A, THREE HELIX 
BUNDLE 


ENDOCYTOSIS/EXOCYTOSIS 
SYNAPTOTAGMIN ASSOCIATED 35 
KDA PROTEIN, P35A, THREE HELIX 


Compound


CHAIN: A; DNA; CHAIN: C, D; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 




ALPHA SPECTRIN; CHAIN: A, 
B, C; 


ALPHA SPECTRIN; CHAIN: A, 
B,C; 


SYNTAXIN BINDING PROTEIN
1; CHAIN: A; SYNTAXIN 1A; 
CHAIN: B; 


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A; 
CHAIN: B; 


SYNTAXIN BINDING PROTEIN 
1; CHAIN: A; SYNTAXIN 1A; 
CHAIN: B; 


SYNTAXIN-1A; CHAIN: A, B, C;


SYNTAXIN-1A; CHAIN: A, B, C;


SEQFOL 
D score 
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PDB annotation 


ENDOCYTOSIS/EXOCYTOSIS 
SYNAPTOTAGMIN ASSOCIATED 35 
KDA PROTEIN, P35A, THREE HELIX 
BUNDLE 


ENDOCYTOSIS/EXOCYTOSIS 
SYNAPTOTAGMIN ASSOCIATED 35 
KDA PROTEIN, P35A, THREE HELIX 
BUNDLE


ENDOCYTOSIS/EXOCYTOSIS
SYNAPTOTAGMIN ASSOCIATED 35 
KDA PROTEIN, P35A, THREE HELIX 
BUNDLE 


SIGNALING PROTEIN GBP, GTP
HYDROLYSIS, GDP, GMP,
INTERFERON INDUCED, DYNAMIN
2 RELATED, LARGE GTPASE
FAMILY, GMPPNP, GPPNHP




CONTRACTILE PROTEIN TRIPLE-
HELIX COILED COIL,
CONTRACTILE PROTEIN 




TRANSCRIPTION REGULATION
SIGMA70; RNA POLYMERASE 
SIGMA FACTOR, TRANSCRIPTION 
REGULATION 


TRANSCRIPTION REGULATION
SIGMA70; RNA POLYMERASE 
SIGMA FACTOR, TRANSCRIPTION 
REGULATION 


COMPLEX


Compound


SYNTAXIN-1A; CHAIN: A, B, C;


SYNTAXIN-1A; CHAIN: A, B, C;


SYNTAXIN-1A; CHAIN: A, B, C;


INTERFERON-INDUCED
GUANYLATE-BINDING 
PROTEIN 1; CHAIN: A;


SSO1 PROTEIN; CHAIN: A;


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 




RNA POLYMERASE PRIMARY 
SIGMA FACTOR; CHAIN: NULL; 


RNA POLYMERASE PRIMARY 
SIGMA FACTOR; CHAIN: NULL; 


TRANSDUCIN; CHAIN: B, G;
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D score 
PDB annotation


(TRANSDUCER/TRANSDUCTION)
GT BETA-GAMMA; MEKA, PP33; 
PHOSDUCIN, TRANSDUCIN, BETA- 
GAMMA, SIGNAL TRANSDUCTION, 
2 REGULATION, 

PHOSPHORYLATION, G PROTEINS, 
THIOREDOXIN, 3 VISION, MEKA, 
COMPLEX 

(TRANSDUCER/TRANSDUCTION)


COMPLEX 

(TRANSDUCER/TRANSDUCTION)
GT BETA-GAMMA; MEKA, PP33; 
PHOSDUCIN, TRANSDUCIN, BETA- 
GAMMA, SIGNAL TRANSDUCTION, 
2 REGULATION, 

PHOSPHORYLATION, G PROTEINS, 
THIOREDOXIN, 3 VISION, MEKA, 
COMPLEX 

(TRANSDUCER/TRANSDUCTION)


COMPLEX 

(TRANSDUCER/TRANSDUCTION)
GT BETA-GAMMA; MEKA, PP33; 
PHOSDUCIN, TRANSDUCIN, BETA- 
GAMMA, SIGNAL TRANSDUCTION,
2 REGULATION, 

PHOSPHORYLATION, G PROTEINS,
THIOREDOXIN, 3 VISION, MEKA, 
COMPLEX 

(TRANSDUCER/TRANSDUCTION)


TRANSCRIPTION REGULATION 
PROTO-ONCOGENE, NUCLEAR 
BODIES (PODS), LEUKEMIA, 2 
TRANSCRIPTION REGULATION 


Compound


PHOSDUCIN; CHAIN: P; 


TRANSDUCIN; CHAIN: B, G; 
PHOSDUCIN; CHAIN: P; 


TRANSDUCIN; CHAIN: B, G; 
PHOSDUCIN; CHAIN: P; 


TRANSCRIPTION FACTOR 
PML; CHAIN: NULL; 


SEQFOL 
D score 


ACTIN BINDING PROTEIN ABP-120,

ACTIN BINDING PROTEIN, 
STRUCTURE, IMMUNOGLOBULIN, 
GELATION 2 FACTOR, ABP- 120 




ACTIN BINDING PROTEIN ACTIN

BINDING PROTEIN 120; ACTIN 
BINDING PROTEIN, 
IMMUNOGLOBULIN, GELATION 
FACTOR, ABP- 2 120 




ACTIN BINDING PROTEIN ACTIN
BINDING PROTEIN 120; ACTIN 
BINDING PROTEIN, 
IMMUNOGLOBULIN, GELATION 
FACTOR, ABP- 2 120 

TRANSFERASE 


Compound


GELATION FACTOR; CHAIN: 




GELATION FACTOR; CHAIN: 
NULL; 


GELATION FACTOR; CHAIN:
NULL; 


GELATION FACTOR; CHAIN: A, 
B; 


GELATION FACTOR; CHAIN: A, 
B; 


GELATION FACTOR; CHAIN: A, 
B; 


GELATION FACTOR; CHAIN: A, 
B; 

GLYCINE N-


SEQFOL 
D score 
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PDB annotation 


METHYLTRANSFERASE 


STRUCTURAL GENOMICS
HYPOTHETICAL PROTEIN,


TRANSFERASE SAM-BINDING 
DOMAIN, BETA-BARREL, MIXED 
ALPHA-BETA, HEXAMER, 2 DIMER 






COMPLEX (TRANSCRIPTION 
REGULATION/DNA) GABPALPHA; 
GABPBETA1; COMPLEX
(TRANSCRIPTION 

REGULATION/DNA), DNA-BINDING, 
2 NUCLEAR PROTEIN, ETS DOMAIN 
ANKYRIN REPEATS, 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) GABPALPHA; 
GABPBETA1; COMPLEX


Compound




MJ0882; CHAIN: A; 


HNRNP ARGININE N- 
METHYLTRANSFERASE; 
CHAIN: 1, 2, 3, 4, 5, 6;


TUMOR SUPPRESSOR 
P16INK4A; CHAIN: NULL;


TRANSFERASE(PHOSPHOTRA
NSFERASE) $C-/AMP$-
DEPENDENT PROTEIN KINASE
(E.C.2.7.1.37) ($C/APK$) 1APM 3
(CATALYTIC SUBUNIT) ALPHA
ISOENZYME MUTANT WITH
SER 139 1APM 4 REPLACED BY
ALA ($S139A$) COMPLEX WITH
THE PEPTIDE 1APM 5
INHIBITOR PKI(5-24) AND THE
DETERGENT MEGA-8 1APM 6


GA BINDING PROTEIN ALPHA; 
CHAIN: A; GA BINDING 
PROTEIN BETA 1; CHAIN: B; 
DNA; CHAIN: D, E; 


GA BINDING PROTEIN ALPHA; 
CHAIN: A; GA BINDING 
PROTEIN BETA 1; CHAIN: B;


SEQFOL 
D score 
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PDB annotation 


REGULATION/DNA), DNA-BINDING, 
2 NUCLEAR PROTEIN, ETS DOMAIN, 
ANKYRIN REPEATS, 
TRANSCRIPTION 3 FACTOR 


REGULATION/DNA) GABPALPHA; 
GABPBETA1; COMPLEX
(TRANSCRIPTION 

REGULATION/DNA), DNA-BINDING, 
2 NUCLEAR PROTEIN, ETS DOMAIN, 
ANKYRIN REPEATS, 
TRANSCRIPTION 3 FACTOR 




REGULATION/DNA) GABPALPHA; 
GABPBETA1; COMPLEX
(TRANSCRIPTION 

REGULATION/DNA), DNA-BINDING, 
2 NUCLEAR PROTEIN, ETS DOMAIN, 
ANKYRIN REPEATS, 
TRANSCRIPTION 3 FACTOR 


REGULATION/DNA) GABPALPHA; 
GABPBETA1; COMPLEX
(TRANSCRIPTION 

REGULATION/DNA), DNA-BINDING, 


Compound


DNA; CHAIN: D,E; 


GA BINDING PROTEIN ALPHA; 
CHAIN: A; GA BINDING 
PROTEIN BETA 1; CHAIN: B; 
DNA; CHAIN: D, E; 


GA BINDING PROTEIN ALPHA; 
CHAIN: A; GA BINDING
PROTEIN BETA 1; CHAIN: B;
DNA; CHAIN: D, E; 


GA BINDING PROTEIN ALPHA; 
CHAIN: A; GA BINDING 
PROTEIN BETA 1; CHAIN: B; 
DNA; CHAIN: D, E; 


GA BINDING PROTEIN ALPHA; 
CHAIN: A; GA BINDING 
PROTEIN BETA 1; CHAIN: B;
DNA; CHAIN: D, E; 


SEQFOL 
D score 




94.92 








PMF 
score 






1.00 


1.00 


1.00


Verify 
score 
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PDB annotation 


TUMOR SUPPRESSOR TUMOR
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 


Compound


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


SEQFOL 
D score 
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0.49 


1.00 


o 
q 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


Verify 
score 


0.37 


0.66 


0.58 


0.76 


0.32 


0.51 


o 


0.77 


0.63 


Psi Blast 




1.1e-36




START
AA




CHAIN
ID














PQ 


PQ 


rt 




oo 

1 


8 

»—l 


00 

3 


OO 
T3 
X> 


00 
T3 


oo 


1blx


1blx


1blx


gag 


1853 


CO 

wo 

p 


1853 


1853 


1853 


1853 


1853 


1853 


1853 






WO 02/070539 PCT/US02/05095





ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE)




HORMONE/GROWTH FACTOR P18-
INK4C; CELL CYCLE INHIBITOR,
P18INK4C, TUMOR, SUPPRESSOR,
CYCLIN- 2 DEPENDENT KINASE,


HORMONE/GROWTH FACTOR P18-
INK4C; CELL CYCLE INHIBITOR,
P18INK4C, TUMOR, SUPPRESSOR,
CYCLIN- 2 DEPENDENT KINASE,


TRANSFERASE CSK; PROTEIN 
KINASE, C-TERMINAL SRC KINASE, 
PHOSPHORYLATION, 2 






SIGNALING PROTEIN HELIX-TURN-
HELIX, ANKYRIN REPEAT
SIGNALING PROTEIN HELIX-TURN- 


Compound




CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A; 


C-TERMINAL SRC KINASE; 
CHAIN: A; 


PHOSPHOTRANSFERASE 
CAMP-DEPENDENT PROTEIN 
KINASE CATALYTIC SUBUNIT 
1CMK 3 (E.C.2.7.1.37) 1CMK 4


TRANSFERASE(PHOSPHOTRA 
NSFERASE) CAMP- 
DEPENDENT PROTEIN KINASE 
(E.C.2.7.1.37) (CAPK) 1CTP 3
(CATALYTIC SUBUNIT) 1CTP 4


CYCLIN-DEPENDENT KINASE
4 INHIBITOR B; CHAIN: A;
CYCLIN-DEPENDENT KINASE


SEQFOL 
D score 
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PDB annotation 


PHOSPHOTRANSFERASE FGFR1K, 
FIBROBLAST GROWTH FACTOR 
RECEPTOR I; TRANSFERASE, 
TYROSINE-PROTEIN KINASE, ATP- 
BINDING, 2 PHOSPHORYLATION, 
RECEPTOR, 

PHOSPHOTRANSFERASE 


PHOSPHOTRANSFERASE C-SRC, 
P60-SRC; SRC, TYROSINE KINASE, 
PHOSPHORYLATION, SH2, SH3, 2
PHOSPHOTYROSINE, PROTO- 
ONCOGENE, 

PHOSPHOTRANSFERASE


TRANSFERASE P150, C-ABL;
KINASE, KINASE INHIBITOR, STI-
571, ACTIVATION LOOP


PROTEIN KINASE CDK2; 
TRANSFERASE, 
SERINE/THREONINE PROTEIN
KINASE, ATP-BINDING, 2 CELL 
CYCLE, CELL DIVISION, MITOSIS, 
PHOSPHORYLATION 


CELL CYCLE INHIBITOR P18-
INK4C(INK6); CELL CYCLE
INHIBITOR, P18-INK4C(INK6),
ANKYRIN REPEAT, 2 CDK 4/6 
INHIBITOR 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


TRANSCRIPTION FACTOR P65; 
P50D; TRANSCRIPTION FACTOR, 
IKB/NFKB COMPLEX 


Compound


FGF RECEPTOR 1; CHAIN: A, B; 


TYROSINE-PROTEIN KINASE 
SRC; CHAIN: NULL; 


PROTO-ONCOGENE 
TYROSINE-PROTEIN KINASE 
ABL; CHAIN: A, B; 


HUMAN CYCLIN-DEPENDENT 
KINASE 2; CHAIN: NULL; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A, B; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
B-ALPHA; CHAIN: D; 


NF-KAPPA-B P65 SUBUNIT; 
CHAIN: A; NF-KAPPA-B P50D 
SUBUNIT; CHAIN: C; I-KAPPA- 
PDB annotation

ANK-REPEAT MYOTROPHIN, 
ACETYLATION, NMR, ANK-REPEAT


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX
(TRANSCRIPTION
REGULATION/ANK REPEAT),
ANKYRIN 2 REPEAT HELIX


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT),
ANKYRIN 2 REPEAT HELIX


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT),
ANKYRIN 2 REPEAT HELIX 
COMPLEX (TRANSCRIPTION 


Compound


MYOTROPHIN; CHAIN: NULL 


MYOTROPHIN; CHAIN: NULL 


MYOTROPHIN; CHAIN: NULL 


MYOTROPHIN; CHAIN: NULL 


MYOTROPHIN; CHAIN: NULL 


MYOTROPHIN; CHAIN: NULL 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 


NF-KAPPA-B P65; CHAIN: A, C; 
NF-KAPPA-B P50; CHAIN: B, D; 
I-KAPPA-B-ALPHA; CHAIN: E, 
F; 

NF-KAPPA-B P65; CHAIN: A, C; 
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PDB annotation 


TRANSFERASE MITOGEN 
ACTIVATED PROTEIN KINASE, MAP 
2, ERK2; TRANSFERASE, 
SERINE/THREONINE-PROTEIN 
KINASE, MAP KINASE, 2 ERK2




SIGNALING PROTEIN GTP-BINDING 
PROTEINS, PROTEIN-PROTEIN 
COMPLEX, EFFECTORS 


SIGNALING PROTEIN GTP-BINDING 
PROTEINS, PROTEIN-PROTEIN 
COMPLEX, EFFECTORS 


SIGNALING PROTEIN G PROTEIN, 
GTP HYDROLYSIS, KINETIC 
CRYSTALLOGRAPHY, 2 SIGNALING 
PROTEIN 


SIGNALING PROTEIN G PROTEIN, 
GTP HYDROLYSIS, KINETIC 
CRYSTALLOGRAPHY, 2 SIGNALING 
PROTEIN 


ENDOCYTOSIS/EXOCYTOSIS G- 
PROTEIN, GTPASE, RAB6, 
VESICULAR TRAFFICKING 


SIGNALING PROTEIN P21-RAC2; 
RHO GDI 2, RHO-GDI BETA, LY-GDI; 
BETA SANDWICH, PROTEIN-
PROTEIN COMPLEX, G-DOMAIN, 2
IMMUNOGLOBULIN FOLD, 
WALKER FOLD, GTP-BINDING 
PROTEIN 


Compound


EXTRACELLULAR 
REGULATED KINASE 2; 
CHAIN: NULL;






RAS-RELATED PROTEIN RAP- 
1A; CHAIN: A; PROTO- 
ONKOGENE 

SERINE/THREONINE PROTEIN
KINASE CHAIN: B; 


RAS-RELATED PROTEIN RAP-
1A; CHAIN: A; PROTO-
ONKOGENE 

SERINE/THREONINE PROTEIN 


TRANSFORMING PROTEIN 
P21/H-RAS-1; CHAIN: A;


TRANSFORMING PROTEIN 
P21/H-RAS-1; CHAIN: A; 


RAB6 GTPASE; CHAIN: A; 


RAS-RELATED C3 BOTULINUM 
TOXIN SUBSTRATE 2; CHAIN: 
A; RHO GDP-DISSOCIATION 
INHIBITOR 2; CHAIN: B; 
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D score 
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PDB annotation 


HORMONE RECEPTOR HORMONE 
RECEPTOR, INSULIN RECEPTOR 
FAMILY 


SERINE PROTEASE INHIBITOR 
SERINE PROTEASE INHIBITOR, R- 
ELAFIN. ELASTASE INHIBITOR 




SIGNALING PROTEIN GTP-BINDING
PROTEINS, PROTEIN-PROTEIN
COMPLEX, EFFECTORS


SIGNALING PROTEIN PROTEIN- 
PROTEIN COMPLEX, 
ANTIPARALLEL COILED-COIL


ENDOCYTOSIS/EXOCYTOSIS G- 
PROTEIN, GTPASE, RAB6, 
VESICULAR TRAFFICKING


G PROTEIN G PROTEIN, RAS, ARF, 
ARF6. MEMBRANE TRAFFIC 


ENDOCYTOSIS/EXOCYTOSIS G 
PROTEIN, VESICULAR TRAFFIC, 
GTP HYDROLYSIS, YPT/RAB 2 
PROTEIN, ENDOCYTOSIS, 
HYDROLASE 


SIGNALING PROTEIN ARF-LIKE 
PROTEIN 3, ARL3; PROTEIN-GDP 
COMPLEX WITHOUT MAGNESIUM, 
ARF FAMILY, RAS 2 SUPERFAMILY, 
G-DOMAIN 


PROTEIN TRANSPORT GDP- 
BINDING, MEMBRANE TRAFFICKIN, 
NON-MYRISTOYLATED 1HUR 16
PROTEIN TRANSPORT GDP- 


Compound


INSULIN-LIKE GROWTH 
FACTOR RECEPTOR 1; CHAIN: 
A; 


R-ELAFIN; CHAIN: NULL; 




RAS-RELATED PROTEIN RAP-
1A; CHAIN: A; PROTO-
ONKOGENE 

SERINE/THREONINE PROTEIN 
KINASE CHAIN: B;


HIS-TAGGED TRANSFORMING 
PROTEIN RHOA(0-181); CHAIN: 
A; PKN; CHAIN: B;


RAB6 GTPASE; CHAIN: A; 


ADP-RIBOSYLATION FACTOR 
6; CHAIN: A;


GTP-BINDING PROTEIN YPT51;
CHAIN: A; 


ADP-RIBOSYLATION FACTOR- 
LIKE PROTEIN 3; CHAIN: A; 


HUMAN ADP-RIBOSYLATION 
FACTOR 1; 1HUR 5 CHAIN: A,
B; 1HUR 7

HUMAN ADP-RIBOSYLATION 
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PDB annotation 


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) TFIIIA; 5S
GENE; NMR, TFIIIA, PROTEIN, DNA, 
TRANSCRIPTION FACTOR, 5S RNA 2 
GENE, DNA BINDING PROTEIN, 
ZINC FINGER, COMPLEX 3 
(TRANSCRIPTION 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA-


Compound


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G;


TRANSCRIPTION FACTOR IIIA; 
CHAIN: A; 5S RNA GENE;
CHAIN: E, F; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA;
CHAIN: A, B;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


SEQFOL 
D score 
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PDB annotation 


BINDING PROTEIN/DNA)


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 




COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 






ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 




QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B,C; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


SEQFOL 
D score 




58.21 
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0.11 
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PDB annotation 


(ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA)




Compound


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


TRANSCRIPTION REGULATION 
YEAST TRANSCRIPTION 
FACTOR ADR1 (RESIDUES 130-


SEQFOL 
D score 


70.79 
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PDB annotation 




ZINC FINGER TRANSCRIPTION
FACTOR SP1; ZINC FINGER,
TRANSCRIPTION ACTIVATION, SP1


COMPLEX (TRANSCRIPTION
REGULATION/DNA) TFIIIA; 5S 
GENE; NMR, TFIIIA, PROTEIN, DNA, 
TRANSCRIPTION FACTOR, 5S RNA 2 
GENE, DNA BINDING PROTEIN, 
ZINC FINGER, COMPLEX 3 
(TRANSCRIPTION 
REGULATION/DNA) 


REGULATION/DNA) TFIIIA; 5S 
GENE; NMR, TFIIIA, PROTEIN, DNA,
TRANSCRIPTION FACTOR, 5S RNA 2 
GENE, DNA BINDING PROTEIN, 
ZINC FINGER, COMPLEX 3 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION
REGULATION/DNA) TFIIIA; 5S 
GENE; NMR, TFIIIA, PROTEIN, DNA, 
TRANSCRIPTION FACTOR, 5S RNA 2 
GENE, DNA BINDING PROTEIN, 
ZINC FINGER, COMPLEX 3 
(TRANSCRIPTION 


Compound


159) 1PAA 3 (PAPA - CARBOXY
TERMINAL ZINC FINGER
DOMAIN) MUTANT WITH 1PAA
4 PRO 131 REPLACED BY ALA,
PRO 133 REPLACED BY ALA,
CYS 140 1PAA 5 REPLACED BY
ALA (P131A,P133A,C140A)
(NMR, 10 STRUCTURES) 1PAA 6


SP1F2; CHAIN: NULL; 


TRANSCRIPTION FACTOR IIIA; 
CHAIN: A; 5S RNA GENE; 
CHAIN: E, F; 


TRANSCRIPTION FACTOR IIIA;
CHAIN: A; 5S RNA GENE;
CHAIN: E, F;


TRANSCRIPTION FACTOR IIIA; 
CHAIN: A; 5S RNA GENE; 
CHAIN: E, F; 


SEQFOLD score
PMF
score
PDB annotation


RECOGNITION, 3 COMPLEX

(TRANSCRIPTION 

REGULATION/DNA) 


COMPLEX (TRANSCRIPTION
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


TRANSCRIPTION REGULATION
TRANSCRIPTION REGULATION,
ADR1, ZINC FINGER, NMR


COMPLEX (DNA-BINDING
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


Compound




YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A,B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


ADR1; CHAIN: NULL;


ZINC FINGER PROTEIN GLI1; 
CHAIN: A; DNA; CHAIN: C, D; 


SEQFOLD score






68.47 








PMF 
score 




0.07 
Verify
PDB annotation


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA)


COMPLEX (DNA-BINDING
PROTEIN/DNA) FIVE-FINGER GLI;
GLI, ZINC FINGER, COMPLEX (DNA-
BINDING PROTEIN/DNA)


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN


COMPLEX (ZINC FINGER/DNA)
COMPLEX (ZINC FINGER/DNA),
ZINC FINGER, DNA-BINDING
PROTEIN


COMPLEX (ZINC FINGER/DNA)
COMPLEX (ZINC FINGER/DNA),
ZINC FINGER, DNA-BINDING 
PROTEIN 


GENE REGULATION POZ DOMAIN; 
PROTEIN-PROTEIN INTERACTION 
DOMAIN, TRANSCRIPTIONAL 2 
REPRESSOR, ZINC-FINGER 
PROTEIN, X-RAY 
CRYSTALLOGRAPHY, 3 PROTEIN 
STRUCTURE, PROMYELOCYTIC 
LEUKEMIA, GENE REGULATION


GENE REGULATION POZ DOMAIN; 
PROTEIN-PROTEIN INTERACTION 
DOMAIN, TRANSCRIPTIONAL 2 
REPRESSOR, ZINC-FINGER 
PROTEIN, X-RAY 


Compound


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C;


QGSR ZINC FINGER PEPTIDE;
CHAIN: A; DUPLEX
OLIGONUCLEOTIDE BINDING
SITE; CHAIN: B, C;


QGSR ZINC FINGER PEPTIDE;
CHAIN: A; DUPLEX
OLIGONUCLEOTIDE BINDING
SITE; CHAIN: B, C;


PROMYELOCYTIC LEUKEMIA 
ZINC FINGER PROTEIN PLZF; 
CHAIN: A; 


PROMYELOCYTIC LEUKEMIA 
ZINC FINGER PROTEIN PLZF; 
CHAIN: A; 


SEQFOLD score


68.84 






69.01 






64.40 


PMF 

score 




0.04 


1.00 




0.41 


1.00 




Verify 
score 
PDB annotation


CRYSTALLOGRAPHY, 3 PROTEIN
STRUCTURE, PROMYELOCYTIC 
LEUKEMIA, GENE REGULATION 


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) TFIIIA; 5S 
GENE; NMR, TFIIIA, PROTEIN, DNA, 
TRANSCRIPTION FACTOR, 5S RNA 2 
GENE, DNA BINDING PROTEIN, 
ZINC FINGER, COMPLEX 3 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 


Compound




DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 



DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


TRANSCRIPTION FACTOR IIIA; 
CHAIN: A; 5S RNA GENE; 
CHAIN: E, F;


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


SEQFOLD score








67.59 








PMF 
score 




0.09 


1.00 




0.71 


0.23 
Verify
score
PDB annotation




COMPLEX (DNA-BINDING
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


REPLICATION DNA NUCLEOTIDE 
EXCISION REPAIR, UVRABC, 
HELICASE 2 

HYPERTHERMOSTABLE PROTEIN 


HYDROLASE UVRB; MULTIDOMAIN
PROTEIN 


HYDROLASE UVRB; MULTIDOMAIN
PROTEIN 


GENE REGULATION ArU rRU 1 biN 


TRANSLATION YEAST INITIATION
FACTOR 4A, EIF4A; HELICASE, 
INITIATION FACTOR 4A, DEAD-BOX 
PROTEIN 


TRANSLATION EUKARYOTIC
INITIATION FACTOR 4A; IF4A, 
HELICASE, DEAD-BOX PROTEIN 


TRANSLATION EUKARYOTIC
INITIATION FACTOR 4A; IF4A,
HELICASE, DEAD-BOX PROTEIN 


TRANSPORT PROTEIN TC4; GTPASE, 
NUCLEAR TRANSPORT, 
TRANSPORT PROTEIN 


TRANSPORT PROTEIN TC4; GTPASE,
NUCLEAR TRANSPORT, 
TRANSPORT PROTEIN 


Compound




ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


DNA NUCLEOTIDE EXCISION 
REPAIR ENZYME UVRB; 
CHAIN: A; 


EXCINUCLEASE ABC SUBUNIT 
B; CHAIN: A; 


EXCINUCLEASE ABC SUBUNIT 
B; CHAIN: A; 


EXCINUCLEASE UVRABC 1 
COMPONENT UVRB; CHAIN: A; 


EUKARYOTIC INITIATION 
FACTOR 4A; CHAIN: A; 


YEAST INITIATION FACTOR 
4A; CHAIN: A, B; 


YEAST INITIATION FACTOR 
4A; CHAIN: A, B; 


GTP-BINDING PROTEIN RAN; 
CHAIN: A, B; 


GTP-BINDING PROTEIN RAN; 
CHAIN: A, B; 


SEQFOLD score
TRANSPORT PROTEIN TC4; GTPASE,
NUCLEAR TRANSPORT,
TRANSPORT PROTEIN


TRANSPORT PROTEIN TC4; GTPASE, 
NUCLEAR TRANSPORT, 
TRANSPORT PROTEIN


SIGNALING PROTEIN GTP-BINDING 
PROTEINS, PROTEIN-PROTEIN 
COMPLEX, EFFECTORS 


SIGNALING PROTEIN GTP-BINDING 
PROTEINS, PROTEIN-PROTEIN 
COMPLEX, EFFECTORS 


SIGNALING PROTEIN G PROTEIN, 
GTP HYDROLYSIS, KINETIC 
CRYSTALLOGRAPHY, 2 SIGNALING 
PROTEIN


SIGNALING PROTEIN G PROTEIN, 
GTP HYDROLYSIS, KINETIC 
CRYSTALLOGRAPHY, 2 SIGNALING 
PROTEIN


SIGNALING PROTEIN PROTEIN- 
PROTEIN COMPLEX, 
ANTIPARALLEL COILED-COIL


ENDOCYTOSIS/EXOCYTOSIS G- 
PROTEIN, GTPASE, RAB6, 


ENDOCYTOSIS/EXOCYTOSIS G 
PROTEIN, VESICULAR TRAFFIC, 
GTP HYDROLYSIS, YPT/RAB 2 
PROTEIN, ENDOCYTOSIS r 


Compound


GTP-BINDING PROTEIN RAN; 
CHAIN: A, B; 


GTP-BINDING PROTEIN RAN; 
CHAIN: A, B; 


RAS-RELATED PROTEIN RAP-
1A; CHAIN: A; PROTO-
ONKOGENE

SERINE/THREONINE PROTEIN
KINASE CHAIN: B;


RAS-RELATED PROTEIN RAP- 
1A; CHAIN: A; PROTO- 
ONKOGENE 

SERINE/THREONINE PROTEIN 
KINASE CHAIN: B;


TRANSFORMING PROTEIN 
P21/H-RAS-1; CHAIN: A; 


TRANSFORMING PROTEIN 
P21/H-RAS-1; CHAIN: A; 


HIS-TAGGED TRANSFORMING 
PROTEIN RHOA(0-181); CHAIN:
A; PKN; CHAIN: B; 


RAB6 GTPASE; CHAIN: A; 


GTP-BINDING PROTEIN YPT51; 
CHAIN: A; 


SEQFOLD score




109.68 




102.67 




98.76 


95.76 






PMF 
score 


1.00 




1.00 




1.00 
Verify
score
RAB3A, RABPHILIN
COMPLEX (GTP-
BINDING/EFFECTOR) RAS-RELATED
PROTEIN RAB3A; COMPLEX (GTP- 
BINDING/EFFECTOR), G PROTEIN, 
EFFECTOR, RABCDR, 2 SYNAPTIC 
EXOCYTOSIS, RAB PROTEIN, 

RAB3A, RABPHILIN


SB a 

5 Q t, m H 

spi 

J <5 pi < 
« j J j p _ 

fOPOgC 

1 1 ^ 


HYDROLASE G PROTEIN,
VESICULAR TRAFFICKING, GTP
HYDROLYSIS, RAB 2 PROTEIN, 
NEUROTRANSMITTER RELEASE, 
HYDROLASE 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 


< < 

if 1 

lis 

> CL ^ Ct 

i s 2 <_ 


GENE REGULATION/RNA POLY(A)
BINDING PROTEIN 1, PABP 1; RRM 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 

GENE REGULATION/RNA POLY(A) 


Compound


RAB-3A; CHAIN: A; 
RABPHILIN-3A; CHAIN: B; 


RAB3A; CHAIN: A; 


RAB3A; CHAIN: A;


U2 RNA HAIRPIN IV; CHAIN: Q, 
R; U2 A'; CHAIN: A, C; U2 B"; 
CHAIN: B, D; 
SEQFOLD score
PMF
score
Verify
score
SPLICING INHIBITOR,
TRANSLATIONAL INHIBITOR, SEX 3
DETERMINATION, X CHROMOSOME 
DOSAGE COMPENSATION 




COMPLEX (TRANSCRIPTION 
FACTOR/DNA) COMPLEX 
(TRANSCRIPTION FACTOR/DNA), 
TRANSCRIPTION FACTOR, 2 DNA- 
BINDING PROTEIN 


COMPLEX (TRANSCRIPTION 
FACTOR/DNA) COMPLEX 
(TRANSCRIPTION FACTOR/DNA), 
TRANSCRIPTION FACTOR, 2 DNA- 
BINDING PROTEIN 


COMPLEX (INHIBITOR/NUCLEASE)
COMPLEX (INHIBITOR/NUCLEASE),
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 
3 REPEATS 


COMPLEX (INHIBITOR/NUCLEASE) 
COMPLEX (INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 2 
MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE-RICH 


Compound




LIGASE(SYNTHETASE) SERYL- 
TRNA SYNTHETASE 
(E.C.6.1.1.11) (SERINE-TRNA 
LIGASE) 1SES 3 COMPLEXED
WITH SERYL-HYDROXAMATE-
AMP 1SES 4


T PROTEIN; CHAIN: A, B; DNA; 
CHAIN: C, D; 


T PROTEIN; CHAIN: A, B; DNA; 
CHAIN: C, D; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


RIBONUCLEASE INHIBITOR; 
CHAIN: A, D; ANGIOGENIN; 
CHAIN: B, E; 


SEQFOLD score














PMF 

score 




0.01 


0.60 


0.47 


0.43 


0.60 


Verify 
score 




-0.21 
0.15
PDB annotation


GTPASE-ACTIVATING PROTEIN 
FOR SPI1, GTPASE-ACTIVATING
PROTEIN, GAP 3 RNA1P, RANGAP, 
LRR, LEUCINE- 2 RICH REPEAT 
PROTEIN, TWINNING, 
HEMIHEDRAL TWINNING, 3 
MEROHEDRAL TWINNING, 
MEROHEDRY


ACETYLATION RNASE INHIBITOR, 
RIBONUCLEASE/ANGIOGENIN
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS 


TRANSFERASE GRK-2, BETA-
ADRENERGIC RECEPTOR KINASE 1,
BETA-ARK PLECKSTRIN 
HOMOLOGY DOMAIN, PH DOMAIN, 
SIGNAL TRANSDUCTION, 2 G- 
BETA-GAMMA BINDING DOMAIN, 
BETA-ADRENERGIC RECEPTOR 3 
KINASE, BETA-ARK, G-PROTEIN
COUPLED RECEPTOR KINASE (GRK- 
2) 


SIGNAL TRANSDUCTION PROTEIN 


CYTOSKELETON 




MEMBRANE PROTEIN CRYSTAL 
STRUCTURE, MEMBRANE, FERM 
DOMAIN, TAIL DOMAIN 


Compound


PROTEIN RNA1_SCHPO;
CHAIN: A, B; 


RIBONUCLEASE INHIBITOR; 
CHAIN: NULL; 


G-PROTEIN COUPLED 
RECEPTOR KINASE 2; CHAIN:
NULL;


BETA-SPECTRIN; 1BTN 4
CHAIN: NULL; 1BTN 5


BETA-SPECTRIN; 1DRO 6
CHAIN: NULL; 1DRO 7


SIGNAL TRANSDUCTION 
PROTEIN DYNAMIN
(PLECKSTRIN HOMOLOGY
DOMAIN) (DYNPH) 1DYN 3


MOESIN; CHAIN: A, B; MOESIN- 
CHAIN: C, D; 
























SEQFOLD score






















PMF 
score 




0.71 
Verify
score
PDB annotation


SIGNALING PROTEIN DAPP1, PHISH,
BAM32; PLECKSTRIN, 3- 
PHOSPHOINOSITIDES, INOSITOL 
TETRAKISPHOSPHATE 2 SIGNAL 
TRANSDUCTION PROTEIN, 
ADAPTOR PROTEIN 


SIGNALING PROTEIN DAPP1, PHISH,
BAM32; PLECKSTRIN, 3-
PHOSPHOINOSITIDES, INOSITOL
TETRAKISPHOSPHATE 2 SIGNAL 
TRANSDUCTION PROTEIN, 
ADAPTOR PROTEIN 


SIGNALING PROTEIN ARF1
GUANINE NUCLEOTIDE EXCHANGE 
FACTOR AND PH DOMAIN 


CELL ADHESION 3 
SUBDOMAINS, CYTOSKELETON,
CELL ADHESION 






OXIDOREDUCTASE IPMDH, IMDH; 
OXIDOREDUCTASE, 
DECARBOXYLATING 
DEHYDROGENASE, LEUCINE 2 
BIOSYNTHESIS 


OXIDOREDUCTASE IPMDH, IMDH; 
OXIDOREDUCTASE, 


Compound


DUAL ADAPTOR OF 
PHOSPHOTYROSINE AND 3- 
CHAIN: A; 




DUAL ADAPTOR OF 
PHOSPHOTYROSINE AND 3- 
CHAIN: A; 


GRP1; CHAIN: A;


RADIXIN; CHAIN: A; 


PHOSPHORYLATION 
PLECKSTRIN (N-TERMINAL 
PLECKSTRIN HOMOLOGY 
DOMAIN) MUTANT 1PLS 3
WITH LEU GLU (HIS)6 ADDED
TO THE C TERMINUS 1PLS 4
(INS(G105-LEHHHHHH)) (NMR,
25 STRUCTURES) 1PLS 5




3-ISOPROPYLMALATE
DEHYDROGENASE; CHAIN: A, 
B; 


3-ISOPROPYLMALATE 
DEHYDROGENASE; CHAIN: A, 


SEQFOLD score














69.11 




PMF 
score 


0.80 


0.69 


0.65 


0.01 


0.09 
Verify
PDB annotation


<N 

o 

§ 

CD h-) 
p 00 

III 

3SS 

a w £ 
O Q pc 


OXIDOREDUCTASE 
OXALOSUCCINATE 
DECARBOXYLASE, IDH; 
OXIDOREDUCTASE (NAD(A)-
CHOH(D)), NADP,
PHOSPHORYLATION, 2 

GLYOXYLATE BYPASS


OXIDOREDUCTASE 
OXALOSUCCINATE 
DECARBOXYLASE, IDH; 
OXIDOREDUCTASE (NAD(A)- 
CHOH(D)), NADP, 
PHOSPHORYLATION, 2 

GLYOXYLATE BYPASS


g | 
3& .2H5 

5 W W rjt g f 

j G 5 i a i 

sails 

»OOQQf£ 


OXIDOREDUCTASE IPMDH, IMDH; 
OXIDOREDUCTASE, 
DEHYDROGENASE, NAD- 
DEPENDANT ENZYME, 2 LEUCINE 


wawj *. t ^ iiiiii rmnnA I 
OXIDOREDUCTASE IPMDH, IMDH;
OXIDOREDUCTASE, 
DEHYDROGENASE, LEUCINE 
BIOSYNTHETIC PATHWAY, 2 NAD-
DEPENDANT ENZYME


OXIDOREDUCTASE IPMDH, IMDH; 
OXIDOREDUCTASE,


Compound




ISOCITRATE 

DEHYDROGENASE; CHAIN: 
NULL; 


ISOCITRATE 

DEHYDROGENASE; CHAIN: 
NULL; 


3-ISOPROPYLMALATE 
DEHYDROGENASE; CHAIN: A, 
B; 


3-ISOPROPYLMALATE 
DEHYDROGENASE; CHAIN: A,
B; 


3-ISOPROPYLMALATE 
DEHYDROGENASE; CHAIN: A, 
B; 


3-ISOPROPYLMALATE 
DEHYDROGENASE; CHAIN: A 


SEQFOLD score
PDB annotation


DEHYDROGENASE, LEUCINE 
BIOSYNTHETIC PATHWAY, 2 NAD-
DEPENDANT ENZYME


OXIDOREDUCTASE IPMDH; 1IDM 7
CHIMERA 1IDM 20


OXIDOREDUCTASE IPMDH; 1IDM 7
CHIMERA 1IDM 20


OXIDOREDUCTASE IPMDH, IMDH;
1XAC 10 OXIDOREDUCTASE,
CHIMERA 1XAC 21


OXIDOREDUCTASE IPMDH, IMDH;
1XAC 10 OXIDOREDUCTASE,
CHIMERA 1XAC 21


OXIDOREDUCTASE 
OXIDOREDUCTASE, 3- 
ISOPROPYLMALATE 
DEHYDROGENASE, LEUCINE 2 
BIOSYNTHESIS, MODERATE 
THERMOPHILE


OXIDOREDUCTASE 
OXIDOREDUCTASE, 3- 
ISOPROPYLMALATE 
DEHYDROGENASE, LEUCINE 2 
BIOSYNTHESIS, MODERATE 
THERMOPHILE 




ANTI-ONCOGENE CELL CYCLE, 
ANTI-ONCOGENE, REPEAT, ANK 
REPEAT 


COMPLEX (TRANSCRIPTION


Compound




3-ISOPROPYLMALATE 
DEHYDROGENASE; 1IDM 5
CHAIN: NULL; 1IDM 6


3-ISOPROPYLMALATE
DEHYDROGENASE; 1IDM 5
CHAIN: NULL; 1IDM 6


3-ISOPROPYLMALATE
DEHYDROGENASE 2T2M6T
S82R; 1XAC 8 CHAIN: NULL;
1XAC 9


3-ISOPROPYLMALATE
DEHYDROGENASE 2T2M6T
S82R; 1XAC 8 CHAIN: NULL;
1XAC 9


3-ISOPROPYLMALATE 
DEHYDROGENASE; CHAIN: A, 
B; 


3-ISOPROPYLMALATE 
DEHYDROGENASE; CHAIN: A, 
B; 




TUMOR SUPPRESSOR 
P16INK4A; CHAIN: NULL;


GA BINDING PROTEIN ALPHA;


SEQFOLD score




54.18 




50.68 






68.62 








PMF 
score 






0.42 




0.46 


0.57 






0.51 


0.22 


Verify 
score 






0.06 
PDB annotation


REGULATION/DNA) GABP ALPHA; 
GABPBETA1; COMPLEX
(TRANSCRIPTION

REGULATION/DNA), DNA-BINDING,
2 NUCLEAR PROTEIN, ETS DOMAIN,
ANKYRIN REPEATS,
TRANSCRIPTION 3 FACTOR


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) GABP ALPHA;
GABPBETA1; COMPLEX
(TRANSCRIPTION

REGULATION/DNA), DNA-BINDING,
2 NUCLEAR PROTEIN, ETS DOMAIN,
ANKYRIN REPEATS,
TRANSCRIPTION 3 FACTOR


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF


COMPLEX (KINASE/ANTI- 
ONCOGENE) CDK6; P16INK4A, 
MTS1; CYCLIN DEPENDENT
KINASE, CYCLIN DEPENDENT
KINASE INHIBITORY 2 PROTEIN,
CDK, INK4, CELL CYCLE, MULTIPLE
TUMOR SUPPRESSOR, 3 MTS1,
COMPLEX (KINASE/ANTI- 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 


Compound


CHAIN: A; GA BINDING 
PROTEIN BETA 1; CHAIN: B; 
DNA; CHAIN: D, E; 


GA BINDING PROTEIN ALPHA; 
CHAIN: A; GA BINDING 
PROTEIN BETA 1; CHAIN: B; 
DNA; CHAIN: D, E; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


P19INK4D CDK4/6 INHIBITOR; 
CHAIN: NULL; 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; MULTIPLE 
TUMOR SUPPRESSOR; CHAIN: 


CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


SEQFOLD score














PMF 
score 




1.00


0.11 


0.96 


0.62 


0.15 


Verify 
score 




0.43 


0.05 


0.22 


0.52 


0.18 
PDB annotation


KINASE, CELL CYCLE 2 CONTROL,
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 CONTROL, 
ALPHA/BETA, COMPLEX 
(INHIBITOR PROTEIN/KINASE) 


HORMONE/GROWTH FACTOR P18-
INK4C; CELL CYCLE INHIBITOR,
P18INK4C, TUMOR, SUPPRESSOR,
CYCLIN- 2 DEPENDENT KINASE,
HORMONE/GROWTH FACTOR 


HORMONE/GROWTH FACTOR P18-
INK4C; CELL CYCLE INHIBITOR,
P18INK4C, TUMOR, SUPPRESSOR,
CYCLIN- 2 DEPENDENT KINASE,
HORMONE/GROWTH FACTOR 


HORMONE/GROWTH FACTOR P18-
INK4C; CELL CYCLE INHIBITOR,
P18INK4C, TUMOR, SUPPRESSOR,
CYCLIN- 2 DEPENDENT KINASE,
HORMONE/GROWTH FACTOR 


SIGNALING PROTEIN HELIX-TURN-
HELIX, ANKYRIN REPEAT


METAL BINDING PROTEIN ZINC-
BINDING MODULE, ANKYRIN
REPEATS, METAL BINDING 
PROTEIN 


CELL CYCLE INHIBITOR P18-
INK4C(INK6); CELL CYCLE
INHIBITOR, P18-INK4C(INK6), 
ANKYRIN REPEAT, 2 CDK 4/6 


Compound




CYCLIN-DEPENDENT KINASE 
6; CHAIN: A; P19INK4D; CHAIN: 
B; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A; 


CYCLIN-DEPENDENT KINASE 
4 INHIBITOR B; CHAIN: A; 


PYK2-ASSOCIATED PROTEIN 
BETA; CHAIN: A; 


CYCLIN-DEPENDENT KINASE 
6 INHIBITOR; CHAIN: A, B; 


SEQFOLD score


















PMF 
score 
0.23


0.11 


0.05 


0.33 


0.69 


0.71 


Verify 
score 
PDB annotation


ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 


Compound


OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E;
CONSENSUS ZINC FINGER 


SEQFOLD score
















98.05 


PMF 
Verify
PDB annotation


(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2
TRANSCRIPTION INITIATION, ZINC
FINGER PROTEIN


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, ZINC
FINGER PROTEIN


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2
TRANSCRIPTION INITIATION, ZINC 
FINGER PROTEIN 
COMPLEX (TRANSCRIPTION 


Compound


CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F;


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F;

YY 1 ; CHAIN: C; ADENO- 


SEQFOLD score
PDB annotation


REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 


Compound


ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 


SEQFOLD score












PMF 
score 
0.86
Verify
PDB annotation


INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX
(TRANSCRIPTION
REGULATION/DNA)


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1;
TRANSCRIPTION INITIATION,
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN


Compound


CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA;
CHAIN: A, B;


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


SEQFOLD score




86.32 








PMF 
score 
Verify
PDB annotation


RECOGNITION, 3 COMPLEX 

(TRANSCRIPTION 

REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YY1, ZINC 2
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ADR1, ZINC FINGER, NMR


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ADR1, ZINC FINGER, NMR


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING


Compound




YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YY1; CHAIN: C; ADENO-
ASSOCIATED VIRUS P5
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


ADR1; CHAIN: NULL;


ADR1; CHAIN: NULL;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D;


ZINC FINGER PROTEIN GLI1;


SEQFOLD score
















93.76


PMF 
PDB annotation


PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA)


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA)


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX (DNA- 
BINDING PROTEIN/DNA)




HEXAMERIZATION DOMAIN 
HEXAMERIZATION DOMAIN, 
ATPASE, TRANSPORT


HEXAMERIZATION DOMAIN
HEXAMERIZATION DOMAIN,
ATPASE, TRANSPORT


CHAPERONE HSLV; HSLU
CHAPERONE, HSLVU, CLPQY, AAA-
ATPASE, ATP-DEPENDENT 2
PROTEOLYSIS, PROTEASOME


CELL CYCLE CDC6P; CDC6, CDC18,
ORC1, AAA PROTEIN, DNA
REPLICATION INITIATION 2
FACTOR, CELL CYCLE CONTROL 
FACTOR 


CHAPERONE AAA-ATPASE, CLPY, 
ATP-DEPENDENT PROTEOLYSIS


CHAPERONE AAA-ATPASE, CLPY, 
ATP-DEPENDENT PROTEOLYSIS


TRANSFERASE CK; NUCLEOTIDE 
MONOPHOSPHATE KINASE, 
TRANSFERASE 


Compound


CHAIN: A; DNA; CHAIN: C, D; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 


ZINC FINGER PROTEIN GLI1;
CHAIN: A; DNA; CHAIN: C, D; 




N-ETHYLMALEIMIDE- 
SENSITIVE FUSION PROTEIN; 
CHAIN: A; 


N-ETHYLMALEIMIDE- 
SENSITIVE FUSION PROTEIN; 
CHAIN: A; 


HEAT SHOCK PROTEIN HSLV; 
CHAIN: A, B, C, D; HEAT 
SHOCK PROTEIN HSLU; 
CHAIN: E, F; 


CELL DIVISION CONTROL 
PROTEIN 6; CHAIN: A, B; 


HEAT SHOCK PROTEIN HSLU; 
CHAIN: A; 


HEAT SHOCK PROTEIN HSLU; 
CHAIN: A; 


CYTIDINE MONOPHOSPHATE 
KINASE; CHAIN: A; 


SEQFOLD score
PDB annotation


IMMUNOGLOBULIN, FAB 
FRAGMENT, HUMANISATION 


COMPLEX (VIRAL 
CAPSID/IMMUNOGLOBULIN) HIV-1 
CA, HIV CA, HIV P24, P24; FAB, FAB 
LIGHT CHAIN, FAB HEAVY CHAIN 
COMPLEX (VIRAL 
CAPSID/IMMUNOGLOBULIN), HIV, 
CAPSID PROTEIN, 2 P24 


COMPLEX (MHC/VIRAL 
PEPTIDE/RECEPTOR) HLA A2 
HEAVY CHAIN; COMPLEX 
(MHC/VIRAL PEPTIDE/RECEPTOR) 


COMPLEX (MHC/VIRAL 
PEPTIDE/RECEPTOR) HLA A2 
HEAVY CHAIN; COMPLEX 
(MHC/VIRAL PEPTIDE/RECEPTOR) 


RECEPTOR T CELL RECEPTOR 1BEC
14 


IMMUNE SYSTEM IMMUNE 
SYSTEM 


IMMUNOGLOBULIN FAB, 
ANTIBODY, ANTIGEN, HIV-1, P24, 
CA 




Compound



HUMAN IMMUNODEFICIENCY 
VIRUS TYPE 1 CAPSID CHAIN: 
A, B; ANTIBODY FAB25.3 
FRAGMENT; CHAIN: H, K, L, M; 


HLA-A 0201; CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
TAX PEPTIDE; CHAIN: C; T 
CELL RECEPTOR ALPHA; 
CHAIN: D; T CELL RECEPTOR 
BETA; CHAIN: E; 


HLA-A 0201; CHAIN: A; BETA-2 
MICROGLOBULIN; CHAIN: B; 
TAX PEPTIDE; CHAIN: C; T 
CELL RECEPTOR ALPHA; 
CHAIN: D; T CELL RECEPTOR 
BETA; CHAIN: E; 


14.3.D T CELL ANTIGEN
RECEPTOR; 1BEC 5 CHAIN:
NULL; 1BEC 6


CHIMERIC GERMLINE 
PRECURSOR OF OXY-COPE 
CHAIN: L; CHIMERIC 
GERMLINE PRECURSOR OF 
OXY-COPE CHAIN: H; 


IMMUNOGLOBULIN LIGHT 
CHAIN; CHAIN: L; 
IMMUNOGLOBULIN HEAVY 
CHAIN; CHAIN: H; 


IMMUNOGLOBULIN FAB


SEQFOLD score






301.81 




273.44 








PMF 

score 
Verify
PDB annotation






IMMUNOGLOBULIN INTACT 
IMMUNOGLOBULIN V REGION C 
REGION IMMUNOGLOBULIN


IMMUNOGLOBULIN INTACT 
IMMUNOGLOBULIN, V REGION, C 
REGION, HINGE REGION


IMMUNOGLOBULIN 
IMMUNOGLOBULIN


IMMUNOGLOBULIN VARIABLE 
HEAVY (VH) DOMAIN, VARIABLE 
LIGHT (VL) ANTIBODY FRAGMENT, 
MULTIVALENT ANTIBODY, 
DIABODY, DOMAIN 2 SWAPPING, 
IMMUNOGLOBULIN


RECEPTOR TCR; T-CELL,
RECEPTOR, TRANSMEMBRANE, 
GLYCOPROTEIN, SIGNAL 


CHAPERONE AAA-ATPASE, CLPY, 
ATP-DEPENDENT PROTEOLYSIS 


GENE REGULATION POZ DOMAIN; 
PROTEIN-PROTEIN INTERACTION 
DOMAIN, TRANSCRIPTIONAL 2 


Compound


FRAGMENT OF HUMANIZED 
ANTIBODY 4D5, VERSION 4 
1FVD 3


COMPLEX 

(ANTIBODY/BINDING 
PROTEIN) IGG1 FAB
FRAGMENT COMPLEXED
WITH PROTEIN G (DOMAIN III)
1IGC 5 PROTEIN G,
STREPTOCOCCUS 1IGC 15


IGG2A INTACT ANTIBODY - 
MAB231; CHAIN: A, B, C, D 


IGG1 INTACT ANTIBODY
MAB61.1.3; CHAIN: A, B, C, D 


NIG9 (1GG1=LAMBDA=); 
CHAIN: L, H; 


SINGLE-CHAIN ANTIBODY 
FRAGMENT; CHAIN: A, C; 


ALPHA, BETA T-CELL 
RECEPTOR CHAIN: A, B; 


HEAT SHOCK PROTEIN HSLU; 
CHAIN: A; 


PROMYELOCYTIC LEUKEMIA 
ZINC FINGER PROTEIN PLZF; 
CHAIN: A; 


SEQFOLD score














277.99 






PMF 
score 




1.00 


0.98 


0.82 
1.00




0.04 
Verify
score 




0.22 


-0.00 


0.07 


0.21 


0.22




-0.34 


0.34 


Psi Blast


8e-71


0.0094


4.8e-24


START AA


CHAIN ID


1buo


1885


1886



WO 02/070539 PCT/US02/05095



PDB annotation 


REPRESSOR, ZINC-FINGER 
PROTEIN, X-RAY 
CRYSTALLOGRAPHY, 3 PROTEIN 
STRUCTURE, PROMYELOCYTIC 
LEUKEMIA, GENE REGULATION 


GENE REGULATION POZ DOMAIN; 
PROTEIN-PROTEIN INTERACTION 
DOMAIN, TRANSCRIPTIONAL 2 
REPRESSOR, ZINC-FINGER 
PROTEIN, X-RAY 
CRYSTALLOGRAPHY, 3 PROTEIN 
STRUCTURE, PROMYELOCYTIC 
LEUKEMIA, GENE REGULATION 








RNA-BINDING PROTEIN/RNA TRA 
PRE-MRNA; SPLICING 
REGULATION, RNP DOMAIN, RNA 
COMPLEX 


GENE REGULATION/RNA POLY(A)
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 
RNA BINDING PROTEIN RNA-
BINDING DOMAIN


STRUCTURAL PROTEIN PROTEIN 
C23; RNP, RBD, RRM, RNA BINDING 
DOMAIN, NUCLEOLUS


OUTER MEMBRANE PROTEIN 
OSMOPORIN; OUTER MEMBRANE 
PROTEIN, NON-SPECIFIC PORIN, 
OSMOPORIN, 2 BETA-BARREL, 
TRANSMEMBRANE




HYDROLASE DOUBLE BETA 
BARREL, BACTERIAL SERINE 
PROTEASE




RNA-BINDING DOMAIN RNA- 
BINDING DOMAIN, ALTERNATIVE 


RNA-BINDING PROTEIN SPLICING, 
U2 SNRNP, RBD, RNA-BINDING 
PROTEIN


RNA BINDING DOMAIN RNA 
BINDING DOMAIN, RBD, RNA 
RECOGNITION MOTIF, RRM, 2


Compound


OMPK36; CHAIN: A, B, C;


OUTER MEMBRANE PROTEIN 
PHOSPHOPORIN (PHOE) 1PHO 3


ALPHA-LYTIC PROTEASE; 
CHAIN: A; 


RNA-BINDING PROTEIN SEX- 
LETHAL PROTEIN (C- 
TERMINUS, OR SECOND RNA- 
BINDING DOMAIN 1SXL 3
(RBD-2), RESIDUES 199 - 294
PLUS N-TERMINAL MET) 1SXL
4 (NMR, 17 STRUCTURES) 1SXL
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SEX-LETHAL PROTEIN; CHAIN: 
NULL; 
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SUBUNIT; CHAIN: A; \ 


SEX-LETHAL; CHAIN: A, B, C; 


SEQFOLD score






















PMF 
score 




1.00


0.65 


-0.20 


ON 

i 


ON 

d 
i 


0.54 


00*1 


0.99 


0.55 


Verify 
score 




0.45 


0.42 


0.70 


0.77 


0.89


0.07 


0.47 


0.44 


0.01 


Psi Blast 




»o 
o 

<u 


in 
o 

<b 
m 


ON 

o 

1 


o 
<£> 
t< 


00 

o 

<L> 

in 


0.00015 


0.00015 


IT) 
O 
1 

ro 


6e-05 






o 


*** 


o 


do 
M 


3 

N 




? 


o 

5 


rn 
* 


START AA




*s 


"0 




t 

- 1 c 


^ i 

■n 


n < 
2 ' 




N ( 
* t 


<o 

■n 


CHAIN ID




< « 


< 


< 




< 




« 


< 


< 


Be 


c 
n 


N 


? 


l 

o 

-4 r 


O t 
C < 

a < 




" c 


^ t 


i 

* r 


2 

o 


leg 


r 

c 
c 


^ r 

O o 

o 0 


t 

o e 

0 0 

1 m 


8 5 

0 o 

H 


* f 

0 0 
0 0 


* r 

0 0 
0 0 


8 s 


0 0 
3 6 


r 

0 0 
0 0 


0 
0 






PDB annotation 


SPLICING INHIBITOR, 
TRANSLATIONAL INHIBITOR, SEX 3 
DETERMINATION, X CHROMOSOME 
DOSAGE COMPENSATION 




RNA-BINDING PROTEIN/RNA TRA
PRE-MRNA; SPLICING 
REGULATION, RNP DOMAIN, RNA 
COMPLEX 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


GENE REGULATION/RNA POLY(A)
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


RNA BINDING PROTEIN RNA-
BINDING DOMAIN 


STRUCTURAL PROTEIN PROTEIN 
C23; RNP, RBD, RRM, RNA BINDING 
DOMAIN, NUCLEOLUS 


OUTER MEMBRANE PROTEIN 
OSMOPORIN; OUTER MEMBRANE 
PROTEIN, NON-SPECIFIC PORIN, 
OSMOPORIN, 2 BETA-BARREL, 
TRANSMEMBRANE




HYDROLASE DOUBLE BETA


Compound
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| ALPHA-LYTIC PROTEASE; 


SEQFOLD score






















PMF 

score 






1.00 


1.00


0.89 


1.00


0.65 


-0.20 


OS 

o 

• 


o 


Verify 
score 






0.53 


0.41 


0.41 


0.45 


o 


0.70 


0.77 


ON 

oo 
o 


Psi Blast 






in 
o 

k 


3e-05 


2 

co 


in 
o 

o 
m 


in 
o 
6 

ro 


OS 

o 

1 

m 


o 
■ 

«n 


OO 

o 

1 

m 


s < 

■«< 






1371 


1371 


1371 


1371 


1378 


o 


oo 
CN 


O 
CN 


START AA






1303 


1303 


1301 


1303 


1292 


*—* 




r- 

CO 


<o 

u ^ 






< 


< 


PQ 


< 


< 


< 




< 


la 






«— < t 


J 


Icvj 


& 




losm 


Ipho 


cr 
cr 




! 




1888 


1888 


1888 


1888 


1888 


1888 


00 

oo 
oo 


1888 






PDB annotation


BARREL, BACTERIAL SERINE 
PROTEASE 




RNA-BINDING DOMAIN RNA- 
BINDING DOMAIN, ALTERNATIVE 
SPLICING 


RNA-BINDING PROTEIN SPLICING,
U2 SNRNP, RBD, RNA-BINDING 
PROTEIN 


RNA BINDING DOMAIN RNA 
BINDING DOMAIN, RBD, RNA 
RECOGNITION MOTIF, RRM, 2 
SPLICING INHIBITOR, 
TRANSLATIONAL INHIBITOR, SEX 3 
DETERMINATION, X CHROMOSOME 
DOSAGE COMPENSATION




RNA-BINDING PROTEIN/RNA TRA 
PRE-MRNA; SPLICING 
REGULATION, RNP DOMAIN, RNA 
COMPLEX 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM, 
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


Compound


CHAIN: A; 


RNA-BINDING PROTEIN SEX- 
LETHAL PROTEIN (C- 
TERMINUS, OR SECOND RNA- 
BINDING DOMAIN 1SXL 3
(RBD-2), RESIDUES 199 - 294
PLUS N-TERMINAL MET) 1SXL
4 (NMR, 17 STRUCTURES) 1SXL
5 


SEX-LETHAL PROTEIN; CHAIN: 
NULL;


SPLICING FACTOR U2AF 65 KD 
SUBUNIT; CHAIN: A; 


SEX-LETHAL; CHAIN: A, B, C; 
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PDB annotation 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; RRM,
PROTEIN-RNA COMPLEX, GENE 
REGULATION/RNA 


RNA BINDING PROTEIN RNA- 
BINDING DOMAIN 


STRUCTURAL PROTEIN PROTEIN 
C23; RNP, RBD, RRM, RNA BINDING 
DOMAIN, NUCLEOLUS 


OUTER MEMBRANE PROTEIN 
OSMOPORIN; OUTER MEMBRANE 
PROTEIN, NON-SPECIFIC PORIN, 
OSMOPORIN, 2 BETA-BARREL, 
TRANSMEMBRANE 




HYDROLASE DOUBLE BETA 
BARREL, BACTERIAL SERINE 
PROTEASE 




RNA-BINDING DOMAIN RNA- 
BINDING DOMAIN, ALTERNATIVE 
SPLICING 


RNA-BINDING PROTEIN SPLICING, 
U2 SNRNP, RBD, RNA-BINDING 


Compound


POLYADENYLATE BINDING
PROTEIN 1; CHAIN: A, B, C, D,
E, F, G, H; RNA (5'-
R(*AP*AP*AP*AP*AP*AP*AP*
AP*AP*AP*A)-3'); CHAIN: M, N,
O, P, Q, R, S, T;


HU ANTIGEN C; CHAIN: A; 


NUCLEOLIN RBD2; CHAIN: A; 


OMPK36; CHAIN: A, B, C; 


OUTER MEMBRANE PROTEIN 
PHOSPHOPORIN (PHOE) 1PHO 3


ALPHA-LYTIC PROTEASE; 
CHAIN: A;


RNA-BINDING PROTEIN SEX- 
LETHAL PROTEIN (C- 
TERMINUS, OR SECOND RNA- 
BINDING DOMAIN 1SXL 3
(RBD-2), RESIDUES 199 - 294
PLUS N-TERMINAL MET) 1SXL
4 (NMR, 17 STRUCTURES) 1SXL
5 


SEX-LETHAL PROTEIN; CHAIN: 
NULL; 


SPLICING FACTOR U2AF 65 KD 
SUBUNIT; CHAIN: A; 
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PROTEIN 


RNA BINDING DOMAIN RNA 
BINDING DOMAIN, RBD, RNA 
RECOGNITION MOTIF, RRM, 2 
SPLICING INHIBITOR, 
TRANSLATIONAL INHIBITOR, SEX 3 
DETERMINATION, X CHROMOSOME 
DOSAGE COMPENSATION 




RECEPTOR RECEPTOR, SIGNAL
TRANSDUCER OF IL-6 TYPE
CYTOKINES, THIRD 2 N-TERMINAL
DOMAIN, TRANSMEMBRANE,
GLYCOPROTEIN


CONNECTIN A71, CONNECTIN; 
TITIN, CONNECTIN, FIBRONECTIN 
TYPE III 


CONNECTIN A71, CONNECTIN; 
TITIN, CONNECTIN, FIBRONECTIN
TYPE III 




HEPARIN AND INTEGRIN BINDING 
HEPARIN AND INTEGRIN BINDING 


STRUCTURAL PROTEIN INTEGRIN, 
HEMIDESMOSOME, FIBRONECTIN, 
CARCINOMA, STRUCTURAL 2 


Compound




SEX-LETHAL; CHAIN: A, B, C; 




GP130; CHAIN: NULL; 
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TITIN; CHAIN: NULL; 
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MOLECULE DROSOPHILA 
NEUROGLIAN 
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CONTAINING THE ICFB 3 TWO 
AMINO PROXIMAL 
FIBRONECTIN TYPE III 
REPEATS ICFB 4 (RESIDUES 
610-814)) ICFB 5 


FIBRONECTIN; CHAIN: A; 


INTEGRIN BETA-4 SUBUNIT; 
CHAIN: A, B; 
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PDB annotation 


PROTEIN 


STRUCTURAL PROTEIN TENASCIN, 
FIBRONECTIN TYPE-III, HEPARIN,
EXTRACELLULAR 2 MATRIX, 
ADHESION, FUSION PROTEIN, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TENASCIN, 
FIBRONECTIN TYPE-III, HEPARIN, 
EXTRACELLULAR 2 MATRIX, 
ADHESION, FUSION PROTEIN, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TENASCIN, 
FIBRONECTIN TYPE-III, HEPARIN, 
EXTRACELLULAR 2 MATRIX, 
ADHESION, FUSION PROTEIN, 
STRUCTURAL PROTEIN 






RNA BINDING PROTEIN/RNA 
XLRBPA; PROTEIN-RNA COMPLEX 
DOUBLE STRANDED RNA, 
PROTEIN-RNA 2 INTERACTIONS, 
RNA-BINING PROTEIN, RNA 
BINDING PROTEIN/RNA 


SH3 PROTOTYPE WWPROTOTYPE, 
PROTEIN DESIGN 


CELL CYCLE/RNA DSRBDm; NMR
STRUCTURE, PROTEIN/RNA, 
PROTEIN DSRBD, DROSOPHILA, 
RNA 2 HAIRPIN 


ISOMERASE PIN1; PEPTIDYL-


Compound




TENASCIN; CHAIN: A, B; 


TENASCIN; CHAIN: A, B;


TENASCIN; CHAIN: A, B; 


GLYCOPROTEIN FIBRONECTIN 
(TENTH TYPE III MODULE) 
(NMR, 36 STRUCTURES) 1TTF 3




DOUBLE STRANDED RNA 
BINDING PROTEIN A; CHAIN: 
A, B; RNA (5'- 

R(*GP*GP*CP*GP*CP*GP*CP*G 
P*CP*C)-3'); CHAIN: C, D, E, G; 


WWPROTOTYPE; CHAIN: A; 


MATERNAL EFFECT PROTEIN 
(STAUFEN); CHAIN: A; 
STAUFEN DOUBLE-STRANDED 
RNA BINDING DOMAIN; 
CHAIN: B; 


PEPTIDYL-PROLYL CIS-TRANS 
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D score 
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\ DOMAIN STAUFEN ISTU 13 




Compound


ISOMERASE NIMA- CHAIN: B; 
Y(SEP)PT(SEP)S PEPTIDE; 
CHAIN: C; 


MATERNAL EFFECT PROTEIN 
STAUFEN; ISTU 4 
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D score 
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TABLE 6 



SEQ ID NO: 


Position of Signal in 
Amino Acid Sequence


maxS (Maximum score) 


meanS (Mean score) 
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24 
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0.738 


950 


24 


0.976


0.913 


951 


22 


0.982 


0.872 


952 


41 


0.937 


0.604 


953 


23 


0.896 


0.747 


954 


16 
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0.737 
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45 


0.964 


0.657 
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0.855 
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31 
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23 
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36 
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0.799 
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11 
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16 
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ZJ 


A 07ft 

u.y/o 


ft 807 


1018 


45


A CltLA 


A AS7 i 


1019 


28 


A QC£ 


A ftA4 
V.0U4 


1020 


33 


0.948


A 77 A 

U.7/0 


1021 


18 


0.930


A ft70 

U.O/9 


1022 


26 


0.947 


A CA/l 


1023 


31 


0.991 


a no« 
U.9zj 


1024 


41 


0.942 


A *7A1 


1025 


36 


0.910 


A 7/tQ 

U./49 


1026 


24 


A AOO 


A A1A 


1027 


27 


0.962


A flQtL 

U.oyo 


1028 


23 


0.965


A £Ao 


1029 


22 


0.962 


AGIO 

U.919 


1030 


24


0.943


A CIO ! 


1031


34 


0.973 


0.857


1032 


22 


0.947


A A*77 
11.0/ / 


1033 


25 


A OOA 

U.059 


A *71 0 


1034 


27 


a cscn 
U.90Z 


A O^ft 

U.ojO 


1035 


19


A Q<0 


a ono 


1036 


39 


A QQd 


A 000 

u.yzz 


1037 


28 


A QOO 
U.95Z 


A Q1A I 

u.y/4 


1 A"J O 


44 


A 07A 


A ftftO 
U.OOZ 


1 A*5 A 


Z9 


A QQA 
U.9o4 


ft 7A1 


1 A/I A 

1U4U 


oo 

ZZ 


A 07 A 

u.y/4 


ft 70ft 

u. /yo 


1 A/1 1 

1041 


Z9 


A 00 


ft 77S 




Ol 

ZJ 




ft RIO 


11D4 


10 


ft Qs1 

U.7J 1 




1 A*N^ 


1 A 
10 


ft 007 


0.827 


IfKft 
IUjO 


ZJ 


ft 040 


ft R71 


1 A^Q 


Oft 
Zo 


ft QSft 


ft R4R 


1 Aftl 


04. 
Z*+ 


ft OftS 


ft 8Q1 

U.07 1 


IftftO 


OS 
Zj 


ft 04ft 


ft Rfift 


1 Aft7 


1ft 
JO 


ft 0ft4 


0 ft4R 


1074 


32 


0.941 


0.669 


1076 


17 


0.995 


0.974 


1083 


18 


0.968 


0.799 


1089 


24 


0.882 


0.599 


1091 


38 


0.991 


0.904 


1094 


29 


0.963 


0.888 


1096 


19 


0.892 


0.715 


1101 


34 


0.921 


0.652 


1102 


20 


0.951 


0.839 


1106 


31 


0.921 


0.659 


1110 


36 


0.992 


0.917 
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SEQ III NO: 


Position of Signal in
Amino Acid Sequence


maxS (Maximum score) 


meanS (Mean score) 


1 1Z4 


30


0.989 


0.910 


1 19^ 
1 1ZD 


Zo 


0.974 


0.851 


1 I07 
1 1Z / 


25 


0.937 


0.812 


i 13U 


3U 


0.978 


0.786 


1111 
1 1 3 I 


27


0.987 


0.879 


1135


23


0.923 


0.655 


1144


28 


0.980 


0.893 


1 14/ 


27 


0.963 


0.833 


1150


24 


0.976 


0.913 


1151


24 


0.988 


0.967 


1152


22 


0.968 


0.831 


1154


21 


0.952 


0.822 


1155


42 


0.939 


0.682 


1157


22 


0.984 


0.928 


1158


22 


0.984 


0.928 


1165 


21 


0.942 


0.713 


1167 


18 


0.922 


0.838 


1170 


18 


0.988 


0.944 


1174


18 


0.975 


0.958 


1175


18 


0.975 


0.958 


1176 


18 


0.975 


0.958 


1177


48 


0.989 


0.889 


1 1 TO 
11/0 


23 


0.996 


0.936 


1 1 OA 

lloU 


16 


0.967 


0.933 


1193


15 


0.948 


0.907 


1195


27 


0.936 


0.689 


1 t cue 
1190 


42 


0.978 


0.750 


1 \y/ 


15 


0.977 


0.966 


1199 


26 


0.976 


0.875 


i inn 
1ZUU 


28 


0.973 


0.822 


i 

1ZU1 


28 


0.990 


0.925 


i one. 

1ZU3 


22


0.982 


0.933 


IZUo 


15 


0.986 


0.919 


1 orv7 


27 


0.994 


0.900 


101C 

JZ1 :> 


21 


0.989 


0.871 


1 OOA 

1ZZU 


28 


0.976 


0.653 


199£ 
1ZZO 


20


0.987 


0.916 


1 99Q 

izzy 


30 


0.996 


0.894 


1 ajU 


41


0.983 


0.791 


1914. 


on 
ZU 


0.978 


0.893


191Q 


1 n 
1 / 


0.953 


0.784 


1 0/1 A 
1Z4U 


25 


0.950 


0.897 


19A1 
1 Z*rl 


on 
ZU 


0.974 


0.912 


1949 


ID 


0.974 


0.817 




3D 


0.973 


0.795 




ZZ 


0.982 


0.872 


1251 


18 


0.983 


ft Q97 


1254 


20 


0.934 


0.828 


1255 


16 


0.952 


0.807 


1256 


19 


0.904 


0.656 


1260 


35 


0.957 


0.640 


1261 


35


0.957 


0.640 


1263 


33 


0.953 


0.707 


1264 


24 


0.981 


0.884


1265 


24 


0.987 


0.914 


1266


21


0.977 


0.905 
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SEQ ID NO: 


Position of Signal in 
Amino Acid Sequence


maxS (Maximum score)


meanS (Mean score)


1267 


24 


0.978 


0.911 


1268 


18 


0.984 


0.958 


1269 


18 


0.984 


0.958 


1270 


24 


0.989 


0.922 


1271 


18 


0.984 


0.956 


1272 


18 


0.986 


0.965


1273 


18 


0.986 


0.965 


1276 


32 


0.956 


0.706 


1277 


48 


0.983 


0.616 


1278 


20 


0.965 


0.878 


1282 


16 


0.921 


0.828 


1283 


20 


0.937 


0.700 


1293 


19 


0.995 


0.971 


1294 


19 


0.995 


0.971 


1296 


20 


0.926 


0.751 


1302 


29 


0.981 


0.937 


1310 


27 


0.977 


0.849 


1314 


24 


0.977 


0.845 


1315 


23 


0.990 


0.833 


1317 


25 


0.971 


0.894 


1318 


16 


0.961 


0.916 


1319 


41 


0.980 


0.681


1328 


17 


0.977 


0.921 


1329 


26 


0.993 




1331 


25 


0.986 


0.939 


1333 


33 


0.977 


0 R1 1 

u.Ol i 


1340 


43 


0.992 




1341 


20 


0.943 


0.882 


1343 


20 


0.995 


0.933 


1344 


26 


0.938 


0.663 


1346 


21 


0.955 


0.767 


1347 


19 


0.920 


0.692 


1348 


41 


0.937 


0.604 


1349 


41 


0.937 


0.604 


1353 


19 


0.986 


0.961 


1357 


41 


0.923 


0.559 


1359 


25 


0.973 


0.853 


1361 


20 


0.935 


0.817 


1364 


29 


0.958 


0.637 


1365 


23 


0.991 


0.740 


1368 


29 


0.986 


0.848 


1369 


18 


0.997 


0.979 


1373 


16 


0.979 


0.964 


1375 


25 


0.980 


0.952 


1376 


37 


0.989 


0.822 


1378 


37 


0.984 


0.878 


1379 


42 


0.978 


0.698 


1380 


18 


0.969 


0.913 


1383 


31


0.981 


0.856


1386 


36 


0.987 


0.595 


1387 


27 


0.955 


0.786 


1389 


17 


0.915 


0.825 


1390 


34 


0.954 


0.783 


1393 


19 


0.941 


0.839 


1396 


27 


0.944 


0.778 


1397 


20 


0.941


0.734
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SEQ ID NO: 


Position of Signal in 
Amino Acid Sequence


maxS (Maximum score)


meanS (Mean score) 


1404 


22 


0.982 


0.872 


1427 


19 


0.993 


0.931 


1428 


22 


0.992 


0.807 


1430 


22 


0.918 


0.716 


1433 


38 


0.994 


0.887 


1436 


24 


0.914 


0.588 


1438 


28 


0.990 


0.919 


1442 


26 


0.990 


0.969 


1446 


36 


0.954 


0.817 


1450 


23 


0.896 


0.747 


1451 


23 


0.969 


0.855 


1456 


24 


0.985 


0.932 


1462 


23 


0.985 


0.951 


1464 


27 


0.985 


0.927 


1465 


21 


0.960 


0.649 


1467 


41 


0.990 


0.922 


1468 


20 


0.991 


0.954 


1469 


20 


0.991 


0.954 


1473 


21 


0.975 


0.909 


1478 


11 


0.900 


0.758 


1493 


18 


0.933 


0.634 


1495 


22 


0.899 


0.639 


1496 


40 


0.953 


0.668 


1509 


16 


0.881 


0.607 


1516 


36 


0.924 


0.590 


1517 


25 


0.919 


0.718 


1518 


28 


0.928 


0.590 


1519 


27 


0.967 


0.872 


1526 


25 


0.998 


0.934 


1527 


30 


0.973 


0.829 


1547 


20 


0.937 


0.728 


1551 


30 


0.962 


0.801 


1552 


17 


0.925 


0.779 


1557 


20 


0.981 


0.910 


1562 


17 


0.977 


0.921 


1565 


25 


0.938 


0.677 


1568 


22 


0.975 


0.822 


1577 


34 


0.934 


0.552 


1578 


28 


0.946 


0.799 


1583 


15 


0.954 


0.725 


1584 


30 


0.938 


0.808 


1592 


24 


0.973 


0.910


1597 


24 


0.920 


0.596


1600 


29 


0.968 


0.769 


1604 


28 


0.975 


0.926 


1613 


25 


0.977 


0.776 


tiCIO 

lolo 


A 1 


U.iroO 




1627 


24 


0.915 


0.578 


1630 


24 


0.950 


0.737 


1631 


28 


0.987 


0.785 


1632 


19 


0.890 


0.552 


1633 


22 


0.968 


0.934 


1634 


22 


0.968 


0.934 


1635 


23 


0.965 


0.883 


1636 


23 


0.965 


0.883 


1638 


26 


0.896 


0.615 
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ob\2 IV INU: 


Position of Signal in
Amino Acid Sequence


maxS (Maximum score)


meanS (Mean score)


1641






UiUUJ 


1044 


7Q 


0 971 


U.U7t 




44


0 007 


n S76 

U.J /o 


io4y 


*o 


0 Q64 
v/.yo*f 


n 6^7 

u.oj/ 


IOjU 




fl 06/1 


n 6^7 
U.Oj/ 


10 J J 


i7 


0.96 S 


\j.y**/ 


IOjj 


7B 
AO 


0 060 
v.you 


U.Ol// 


iOJ/ 


71 
Jl 


A 077 


A 77A 




Zo 


A 0<A 
U.yjO 


A 6 A 4 


1000 


4/ 


A 09.5 
u.yoj 


A 646 
v.040 


1007 


1 o 
ly 


A OOA 

u.yyU 


A 0/16 


1677 


ah 
4/ 


A QQ< 


A 


1 CO A 

loo4 


1 Q 

io 


A 07A 


A 670 

u.o/y 


1687 


25 


A OOO 

U.yyz 


A Q/iB 


1059 


OiT 

zo 


a a An 
U.y4/ 


A 'CO/l 

U.jy4 


1693 


22 


0.963 


A 0<O 


1695


26


A OC£ 

U.yjO 


A C7A 


1703 


26


0.990


A ft CO 


1706 


40


0.987


A OH 

u.yi / 


1707 


26 


0.985


a on 
U.yl / 


1710 


36 


a nni 

O.yyl 


A 0CO 
0.600 


1713 


24 


0.887


A sci J 


1714 


23 


0.995


0.974


1715 


18 


0.997 


0.977


1716 


22 


0.968 


0.934


1718 


26 


0.974


0.730 


1721 


18 


0.983


0.939


1731 


24 


0.988


A OIO 


1743


30


a mn 
O.yjy 


A £1Ci 

U.ojy 


1744 


ZD 


a no a 


A nAC 

0./40 


1 /OS 


OO 
2j 


A O/CC 

U.yOJ 


A £07 


1 /jo 


22 


A OlCO 

U.yoZ 


A Q1Q 




21


A QQQ 

U.yoo 


A 01 1 


1 /oU 


30 


A OCA 

U.yoU 


A <^0 

V.jjy 


1 /by 


1 A 

10 


A QCA 

U.ooU 


A 7QA i 
U./oU 


1 771 

1 //I 


1 A 

14 


A OOO 

U.yZZ 


A £78 


i 777 
1 / / J 


5y 


A GfiO 

U.yoZ 


A 570 

u.oZy 


177R 


1A 
34 


A 077 


A 817 
U.Ol / 


1770 
l / /y 


76 


A G76 

u.y/o 


0 70d 


17R6 


1 

10 


n oi 5? 


0 6^1 


17R7 
i /o / 


JJ 


n ooi 

u.yy i 


n Rid 

U.OJt 


17R0 
i toy 


ZZ 


fi 047 


0 677 

V/.V7/ / 


1705 
i /yj 


OA 

ZH- 


0 061 
u.yoj 


n R6^ 


i /yu 


Jv 


0 067 
u.yo / 


0 7^8 

U. / JO 


1707 

1/7/ 


77 


0 076 
u.yzo 


V.Oxli 


1800 


25 


0.889 


0.718 


1805 


20 


0.995 


0.968 


1807 


27 


0.887 


0.642 


1810 


41 


0.975 


0.875 


1813 


26 


0.921 


0.620 


1826 


36 


0.951 


0.782 


1832 


19 


0.983 


0.888 


1833 


23 


0.971 


0.941 


1834 


40 


0.964 


0.560 


1839 


39 


0.942 


0.587 


1846 


16 


0.945 


0.737 
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SEQ ID NO: 


Position of Signal in 
Amino Acid Sequence


maxS (Maximum score)


meanS (Mean score)


1847 


24 


0.946 


0.593 


1848 


44 


0.974 


0.662 


1850 


26 


0.974 


0.730 


1851 


27 


0.952 


0.832 


1856 


25 


0.960 


0.642 


1866 


31 


0.956 


0.846 


1869 


16 


0.968 


0.921 


1884 


19 


0.984 


0.936 



TABLE 7 



SEQ ID 


Chromosomal location


1 


X 


2 


Xp21.2p-p11.2


3 


2 


4 


2 


5 


8 


6 


8 


7 


17 


10 


17 


13 


9 


14 


Xq24-q25 


15 


Xq24-q25 


16 


9p21 


17 


16 


19 


7q32 


20 


2 


21 


2 


22 


11 


23 


12q


2.1 


22q11


28 


6p 


29 


6p23 


31 


17q 


32 


12 


33 


2p23.3-q34 


34 


9q31-q32 


35 


19 


36 


19 


37 


19q13.3


38 


6p21.3 


39 


10q26.2-10q26.3 


40 


3 


42 


12 


43


6p21-p12


44 


13q12-13


46 


2 


47 


3 


48 


15q15


49 


22q13.33


50 


5q14-q22


51 


X 


52 


19cen-q13.2


53 


3p26 
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SEO ID 


Chromosomal location


54 


2p24.3 


55 


3p26 


56 


3p26 


57 


15 


60 


15 


62 


ln91 9-99 1 


65 


j j f~\ i y«/i 111 


66 


Xp11


67 


90nl 1 91-nH 1? 1 


69 


0 


70 


y 

A. 


71 


1 
4 


72 


tn21 9-n91 3 


73 


17n1 1 9 1 
i /pi l.Z 


74 


y 

A 


75 


10 


76 


o 

7 


80 


10 


83 


oqio 


85 


I J 


86 




R7 




88 


Cn99 nT2 




1 c 
1 J 


91 
7 i 




94 




95 


/ 


96 


lfln9< 1 


97 




98 




99 


J 


101 


1 C 
1 J 


102 


10 

i7 


103 


6n91 7 


104 


1 ipi j.j-pi 


105 


io 


107 


14rrt9 1-n^9 ? 

ltqjZ.l-LjlJZ.Z 


111 




112 


9 


114 


2q35


115 


22q11


116 


16 


117 


16 


118 


16q24.3 


120 


19 


122 


j 


123 


20 


124 


9 


125 


3 


126 


11 


127 


22q11.2


128 


20q11.2-12


129 


14 


131 


10q25.1 


133 


17p11.2


134 


20 


136 


4p16.3
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SEQ ID 


Chromosomal location


137 


12p13


138 


19 


139 


1p34.1-p32


140 


4 or 17


143 


Xq13.1-13.3


144 


3 


145 


3 


146 


5 


147 


9 


148 


11q13


149 


6 


150 


15 


151 


19 


152 


Xp11.21-11.23


153 


18q22-q23 


155 


16 


157 


4 


160 


1p36.23-p33


161 


9q22.2 


163 


4 


165 


3 


166 


17 


167 


6p21.3 


168 


16 


169 


9 


170 


19 


171 


15 


172 


2p25 


173 


22q11.21


174 


18q22


179 


6p21.3 


180 


15q14


181 


5q 


183 


5 


184 


11 


187 


11 


188 


3p21.3 


189 


12p13-qter


190 


12p13-qter


191 


12p13-qter


192 


12p13-qter


193 


2q34-q35 


194 


2q34-q35 


195 


10 


199 


7q11-q22


200 


7q22.1-7q31.33


201 


3 


202 


19q13.4


203 


3 


205 


Xq28 


206 


6 


208 


Xq26.2-27.2 


209 


4 


210 


4 


211 


1q31


212 


19q12-19q13.1


213 


6q23 
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CPA f f| 


Chromosomal location


Zlf 


10 


Zl J 


1 


01fi 
410 


lo32 2-34 2 


017 
Zl / 


g 


01 ft 

ZlO j 


11q13


ziy 


InOl-nOI 


zzu 




ZZl 


10 


2ZZ 


17 


001 
223 


1 
1 


224 


lypi j.j 


ZZ5 


IQnl 1 nl 1 


22o 


1 
1 


zz/ 


1 
I 


228 


1 
1 


229 


10 
1Z 


230 


D 


231 


Inll 


232 


1«Q1 1 rvOO 1 


233 


OOn1 1 01 

Zzqi I.Zj 


234 


ZzqiZ.i-qiz.J 


235 


1 
1 


236 


17 

1/ 


237 


1 K 

ID I 


238 


Zpii 


239 


17 
1 / 


24 1 


1 1 nl 1 i 
llplj 1 


242 


1 1 
1 1 


243 


/Iz-iOO nOA 

4qzz-qz4 


244 


10 
1Z 


245 


1Q 


246 


D 


247 


1/1 
14 


248 


1< 
10 


249 


Id. 


ZDU 


4 
*r 


ZD1 


10 


ZDZ 


0n90-n01 fir On 11 


ZDo 


nq 


ZD/ 


✓qj j*ljj < t. 1 


0^5 
ZD 5 


ixpici pi J.J 1 


ZOv 


g 


zol 


1 1n14. 
i iqi*r 


OAO 

zoz 


17 


o<i 
ZCO 


i£>qi j 


Z04 


iuqi j -q£i 


265 


16q13-q21


267 


6q26-27 


268 


9q12-q21.2


269 


9q12-q21.2


270 


9q12-q21.2


271 


19 


272 


1p34.1-35.3


273 


11 


274 


11 


275 


X 


277 


2 






SEQ ID 


Chromosomal location


278 


1$ 


280 


19q13.1-q13.2


282 


12 


284 


2p23.3-q24.3 


285 


11p15


286 


19 


287 


7 


288 


10 


290 


12 


293 


3q23-q24 


294 


1p34.1-35.3


295 


p22.2-31.1 


296 


3q22-q24 


297 


6 


299 


2p11


300 


16 


301 


10q24 


302 


10 


304 


12q22-q23 


305 


17pter-p11


307 


1p35-p34


308 


9 


309 


16 


310 


21 


311 


12p13


312 


1 


313 


1 


314 


17 


315 


1q42-q43


316 


6p21.32-22.1


317 


6p21.32-22.1


318 


6p21.32-22.1


319 


6p21.32-22.1


320 


6p21.32-22.1


321 


6p21.32-22.1


322 


6p21.32-22.1


323 


6p21.32-22.1




6p21.32-22.1


325 


6p21.32-22.1


326 


17 


327 


3 


328 


14 


332 


17 


335 


17 


336 


14q32 


337 


8q23 


338 


8q23 


339 


1 A.rt 11 1 nil O 

loqi i.i-qi 1.2. 


340 


8q22-q23


341 


16p13.3


342 


18 


343 


15 


344 


17 


345 


20q11.2-q13.1


346 


20q11.2-q13.1


347 


19q13.3


348 


19



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bEQID 


Chromosomal location


•3/10 


17n0<! 1 

l/q/j.l 




lo 




1 1 
11 


352 


1 

1 


353 


q25.1-31.1 


354 


11q14


355 


1q31


356


1q31


357 


18 


358 


3p 


">Cft 

359 


10 


360 


7q22 


361 


7q22 


362 


1q21.3


363 


3 


364 


18q12


365 


11p15


367 


14 


369 


1p36.21


371 


6p12.3-21.1


373 


14q13.1-14q21.3


374 


15 


375 


4 


376 


7q32-q34 


377 


7q32-q34


378 


20q13.1-q13.2


381 


13 


382 


1q21.2-22


383 


16 


384 


12 


385 


20q13.1


386 


16 


387 


8q21.3-q22.1 


388


11 


389 


15q22.1 


390 


17 


391 


17 


■JQ/t 


8q23 


1QK 

oyj 


I5q24-q2o 


jyo 


15q22 


Jy/ 


7q34-q36 


100 


2 


AAA 


oqil-zZ 




oqzl-zz 


/I AO 


14q24.3 


AA1 




404 


1 1 nl 1 

l iqi j 


405 


17 


407 


14 


408 


10 


409 


10q23-q24 


410 


19 


411 


1J 


412 


11p15


413 


12q13.2-q13.3


414 


3p13-q26.1


416 


17 
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WO 02/070539 







SEQ ID


Chromosomal location


421 


5q35 


422 


3 


423 


12 


424 


3p24.3 


425 


19 


426 


6p2M-21.2 


427 


6 


428 


22q13.2-q13.31


429 


11 


430 


2 


432 


14q31 


433 


17 


434 


22 


436 


7q35 


440 


8p11.2


441 


11q12


443 


5


444 


1 


445 


13q13


446 


17 


447 


17 


448 


1p35.2-36.13


449 


6 


450 


15q24-q25 


451 


19 


452 


4 


453 


2 


454 


11


455 


17 


456 


10 


457 


p31.3-32.2


458 


1 


459 


7p13-p11.2


460 


12 


461 


18p11.2


462


17 


463 


22q11.2


464 


16 


466 


17 


467 


11q23


468 


3 


469 


7q32 


470 


1 


472 


19 


473 


19 


474 


p33-34.3 


475 


7q36 




2 


477 


9p24.1-24.3 


478 


6p22.1-22.3


479 


20 


480 


22q12.3-13.1


481 


16q22.1-q22.3 


483 


19 


484 


19 


489 


6 


491 


1q21-q22
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SEQID 


Chromosomal location


492 


4q26-q27 


493 


2p13


494 


7


495 


3 


496 


5 


497 


17q23.2-q25.3 


498 


11q


499 


4p15.31


500 


10 


501 


8q24.3 


503 


18 


504 


2 


505 


19 


506 


3p14.3


507 


14 


508 


11p15.5


509 


11 


510 


15q25 


511 


14q21.1-q22.3 


512 


13q11


513 


10 


514 


Xq28 


515 


15q15


516 


19p13.3


518 


14q21 






519 


13 


520 


12 


521 


12 


522 


20q12-q13.12


523 


6q22.1-22.33 


524 


12 


526 


16 


527 


17 


528 


3 


529 


6p21.3


530 


1q42.13-43


531 


16p 


532 


17 


533 


16q24.1 


534 


19p13.3-p13.2


535 


19p13.3-p13.2


536 


13 


537 


17 


538 


12 


539 


1q21.2-q22


540 


17 


541 


18q12


542 


16 


545 


2p23.3-q34 


546 


14q21.1-q22.3 


547 


2p12


548 


17q11-qter


549 


4 


550 


17 


551 


9 


552 


17q24-q25 






SEQ ID 


Chromosomal location


553


12


554 


5 


555 


17 


556 


10q23.3


557 


9 


558


9 


559 


X 


560 


20 


562 


16 


563 


15 


564 


3 


565 


4 


566 


6p22.1-23 


567 


X 


568 


5 


570 


19q13.4


572 


1p36.23-p36.21


573 


6 


574 


6p11.2-12.3


575 


8 


577 


12 


578 


17 


579 


10 


580 


6p21 


581 


2 


582 


3q25.1-q25.2 


583 


3q25.1-q25.2 


584 


19 


585 


19p13.3


586 


15 


587 


15 


588 


4q25 


589 


9p24 


590 


17 


591 


20q13.11-13.2


592


X 


593 


2p23.3-q24.3 


594 


6p22.1 


595 


1 


596


1 


597


2q31-q37 


596 


Ip3o.I 


599 


2p11.2-q11.2


600


19p13.1


<c\i 
0U1 


2 


602


17p13.3




15q 




1 1 


606 


1p36


607 


22q11.21


608 


3 


609 


16 


610 


1 


611 


6 


613 


12 


615 


2p24.3 


616 


20q11.2






SEQ ID


Chromosomal location


617 


16 


618 


17 


619 


8q22 


620 


17q23.1 


621 


14 


622 


8 


624 


17 


625 


5 


626 


17 


628 


19 


629 


17q11


630 


3p21.1-q13.13


631 


20 


632 


20 


634 


6 


636 


6 


637 


2 


638 


20q13


639 


8q22-q23 


640 


g 


642 


5q21-q22 


643 


Xq26


644 


22 


645 


11q25


646 


2 


647 


2p23.3-q34 


648 


5 


651 


X


652 


3 


653 


3p13-q26.1


654 


12q 


655 


12q 


656 


17 


657 


17 


658 


12 


659 


12 


660 


9q31 


661 


11 


663 


6 


664 


2p23.3-q24.3 


666 


10 


667 


21q 


668 


11 


669 


11 


670 


21q22.3 


671 


4p16.1


672 


11 


673 


12 


675 


19 


676 


19 


677 


6p24-25 


678 


5 


679 


X 


680 


3p21.1-q12.3


681 


5 


683 


12p13.1-p12.3


684 


17q 
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SEQ ID 


Chromosomal location


685 


19 


686 


19 


687 


19 


688 


19 


689 


7q31.1-q31.3 


691 


17 


692 


17 


694 


X 


695 


17 


697 


7p14.3-p14.1


698 


5 


699 


18q12-q21


700 


14q24.3


701 


17 


702 


17 


703 


1 


704 


20p13


705 


6pter-p21.1 


706 


17 


708 


8 


711 


1 


712 


2p24.3-p24.1 


713 


16 


714 


18q21 


715 


17 


716 


1q32


717 


1q12-21.1


718 


18 


720 


17 


721 


21 


722 


11 


723 


15 


724 


5 


726 


7 


727 


17q21.3 


728 


11 


729 


5 


730 


12 


731 


4


732 


12 


733 


15q15


734 


8q24.3 


735


Xq26.3-27.3 


736 


19q13.2


737 


21q22.1 


738 


5 


740 


13 


7/19 


1 ml A. ^ C 

ipJo.2-pj J 


743 


2 


744 


2 


746 


1 


747 


2q32-q33 


748 


2p23 


749 


13 


750 


4 


751 


17p11.2


752 


1p13
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SEQ ID 


Chromosomal location


753 


5 


754 


11q13.1-q13.3


755 


19 


756 


5pter-p13.3


757 


10 


758 


19q13


760 


19 


761 


14 


762 


19 


763 


X 


765 




766 


9q34.3 


767 


17 


769 


< 

j 


770 


id 
it 


771 




772 




773 


17 


774 


Q 
O 


775 


a 


776 


15 


777 


0 
o 


778 

f/O 


O 


779 


17 


780 


1 S 


781 


14 
i*t 


787 


1 1 
1 1 


783 




784 


Rnl 1 7 


785 


c 
o 


786 


c 

o 


787 


17 


788 


3 


789 


ln42 


790 


7q11.21-q11.23


791 


11 


792 


3d13-o13 2 


793 


9 


794 


11q12


795 


1p32.2-34.2


797 


18


798 


18 


799 


11q13


800 


17 


801 


7p15-p21


802 


17 


803 


17p13.1


805 


17q25.3 


806 


17q25 


808 


5 


809 


7p15-p14


810 


9q34.2-q34.3 


811 


1 


812 


6 


813 


8 


814 


17 


815 


20 



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SEQ ID 


Chromosomal location


816 


7q34-q36 


817 


j4q21.1-q/1 .3 


Ol o 
015 


1p32.1-33


819


5 


820 


6p21.3 


821 


17 


822 


15 


823 


5 


824 


19 


825 


1p32.3


826 


11 


827 


14 


828 


p34.1-34.3 


829 


16 


830 


8p11.2


831 


17q21.3-17q22


833 


17 


834 


7p 


835 


21 


836 


10cen-q26.11 


837 


19 


838 


5 


840 


10 


841 


7q11-q22


842 


11 


843 


17 


844 


3 


845 


17 


846 


17 


847 


17 


848


10 


849 


6 


850 


5q 


851


5 


853 


7q35-qter 


854 


19 


855 


19 


857


8 


858 


16 


860 


10 


861


19 




1 Ow1 1 o 

lopl 1.2 


864


17


OOO 


15qlo 


867


7 


868


12 


869


1 


870




871 


16 


872 


16p13.3


873 


17q12-q21


874 


11q13.5


875 


11cen-q12.1


876 


16q13


877 


X 


878 


1q21-q23


879 


Xq22.1-q22.3



880 


1p31.2-32.3


881 


19q13.3-q13.4


883 


3d 


884 


7 


885 


14q32 


886 


2 


887 




888 


12 


889 


15 


890 


18 


891 


17 


892 


Xq21.33-22.3


893 


6d21 32-22 2 


894 


11 


895 


7q33-q34


897 


13 


898 


15 


got 


1 


909 


14 






905


7 1 n?7 3 


907 


10 


90Q 
y\iy 


Y 


910 




911


70 


912


1 ! 
l 


913


1R 

I 0 


914 


A 
u 


915


10 


916 


13


917 


17 


919 


15q15


920 


20 


921 


22q12.3


922 


16 


923 


11 


924 


2q33.3 


927 


9 


928 


2q21 


929 


19 


930 


18 


931 


16 


932 


16 


934 


11p15


936 


7q35 


937 


6q22.1-22.33


939 


17 


940 


17 


941 


17 


942 


4 


943 


1p36.31-p36.13


944 


9 


945 


12 


946 


19p13.3


947 


22q11.2


948 


5 



TABLE 8



SEQ ID NO of Full-length 
Nucleotide Sequence 


SEQ ID NO of Full-length 
Peptide Sequence 


SEQ ID NO in Priority Application 
USSN 09/799,451 


1 


949 


1 


2 


950 


2 


3 


951 


3 


4 


952 


4 


5 


953 


5 


6 


954 


6 


7 


955 


7 


8 


956 


8 


9 


957 


9 


10 


958 


10 


11 


959 


11 


12 


960 


12 


13 


961 


13 


14 


962 


14 


15 


963 


15 


16 


964 


16 


17 


965 


17 


18 


966 


18 


19 


967 


19


20 


968 


20 


21 


969 


21 


22 


970 


22 


23 


971 


23 


24 


972 


24 


25 


973 


25 


26 


974 


26 


27 


975 


27 


28 


976 


28 


29 


977 


29


30 


978 


30 


31 


979 


31 


32 


980 


32 


33 


981 


33 


34 


982 


34 


35 


983 


35 


36 


984 


36 


37 


985 


37 


38 


986 


38 


39 


987 


39 


40 


988 


40 


41 


989 


41 


42 


990 


42 


43 


991 


43


44


992


44


45 


993 


45 


46 


994 


46 


47 


995 


47 


48 


996 


48 


49 


997 


49


50 


998 


50


51 


999 


51 


52 


1000 


52 


53 


1001 


53 


54 


1002 


54 



55


1003


55


56


1004


56


57


1005


57


58


1006


58


59


1007


59


60


1008


60


61


1009


61


62


1010


62


63


1011


63


64


1012


64


65


1013


65


66


1014


66


67


1015


67


68


1016


68


69


1017


69


70


1018


70


71


1019


71


72


1020


72


73


1021


73


74


1022


74


75


1023


75


76


1024


76


77


1025


77


78


1026


78


79


1027


79


80


1028


80


81


1029


81


82


1030


82


83


1031


83


84


1032


84


85


1033


85


86


1034


86


87


1035


87


88


1036


88


89


1037


89


90


1038


90


91


1039


91


92


1040


92


93


1041


93


94


1042


94


95


1043


95


96


1044


96


97


1045


97


98


1046


98


99


1047


99


100 


1048 


100


101 


1049 


101


102 


1050 


102 


103


1051 


103 


104 


1052 


104 


105 


1053 


105 


106 


1054 


106 


107 


1055 


107 


108 


1056 


108 


109 


1057 


109 


110 


1058 


110


111 


1059 


111 


112 


1060 


112 


113 


1061 


113 


114 


1062 


114 


115 


1063 


115


116 


1064 


116 


117 


1065 


117 


118 


1066 


118 


119 


1067 


119 


120 


1068 


120 


121 


1069 


121 


122 


1070 


122 


123 


1071 


123 


124 


1072 


124 


125 


1073 


125 


126 


1074 


126 


127 


1075 


127 


128 


1076 


128 


129 


1077 


129 


130 


1078 


130


131 


1079 


131


132 


1080 


132 


133 


1081 


133 


134 


1082 


134 


135 


1083 


135


136 


1084 


136 


137 


1085 


137 


138 


1086 


138 


139 


1087 


139 


140 


1088 


140 


141


1089 


141 


142 


1090 


142 


143 


1091 


143 


144 


1092 


144 


145 


1093 


145 


146 


1094 


146 


147 


1095 


147 


148 


1096 


148 


149 


1097 


149 


150 


1098 


150 


151 


1099 


151 


152 


1100 


152 


153 


1101 


153 


154 


1102 


154 


155


1103 


155 


156 


1104 


156 


157


1105 


157


158 


1106 


158 


159 


1107 


159 


160 


1108 


160 


161 


1109 


161 


162 


1110 


162


163 


1111 


163 


164


1112 


164 


165 


1113 


165 


166 


1114 


166


167


1115 


167 


168


1116 


168 


169 


1117 


169 


170


1118 


170 


171 


1119 


171 


172 


1120 


172 


173 


1121 


173 


174 


1122 


174 


175 


1123 


175 


176 


1124 


176 


177 


1125 


177 


178 


1126 


178 


179 


1127 


179 


180 


1128 


180 


181 


1129 


181 


182 


1130 


182 


183 


1131 


183 


184 


1132 


184 


185


1133 


185 


186 


1134 


186 


187 


1135 


187


188 


1136 


188 


189 


1137 


189 


190 


1138 


190 


191 


1139 


191


192 


1140 


192 


193 


1141 


193 


194 


1142 


194 


195 


1143 


195 


196 


1144


196 


197 


1145 


197 


198 


1146


198 


199 


1147 


199 


200 


1148 


200 


201 


1149


201 


202 


1150 


202 


203 


1151 


203


204 


1152 


204 


205 


1153 


205


206


1154


206


207


1155


207


208


1156


208


209


1157


209


210


1158


210


211


1159


211


212


1160


212


213


1161


213


214 


1162 


214 


215 


1163 


215 


216 


1164 


216 


217 


1165 


217


218 


1166 


218 


219


1167 


219 


220 


1168 


220 


221 


1169 


221


222 


1170 


222 



223


1171


223


224


1172


224


225


1173


225


226


1174


226


227


1175


227


228


1176


228


229


1177


229


230


1178


230


231


1179


231


232


1180


232


233


1181


233


234


1182


234


235 


1183 


235 


236 


1184 


236 


237 


1185 


237 


238 


1186 


238 


239 


1187 


239 


240 


1188 


240 


241 


1189 


241 


242 


1190 


242 


243 


1191 


243 


244 


1192 


244 


245 


1193 


245 


246 


1194 


246 


247 


1195 


247 


248


1196


248


249


1197


249


250


1198


250


251


1199


251


252


1200


252


253


1201


253


254


1202


254


255


1203


255


256


1204


256


257


1205


257


258


1206


258


259


1207


259


260


1208


260


261


1209


261


262


1210


262


263


1211


263


264


1212


264


265


1213


265


266


1214


266


267


1215


267


268


1216


268


269


1217


269


270 


1218 


270 


271 


1219 


271 


272 


1220 


272 


273 


1221 


273 


274 


1222 


274 


275 


1223 


275 


276 


1224 


276 


277 


1225 


277 


278 


1226 


278 



279 


1227 


279 


280 


1228 


280 


281 


1229 


281 


282 


1230 


282 


283 


1231 


283 


284 


1232 


284


285 


1233 


285 


286 


1234 


286 


287 


1235 


287 


288 


1236 


288 


289 


1237 


289 


290 


1238 


290 


291 


1239 


291 


292 


1240 


292 


293 


1241 


293 


294 


1242 


294 


295 


1243 


295 


296 


1244 


296 


297 


1245 


297 


298 


1246 


298 


299 


1247 


299 


300 


1248 


300 


301 


1249 


301 


302 


1250 


302 


303 


1251 


303 


304 


1252 


304 


305 


1253 


305 


306 


1254 


306 


307 


1255 


307 


308 


1256


308 


309 


1257 


309 


310 


1258


310 


311 


1259 


311 


312 


1260 


312 


313 


1261 


313 


314 


1262 


314 


315 


1263 


315 


316 


1264 


316 


317 


1265 


317 


318 


1266 


318 


319 


1267


319 


320 


1268 


320 


321 


1269 


321 


322 


1270 


322 


323 


1271 


323 


324 


1272 


324 


325


1273


325 


326 


1274 


326 


327 


1275 


327 


328 


1276 


328 


329 


1277 


329 


330 


1278 


330 


331 


1279 


331 


332 


1280 


332 


333 


1281 


333 


334 


1282 


334 



335 


1283 


335 


336 


1284 


336 


337 


1285 


337 


338 


1286 


338 


339 


1287 


339 


340 


1288 


340 


341 


1289 


341


342 


1290 


342 


343 


1291 


343 


344 


1292 


344 


345 


1293 


345 


346 


1294 


346 


347 


1295 


347 


348 


1296 


348 


349 


1297 


349 


350 


1298 


350 


351 


1299 


351 


352 


1300 


352 


353 


1301 


353 


354 


1302 


354 


355 


1303 


355 


356 


1304 


356 


357 


1305 


357 


358 


1306 


358 


359 


1307 


359 


360 


1308 


360 


361 


1309 


361 


362 


1310 


362 


363 


1311 


363 


364 


1312 


364 


365 


1313 


365 


366 


1314 


366 


367 


1315 


367 


368 


1316 


368 


369 


1317 


369 


370 


1318 


370 


371 


1319 


371 


372 


1320 


372 


373 


1321 


373 


374 


1322 


374 


375 


1323 


375 


376 


1324 


376 


377 


1325 


377 


378 


1326 


378 


379 


1327 


379 


380 


1328 


380 


381 


1329 


381 


382 


1330 


382 


383 


1331 


383 


384 


1332 


384 


385 


1333 


385 


386 


1334


386 


387 


1335 


387 


388 


1336 


388 


389 


1337 


389 


390 


1338 


390 



391 


1339 


391 


392 


1340 


392 


393 


1341 


393 


394 


1342 


394 


395 


1343 


395 


396 


1344


396


397 


1345 


397 


398 


1346 


398 


399 


1347 


399 


400 


1348 


400 


401 


1349 


401 


402 


1350 


402 


403 


1351 


403 


404 


1352 


404 


405 


1353 


405 


406 


1354 


406 


407 


1355 


407 


408 


1356 


408 


409 


1357 


409 


410 


1358 


410 


411 


1359 


411 


412 


1360 


412 


413 


1361 


413 


414 


1362 


414 


415 


1363 


415 


416 


1364 


416 


417 


1365 


417 


418 


1366 


418 


419 


1367 


419 


420 


1368 


420 


421 


1369 


421 


422 


1370 


422 


423 


1371 


423 


424 


1372 


424 


425 


1373 


425 


426 


1374 


426 


427 


1375 


427 


428 


1376 


428 


429 


1377 


429 


430 


1378 


430 


431 


1379 


431 


432 


1380 


432


433 


1381 


433 


434 


1382 


434 


435 


1383 


435 


436 


1384 


436 


437 


1385 


437 


438 


1386 


438 


439 


1387 


439 


440 


1388 


440 


441 


1389 


441 


442 


1390 


442 


443 


1391 


443 


444 


1392 


444 


445 


1393 


445 


446 


1394 


446 



447 


1395 


447 


448 


1396


448 


449 


1397 


449 


450 


1398 


450 


451 


1399 


451 


452 


1400 


452 


453 


1401 


453 


454 


1402 


454 


455 


1403 


455 


456 


1404 


456 


457 


1405 


457 


458 


1406 


458 


459 


1407


459 


460 


1408 


460 


461 


1409 


461 


462 


1410 


462 


463 


1411 


463 


464 


1412 


464 


465 


1413 


465 


466 


1414 


466 


467 


1415 


467 


468 


1416 


468 


469 


1417 


469 


470 


1418 


470 


471 


1419 


471 


472 


1420 


472 


473 


1421 


473 


474 


1422 


474 


475 


1423 


475 


476 


1424 


476 


477 


1425 


477 


478 


1426 


478 


479 


1427 


479 


480 


1428 


480 


481 


1429 


481 


482 


1430 


482 


483 


1431 


483 


484 


1432 


484 


485 


1433 


485 


486 


1434 


486 


487 


1435 


487 


488 


1436 


488 


489 


1437 


489 


490 


1438 


490 


491 


1439 


491 


492 


1440 


492 


493 


1441 


493 


494 


1442 


494 


495 


1443 


495 


496 


1444 


496 


497 


1445 


497 


498 


1446 


498 


499 


1447 


499 


500 


1448 


500 


501 


1449 


501 


502 


1450 


502 



503


1451 


503


504


1452


504 


505


1453 


505 


506


1454 


506 


507 


1455 


507 


508 


1456 


508 


509 


1457 


509 


510 


1458 


510 


511


1459 


511 


512 


1460 


512 


513 


1461


513 


514 


1462 


514 


515 


1463 


515 


516 


1464 


516 


517 


1465 


517


518 


1466 


518 


519 


1467 


519 


520 


1468 


520 


521 


1469 


521 


522 


1470


522 


523 


1471


523 


524 


1472 


524 


525 


1473 


525 


526 


1474 


526 


527 


1475 


527 


528 


1476 


528 


529 


1477 


529 


530 


1478


530 


531 


1479 


531 


532 


1480 


532 


533 


1481 


533 


534 


1482 


534 


535 


1483


535


536 


1484


536 


537 


1485 


537 


538 


1486 


538 


539 


1487


539 


540 


1488 


540


541


1489


541


542


1490


542


543


1491


543


544


1492


544


545


1493


545


546


1494


546


547


1495


547


548


1496


548


549


1497


549


550 


1498 


550 


551


1499 


551 


552 


1500 


552 


553 


1501 


553 


554 


1502 


554 


555 


1503 


555 


556 


1504 


556 


557 


1505 


557 


558 


1506 


558



559


1507


559


560


1508


560


561


1509


561


562


1510


562


563


1511


563


564


1512


564


565


1513


565


566


1514


566


567


1515


567


568


1516


568


569


1517


569


570


1518


570


571


1519


571


572


1520


572


573


1521


573


574


1522


574


575


1523


575


576


1524


576


577


1525


577


578


1526


578


579


1527


579


580


1528


580


581


1529


581


582


1530


582


583


1531


583


584


1532


584


585


1533


585


586


1534


586


587


1535


587


588


1536


588


589


1537


589


590


1538


590


591


1539


591


592


1540


592


593


1541


593


594


1542


594


595


1543


595


596


1544


596


597


1545


597


598


1546


598


599


1547


599


600


1548


600


601


1549


601


602


1550


602


603


1551


603


604


1552


604


605 


1553 


605 


606 


1554 


606 


607 


1555 


607 


608 


1556 


608 


609 


1557 


609 


610 


1558 


610 


611 


1559 


611 


612 


1560 


612 


613 


1561 


613 


614 


1562 


614 



615 


1563 


615 


616 


1564 


616 


617 


1565 


617 


618 


1566 


618 


619 


1567 


619 


620 


1568 


620 


621 


1569 


621 


622 


1570 


622 


623 


1571 


623 


624 


1572 


624 


625 


1573 


625 


626 


1574 


626 


627 


1575 


627 


628 


1576 


628 


629 


1577 


629 


630 


1578 


630 


631 


1579 


631 


632 


1580 


632 


633 


1581 


633 


634 


1582 


634 


635 


1583 


635 


636 


1584 


636 


637 


1585 


637 


638 


1586 


638 


639 


1587 


639 


640 


1588 


640 


641 


1589 


641 


642 


1590 


642 


643 


1591 


643 


644 


1592 


644 


645 


1593 


645


646 


1594 


646


647 


1595 


647


648 


1596 


648 


649 


1597 


649 


650 


1598 


650 


651 


1599 


651 


652 


1600 


652 


653 


1601 


653 


654 


1602 


654 


655 


1603 


655 


656 


1604 


656 


657 


1605 


657 


658 


1606 


658 


659 


1607 


659 


660 


1608 


660 


661


1609 


661 


662 


1610 


662 


663 


1611


663 


664 


1612 


664 


665 


1613 


665 


666 


1614 


666 


667 


1615 


667 


668 


1616 


668


669 


1617 


669 


670


1618 


670 







671


1619


671


672


1620


672


673


1621


673


674


1622


674


675


1623


675


676


1624


676


677


1625


677


678


1626


678


679


1627


679


680


1628


680


681


1629


681


682


1630


682


683


1631


683


684


1632


684


685


1633


685


686


1634


686


687


1635


687


688


1636


688


689


1637


689


690


1638


690


691


1639


691


692


1640


692


693


1641


693


694


1642


694


695


1643


695


696


1644


696


697


1645


697


698


1646


698


699


1647


699


700


1648


700


701


1649


701


702


1650


702


703


1651


703


704


1652


704


705


1653


705


706


1654


706


707


1655


707


708


1656


708


709


1657


709


710


1658


710


711


1659


711


712 


1660 


712 


713 


1661 


713 


714 


1662 


714 


715 


1663 


715 


716


1664 


716 


717 


1665 


717 


718 


1666 


718 


719 


1667 


719 


720 


1668 


720 


721 


1669 


721 


722 


1670 


722 


723 


1671 


723 


724


1672 


724 


725 


1673 


725 


726 


1674 


726 



727 


1675 


727 


728


1676 


728 


729 


1677 


729 


730 


1678 


730 


731 


1679 


731 


732 


1680 


732 


733 


1681 


733 


734 


1682 


734 


735 


1683 


735 


736 


1684 


736 


737 


1685 


737 


738 


1686 


738 


739 


1687 


739 


740 


1688 


740 


741 


1689 


741 


742 


1690 


742 


743 


1691 


743 


744 


1692 


744 


745 


1693 


745 


746 


1694 


746 


747 


1695 


747 


748 


1696 


748 


749 


1697 


749 


750 


1698 


750 


751 


1699 


751 


752 


1700 


752 


753 


1701 


753 


754 


1702 


754 


755 


1703 


755 


756 


1704 


756 


757 


1705 


757 


758 


1706 


758 


759 


1707 


759 


760 


1708 


760 


761 


1709 


761 


762 


1710 


762 


763 


1711 


763 


764 


1712 


764 


765


1713


765


766


1714


766


767


1715


767


768


1716


768


769


1717


769


770


1718


770


771


1719


771


772


1720


772


773


1721


773


774


1722


774


775


1723


775


776


1724 


776 


777 


1725 


777 


778 


1726 


778 


779 


1727 


779 


780 


1728 


780 


781 


1729 


781 


782 


1730 


782 



783


1731


783


784


1732


784


785


1733


785


786


1734


786


787


1735


787


788


1736


788


789


1737


789


790


1738


790


791


1739


791


792


1740


792


793


1741


793


794


1742


794


795


1743


795


796


1744


796


797


1745


797


798


1746


798


799


1747


799


800


1748


800


801


1749


801


802


1750


802


803


1751


803


804


1752


804


805


1753


805


806


1754


806


807


1755


807


808


1756


808


809


1757


809


810


1758


810


811


1759


811


812


1760


812


813


1761


813


814


1762


814


815


1763


815


816


1764


816


817


1765


817


818


1766


818


819


1767


819


820 


1768 


820 


821 


1769 


821 


822 


1770 


822 


823 


1771 


823 


824 


1772 


824 


825 


1773 


825 


826 


1774 


826 


827 


1775 


827 


828 


1776 


828 


829 


1777 


829 


830 


1778 


830 


831 


1779 


831 


832 


1780 


832 


833 


1781 


833 


834 


1782 


834 


835 


1783 


835 


836 


1784 


836 


837 


1785 


837 


838 


1786 


838


839


1787 


839 


840 


1788 


840 


841 


1789 


841 


842 


1790 


842 


843 


1791 


843 


844 


1792 


844 


845 


1793 


845 


846 


1794 


846 


847 


1795 


847 


848 


1796 


848 


849 


1797 


849 


850 


1798 


850 


851 


1799 


851 


852 


1800 


852 


853 


1801 


853 


854 


1802 


854 


855 


1803 


855 


856 


1804 


856 


857 


1805 


857 


858 


1806 


858 


859 


1807 


859 


860 


1808 


860 


861 


1809 


861 


862 


1810 


862 


863 


1811 


863 


864 


1812 


864 


865 


1813 


865 


866 


1814 


866 


867 


1815 


867 


868 


1816 


868 


869 


1817 


869 


870 


1818 


870 


871 


1819 


871 


872 


1820 


872 


873 


1821 


873 


874 


1822 


874 


875 


1823 


875 


876 


1824 


876 


877 


1825 


877 


878 


1826 


878 


879 


1827 


879 


880 


1828 


880 


881 


1829 


881 


882 


1830 


882 


883 


1831 


883 


884 


1832 


884 


885 


1833 


885 


886 


1834 


886 


887 


1835 


887 


888 


1836 


888 


889 


1837 


889 


890 


1838 


890 


891 


1839 


891 


892 


1840 


892 


893 


1841 


893 


894 


1842 


894


895


1843


895


896


1844


896


897


1845


897


898


1846


898


899


1847


899


900


1848


900


901


1849


901


902


1850


902


903


1851


903


904 


1852 


904 


905 


1853 


905 


906 


1854 


906 


907 


1855 


907 


908 


1856 


908 


909 


1857 


909 


910 


1858 


910 


911 


1859 


911 


912 


1860 


912 


913 


1861 


913 


914 


1862 


914 


915 


1863 


915 


916 


1864 


916 


917 


1865 


917 


918 


1866 


918 


919 


1867 


919 


920 


1868 


920 


921 


1869 


921 


922 


1870 


922 


923 


1871 


923 


924 


1872 


924 


925 


1873 


925 


926 


1874 


926 


927 


1875 


927 


928


1876


928


929


1877


929


930 


1878 


930 


931 


1879 


931 


932 


1880 


932 


933 


1881 


933 


934 


1882 


934 


935


1883


935


936


1884


936


937


1885


937


938


1886


938


939


1887


939


940


1888


940


941 


1889 


941 


942 


1890 


942 


943 


1891 


943 


944 


1892 


944 


945 


1893 


945 


946 


1894 


946 


947 


1895 


947 


948 


1896 


948 
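
The rows of Table 8 follow a regular pattern that can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the offset of 948 is inferred from the table rows and from claim 9 (nucleotide SEQ ID NO: 1-948, polypeptide SEQ ID NO: 949-1896) and is not stated as a formula in the text, and the function names are hypothetical.

```python
# Sketch only: the +948 offset is inferred from the rows of Table 8 and from
# claim 9; it is not stated as a formula in the text.
SEQUENCE_COUNT = 948  # nucleotide SEQ ID NOs run from 1 to 948 (claim 1)


def peptide_seq_id(nucleotide_id: int) -> int:
    """Hypothetical helper: peptide SEQ ID NO listed beside a nucleotide SEQ ID NO."""
    if not 1 <= nucleotide_id <= SEQUENCE_COUNT:
        raise ValueError(f"nucleotide SEQ ID NO must be in 1-{SEQUENCE_COUNT}")
    return nucleotide_id + SEQUENCE_COUNT


def nucleotide_seq_id(peptide_id: int) -> int:
    """Hypothetical inverse: nucleotide SEQ ID NO for a peptide SEQ ID NO (949-1896)."""
    if not SEQUENCE_COUNT < peptide_id <= 2 * SEQUENCE_COUNT:
        raise ValueError("peptide SEQ ID NO must be in 949-1896")
    return peptide_id - SEQUENCE_COUNT
```

For example, `peptide_seq_id(1)` gives 949, matching the first row of Table 8, and `nucleotide_seq_id(1896)` gives 948, matching the last row.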
CLAIMS

WHAT IS CLAIMED IS: 

1. An isolated polynucleotide comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1 - 948, a mature protein coding portion of SEQ ID NO: 1 - 948, 
an active domain coding protein of SEQ ID NO: 1 - 948, and complementary sequences 
thereof. 

2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein 
said polynucleotide has greater than about 90% sequence identity with the polynucleotide of 
claim 1. 

3. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

4. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

5. A vector comprising the polynucleotide of claim 1.

6. An expression vector comprising the polynucleotide of claim 1.

7. A host cell genetically engineered to comprise the polynucleotide of claim 1.

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 
operatively associated with a regulatory sequence that modulates expression of the 
polynucleotide in the host cell. 

9. An isolated polypeptide, wherein the polypeptide is selected from the group consisting
of a polypeptide encoded by any one of the polynucleotides of claim 1 (i.e. SEQ ID NO:
949-1896).

10. A composition comprising the polypeptide of claim 9 and a carrier.

11. An antibody directed against the polypeptide of claim 9.

12. A method for detecting the polynucleotide of claim 1 in a sample, comprising:

a) contacting the sample with a compound that binds to and forms a
complex with the polynucleotide of claim 1 for a period sufficient to form the complex; and

b) detecting the complex, so that if a complex is detected, the
polynucleotide of claim 1 is detected.

13. A method for detecting the polynucleotide of claim 1 in a sample, comprising:

a) contacting the sample under stringent hybridization conditions with
nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions;

b) amplifying a product comprising at least a portion of the
polynucleotide of claim 1; and

c) detecting said product and thereby the polynucleotide of claim 1 in the
sample.

14. The method of claim 13, wherein the polynucleotide is an RNA molecule and the
method further comprises reverse transcribing an annealed RNA molecule into a cDNA
polynucleotide.

15. A method for detecting the polypeptide of claim 9 in a sample, comprising:

a) contacting the sample with a compound that binds to and forms a
complex with the polypeptide under conditions and for a period sufficient to form the
complex; and

b) detecting formation of the complex, so that if a complex formation is
detected, the polypeptide of claim 9 is detected.

16. A method for identifying a compound that binds to the polypeptide of claim 9,
comprising:

a) contacting the compound with the polypeptide of claim 9 under
conditions sufficient to form a polypeptide/compound complex; and

b) detecting the complex, so that if the polypeptide/compound complex is
detected, a compound that binds to the polypeptide of claim 9 is identified.

17. A method for identifying a compound that binds to the polypeptide of claim 9,
comprising:

a) contacting the compound with the polypeptide of claim 9, in a cell,
under conditions sufficient to form a polypeptide/compound complex, wherein the complex
drives expression of a reporter gene sequence in the cell; and

b) detecting the complex by detecting reporter gene sequence expression,
so that if the polypeptide/compound complex is detected, a compound that binds to the
polypeptide of claim 9 is identified.

18. A method of producing the polypeptide of claim 9, comprising,

a) culturing a host cell comprising a polynucleotide sequence selected
from the group consisting of a polynucleotide sequence of SEQ ID NO: 1-948, a mature
protein coding portion of SEQ ID NO: 1-948, an active domain coding portion of SEQ ID
NO: 1-948, complementary sequences thereof, under conditions sufficient to express the
polypeptide in said cell; and

b) isolating the polypeptide from the cell culture or cells of step (a).

19. An isolated polypeptide comprising an amino acid sequence selected from the group
consisting of any one of the polypeptides from the Sequence Listing, the mature protein
portion thereof, or the active domain thereof.

20. The polypeptide of claim 21 wherein the polypeptide is provided on a polypeptide 
array. 

21. A collection of polynucleotides, wherein the collection comprising the sequence 
information of at least one of SEQ ID NO: 1 - 948. 

22. The collection of claim 21, wherein the collection is provided on a nucleic acid array. 

23. The collection of claim 22, wherein the array detects full-matches to any one of the 
polynucleotides in the collection. 



24. The collection of claim 22, wherein the array detects mismatches to any one of the 
polynucleotides in the collection. 

25. The collection of claim 21, wherein the collection is provided in a computer-readable
format.

26. A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising a polypeptide of claim 9 or 19 and 
a pharmaceutically acceptable carrier. 


27. A method of treatment comprising administering to a mammalian subject in need
thereof a therapeutic amount of a composition comprising an antibody that specifically binds
to a polypeptide of claim 9 or 19 and a pharmaceutically acceptable carrier.